Baseten Plugin Update: fix metadata schema, add chain_id support, and improve response parsing #4889

Merged
davidzhao merged 12 commits into livekit:main from jiegong-fde:jieg/update_b10_plugin
Mar 29, 2026

Conversation

@jiegong-fde
Contributor

@jiegong-fde jiegong-fde commented Feb 18, 2026

Summary

Updates the Baseten STT plugin to align with Baseten's current Streaming Transcription API, adds ergonomic endpoint configuration via model_id / chain_id parameters, and significantly expands the README documentation.
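
As a rough illustration of what endpoint configuration via model_id / chain_id could look like, here is a minimal sketch. The helper name `resolve_ws_url`, the `model_endpoint` override, and both URL templates are assumptions made for this example and are not taken from the PR; check Baseten's documentation for the actual hostnames.

```python
from typing import Optional

# Assumed URL templates (hypothetical); verify against Baseten's docs before use.
_MODEL_URL = "wss://model-{id}.api.baseten.co/environments/production/websocket"
_CHAIN_URL = "wss://chain-{id}.api.baseten.co/environments/production/websocket"


def resolve_ws_url(
    model_id: Optional[str] = None,
    chain_id: Optional[str] = None,
    model_endpoint: Optional[str] = None,
) -> str:
    """Hypothetical helper: derive a WebSocket URL from exactly one identifier."""
    provided = [v for v in (model_id, chain_id, model_endpoint) if v]
    if len(provided) != 1:
        raise ValueError("pass exactly one of model_id, chain_id, or model_endpoint")
    if model_endpoint:
        # An explicit URL is used as-is.
        return model_endpoint
    if chain_id:
        return _CHAIN_URL.format(id=chain_id)
    return _MODEL_URL.format(id=model_id)
```

Requiring exactly one identifier keeps an ambiguous configuration from silently picking one deployment over another.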

Motivation: The existing plugin had several issues that prevented it from working correctly with Baseten's streaming ASR API:

  • Wrong metadata field names — The plugin sent vad_params and streaming_whisper_params in the WebSocket metadata, but Baseten's StreamingWhisperInput schema (which uses extra="forbid") expects whisper_params, streaming_params, streaming_vad_config, and streaming_diarization_config. This caused the connection to be rejected outright.

  • No chain deployment support — Users had to manually construct WebSocket URLs. There was no way to specify a chain ID, which is the recommended deployment type for Baseten's streaming ASR.

  • Missing streaming parameters — Options like enable_partial_transcripts, partial_transcript_interval_s, show_word_timestamps, and final_transcript_max_duration_s were not exposed, limiting configurability.

  • Incomplete response parsing — The plugin didn't handle chain responses (which lack a top-level transcript field and include a "type": "transcription" wrapper), nor did it extract word-level timestamps from word_timestamps within segments.
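
The first and last points above can be sketched roughly as follows. The four top-level metadata keys come from the description; everything else (the function names, the `segments` / `text` layout of a message, and the shape of each word entry) is an assumption for illustration and is not the plugin's actual code.

```python
from __future__ import annotations

from typing import Any

# Top-level metadata keys that the description says the schema expects.
ALLOWED_KEYS = {
    "whisper_params",
    "streaming_params",
    "streaming_vad_config",
    "streaming_diarization_config",
}


def check_metadata(metadata: dict[str, Any]) -> None:
    """Mimic an extra="forbid" schema: reject any unknown top-level key."""
    unknown = sorted(set(metadata) - ALLOWED_KEYS)
    if unknown:
        raise ValueError(f"unexpected metadata keys: {unknown}")


def extract_transcript(msg: dict[str, Any]) -> tuple[str, list[dict[str, Any]]]:
    """Return (text, words) whether or not a top-level transcript is present.

    Assumed shape: "segments" is a list of dicts, each with an optional
    "text" string and an optional "word_timestamps" list.
    """
    segments = msg.get("segments") or []
    text = msg.get("transcript")
    if text is None:
        # Chain-style message: rebuild the text from the segments.
        text = " ".join(s.get("text", "").strip() for s in segments).strip()
    # Flatten word-level timestamps across all segments.
    words = [w for s in segments for w in (s.get("word_timestamps") or [])]
    return text, words
```

With this sketch, a metadata dict that still uses the old `vad_params` key fails fast on the client side instead of being rejected by the server.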

@CLAassistant

CLAassistant commented Feb 18, 2026

CLA assistant check
All committers have signed the CLA.

@jiegong-fde jiegong-fde force-pushed the jieg/update_b10_plugin branch from d6e828c to 8e1c5a2 on February 18, 2026 22:45
@jiegong-fde jiegong-fde changed the title from "Jieg/update b10 plugin" to "Baseten Plugin Update: fix metadata schema, add chain_id support, and improve response parsing" on Feb 18, 2026
@jiegong-fde jiegong-fde marked this pull request as ready for review on February 18, 2026 23:14
devin-ai-integration[bot]

This comment was marked as resolved.

@jiegong-fde jiegong-fde force-pushed the jieg/update_b10_plugin branch 2 times, most recently from 26364d0 to a281e84 on February 18, 2026 23:28
@jiegong-fde
Contributor Author

Ready for review.

@jiegong-fde jiegong-fde reopened this Feb 20, 2026
@jiegong-fde
Contributor Author

recheck

@jiegong-fde
Contributor Author

jiegong-fde commented Feb 20, 2026

Side note: I've signed the CLA, but it somehow still shows as unsigned.
[image]

@jiegong-fde jiegong-fde force-pushed the jieg/update_b10_plugin branch from 3f831e4 to c0e59c0 on February 26, 2026 04:56
devin-ai-integration[bot]

This comment was marked as resolved.

devin-ai-integration[bot]

This comment was marked as resolved.

Member

@tinalenguyen tinalenguyen left a comment

Tested it and left a few minor comments; otherwise LGTM. Thank you for the PR!

3 comment threads on livekit-plugins/livekit-plugins-baseten/livekit/plugins/baseten/stt.py (outdated)
@davidzhao davidzhao merged commit dfcd051 into livekit:main Mar 29, 2026
13 checks passed
osimhi213 added a commit to de-id/livekit-agents that referenced this pull request Mar 29, 2026
…endor

* upstream/main:
  Baseten Plugin Update: fix metadata schema, add chain_id support, and improve response parsing (livekit#4889)
  feat(anthropic): support strict tool use schema (livekit#5259)
  feat(mistral): add voxtral TTS support (livekit#5245)
  Fix/stt fallback adapter propagate aligned transcript (livekit#5237)
  fix: expose endpointing_opts in AgentSession.update_options() (livekit#5243)
  fix(utils): improve type annotation for deprecate_params decorator (livekit#5244)
  (google realtime): add gemini-3.1-flash-live-preview model (livekit#5233)
  fix: Nova Sonic interactive context bugs and dynamic tool support (livekit#5220)
  feat(assemblyai): add domain parameter for Medical Mode (livekit#5208)
  fix: ensure MCP client enter/exit run in the same task (livekit#5223)
  feat(google): add VertexRAGRetrieval provider tool (livekit#5222)
osimhi213 added a commit to de-id/livekit-agents that referenced this pull request Mar 29, 2026
* feat(google): add VertexRAGRetrieval provider tool (livekit#5222)

* fix: ensure MCP client enter/exit run in the same task (livekit#5223)

* feat(assemblyai): add domain parameter for Medical Mode (livekit#5208)

* fix: Nova Sonic interactive context bugs and dynamic tool support (livekit#5220)

Co-authored-by: Pavas Kant <pavkan@amazon.com>

* (google realtime): add gemini-3.1-flash-live-preview model (livekit#5233)

* fix(utils): improve type annotation for deprecate_params decorator (livekit#5244)

* fix: expose endpointing_opts in AgentSession.update_options() (livekit#5243)

* Fix/stt fallback adapter propagate aligned transcript (livekit#5237)

* feat(mistral): add voxtral TTS support (livekit#5245)

* feat(anthropic): support strict tool use schema (livekit#5259)

* Baseten Plugin Update: fix metadata schema, add chain_id support, and improve response parsing (livekit#4889)

---------

Co-authored-by: Yousuf Bukhari <25112850+youpesh@users.noreply.github.com>
Co-authored-by: Long Chen <longch1024@gmail.com>
Co-authored-by: Martin Schweiger <34636718+m-ods@users.noreply.github.com>
Co-authored-by: Osman-AGI <uyguripek@gmail.com>
Co-authored-by: Pavas Kant <pavkan@amazon.com>
Co-authored-by: Tina Nguyen <72938484+tinalenguyen@users.noreply.github.com>
Co-authored-by: Milad <129620931+miladmnasr@users.noreply.github.com>
Co-authored-by: Jean Perbet <jeanperbet@icloud.com>
Co-authored-by: Shaik Faizan Roshan Ali <roshan.shaik.ml@gmail.com>
Co-authored-by: jiegong-fde <jie.gong@baseten.co>
russellmartin-livekit pushed a commit that referenced this pull request Apr 13, 2026
4 participants