[RD-567] Add TTS evals by dbrkn · Pull Request #7 · dbrkn/OpenBench

dbrkn · 2026-02-13T18:01:08Z

Adds a speech_generation pipeline that generates audio from text prompts via whisperkit-cli tts, transcribes the output using the WhisperKitPro engine, and computes WER against the original prompt. Includes a text-only dataset, configurable TTS/transcription params, a registered alias
Adds a generic --pipeline-config key=value CLI flag for alias-mode overrides. ( mainly to set speakers and language for tts generation)

Sample command:

export WHISPERKIT_CLI_PATH="/path/to/whisperkit-cli"
export WHISPERKITPRO_CLI_PATH="/path/to/whisperkitpro-cli"
uv run openbench-cli evaluate \
  --pipeline whisperkit-speech-generation \
  --dataset customer-service-tts-prompts-vocalized \
  --metrics wer \
  --verbose

Sample Result:

Adds a new TTS evaluation pipeline using OpenAI's API to generate audio from text prompts, then transcribes with WhisperKitPro for WER. Made-with: Cursor

Adds a new TTS evaluation pipeline using ElevenLabs' API to generate audio from text prompts, then transcribes with WhisperKitPro for WER. Made-with: Cursor Co-authored-by: dberkin1 <berkin@argmax.com>

…erkin/tts-evals Made-with: Cursor # Conflicts: # src/openbench/pipeline/pipeline_aliases.py # src/openbench/pipeline/speech_generation/__init__.py

Adds a new TTS evaluation pipeline using Cartesia's API to generate audio from text prompts, then transcribes with WhisperKitPro for WER.

Adds a new TTS evaluation pipeline using Google's Gemini API to generate audio from text prompts, then transcribes with WhisperKitPro for WER.

) Adds multi-speaker dialogue TTS evaluation using ElevenLabs' text_to_dialogue API with chunking for long dialogues. Also includes: - Speech generation support for local datasets - Dialogue field in speech generation dataset schema - Empty dictionary guard in keyword boosting metrics

dberkin1 and others added 10 commits February 13, 2026 20:52

Add TTS evals

1e59781

refactor

7d3ab4d

reformatting

d418181

reformat

ef86575

Add OpenAI speech generation pipeline

936272f

Adds a new TTS evaluation pipeline using OpenAI's API to generate audio from text prompts, then transcribes with WhisperKitPro for WER. Made-with: Cursor

Add ElevenLabs speech generation pipeline (#8)

82f40c3

Adds a new TTS evaluation pipeline using ElevenLabs' API to generate audio from text prompts, then transcribes with WhisperKitPro for WER. Made-with: Cursor Co-authored-by: dberkin1 <berkin@argmax.com>

Merge remote-tracking branch 'origin/berkin/openai-speech-gen' into b…

3bc0813

…erkin/tts-evals Made-with: Cursor # Conflicts: # src/openbench/pipeline/pipeline_aliases.py # src/openbench/pipeline/speech_generation/__init__.py

Add Cartesia speech generation pipeline (#10)

7a8b2c3

Adds a new TTS evaluation pipeline using Cartesia's API to generate audio from text prompts, then transcribes with WhisperKitPro for WER.

Add Gemini speech generation pipeline (#11)

20a377f

Adds a new TTS evaluation pipeline using Google's Gemini API to generate audio from text prompts, then transcribes with WhisperKitPro for WER.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[RD-567] Add TTS evals#7

[RD-567] Add TTS evals#7
dbrkn wants to merge 10 commits intomainfrom
berkin/tts-evals

dbrkn commented Feb 13, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

dbrkn commented Feb 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

dbrkn commented Feb 13, 2026 •

edited

Loading