Python implementation of the NIST md-eval.pl script for evaluating rich transcription and speaker diarization accuracy.
-
Updated
Jan 12, 2026 - Python
Python implementation of the NIST md-eval.pl script for evaluating rich transcription and speaker diarization accuracy.
An English-Spanish code switching dataset adapted from the Miami-Corpus
Fully automated multi-speaker transcription tool built off of the WhisperX library with Pyannote. Comes with Word Alignment, Timestamps, Voice activity Detection and Speaker Diarization. Mathematically optimised for accuracy.
A Python CLI tool for transcribing podcasts with automatic speaker diarisation using OpenAI's Whisper API. Features real-time progress tracking, cost estimation, automatic chunking for large files, and clean Markdown output with timestamps. Supports both URLs and local files. Python 3.10+ compatible.
Add a description, image, and links to the diarisation topic page so that developers can more easily learn about it.
To associate your repository with the diarisation topic, visit your repo's landing page and select "manage topics."