Skip to content

Commit 6054db9

Browse files
Update README and config for transcription languages and ASR model version
1 parent 847d734 commit 6054db9

2 files changed

Lines changed: 6 additions & 3 deletions

File tree

README.md

Lines changed: 5 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,9 @@
11
# Matrix Voice Transcript
22

3-
Matrix bot that transcribes voice messages and audio files using [NVIDIA NeMo Parakeet TDT](https://huggingface.co/nvidia/parakeet-tdt-0.6b-v2) running locally on CPU. Supports E2EE rooms. No audio leaves the server.
3+
> **Transcription languages / Языки распознавания / Sprachen / Langues:**
4+
> English, Русский, Deutsch, Français, Español, Português, Polski, Czech, Slovak, Bulgarian, Ukrainian, Dutch, Italian, Romanian, Swedish, Danish, Finnish, Norwegian, Greek, Hungarian, Estonian, Latvian, Lithuanian, Slovenian, Croatian, Maltese
5+
6+
Matrix bot that transcribes voice messages and audio files using [NVIDIA NeMo Parakeet TDT](https://huggingface.co/nvidia/parakeet-tdt-0.6b-v3) running locally on CPU. Supports E2EE rooms. No audio leaves the server.
47

58
## Requirements
69

@@ -23,7 +26,7 @@ Matrix bot that transcribes voice messages and audio files using [NVIDIA NeMo Pa
2326
| `MATRIX_ACCESS_TOKEN` | Bot access token |
2427
| `MATRIX_PASSWORD` | Optional. Prunes stale E2EE devices on startup; required for reliable decryption in encrypted rooms. |
2528
| `LOCALE` | Message language: `en` (default) or `ru` |
26-
| `ASR_MODEL_NAME` | NeMo model (default: `nvidia/parakeet-tdt-0.6b-v2`) |
29+
| `ASR_MODEL_NAME` | NeMo model (default: `nvidia/parakeet-tdt-0.6b-v3`) |
2730
| `MAX_AUDIO_BYTES` | Max file size in bytes (default: `26214400` = 25 MB) |
2831
| `STORE_PATH` | Olm key store path inside the container (default: `/data/store`) |
2932

src/config.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -15,7 +15,7 @@ class Settings(BaseSettings):
1515

1616
locale: str = "en"
1717

18-
asr_model_name: str = "nvidia/parakeet-tdt-0.6b-v2"
18+
asr_model_name: str = "nvidia/parakeet-tdt-0.6b-v3"
1919

2020
max_audio_bytes: int = 25 * 1024 * 1024
2121

0 commit comments

Comments
 (0)