A desktop app for transcribing Ghanaian audio files using Gemini AI.
- Python 3.8 or newer
- Git (for automatic setup)
- An internet connection
Linux — install these once:
sudo apt install python3-tk xclip wl-clipboard gitWindows / Mac — Python and Git are all you need.
git clone https://github.com/GhanaNLP/transcriber.git
cd transcriberpython transcriber.pyOn first launch the app automatically installs everything it needs (huggingface_hub and the language validator). This takes about 30 seconds and only happens once.
A setup screen will appear asking for three things:
- Volunteer code — paste the code you were sent
- Audio folder — browse to the folder where you extracted your zip
- Output folder — choose where your transcript files will be saved
Click Start →. Your code is remembered for future sessions — you won't need to paste it again.
- The app shows one audio file at a time
- Click ⎘ Copy audio file → open gemini.google.com → start a new chat → upload the file
- Click ✦ Prompt 1 (News) or ✦ Prompt 2 (Conversational) to copy the right prompt → paste it into Gemini and send
- Copy Gemini's response → click back on the app → paste into the text box
- The app checks and saves automatically:
- Repeated sentences are removed
- Transcripts that are too short or too long are flagged
- Transcripts that don't match your assigned language are flagged — this helps catch paste errors
- Every 10 transcriptions your work is uploaded automatically
Which prompt?
- Prompt 1 (News) — formal news broadcast audio
- Prompt 2 (Conversational) — informal or mixed speech
If one gives a short or poor result, try the other. Still poor → click Skip ⇥.
The app automatically checks that each transcript matches the language you were assigned. If it doesn't:
- Yellow warning — the match is low but could be fine (e.g. heavy English mixing). You can still save.
- Red block — the transcript looks completely wrong. Re-paste from Gemini or skip the file.
This catches accidental wrong pastes, not genuine mixed-language speech.
Click Skip ⇥ if you can't get a valid transcript after a few attempts. The file won't appear again. To undo a skip, remove its line from skipped.log in your transcripts folder.
Your results upload automatically every 10 transcriptions. To upload immediately, click ⬆ Push Now. When you finish all your files the app pushes anything remaining automatically.
App won't open on Linux
sudo apt install python3-tk"Assignment mismatch" on startup Make sure you selected the folder where you extracted your zip file, not a different one.
Push failed The app retries automatically next time you save. You can also click ⬆ Push Now once your connection is back.
Lost your code Contact the project coordinator to get it resent. Once entered it will be saved for future sessions.