VibeToText

Voice-to-text for developers featuring AI-powered cleanup and detailed analytics.

Components

Component	Directory	Stack
Python CLI	`src/vibetotext/`	Python 3.9+, Whisper.cpp, sounddevice, pynput
Windows native app	`windows-native/`	C# .NET 9, WPF, NAudio, Whisper.net
macOS native app	`macos-native/`	Swift 5.9, Metal, macOS 14+
Electron history app	`history-app/`	Electron 28, better-sqlite3, D3.js

All implementations share the same SQLite database at ~/.vibetotext/history.db.

Features

Multi-Mode Hotkeys

Ctrl+Shift — Raw transcription
Cmd+Shift — Greppy mode with semantic code search
Alt+Shift — Cleanup mode (AI refines rambling into clear prompts)
Cmd+Alt — Plan mode (generates structured implementation plans)

Fast Local Transcription

Whisper.cpp for 2-4x faster transcription than Python Whisper
Technical vocabulary bias for programming terms
Auto-paste to cursor

Analytics & Settings

Press Cmd+Comma (macOS) or Ctrl+Comma (Windows) to open the History & Settings window.

Streaks & Personal Records — Track your current streak, best WPM, most words/day, and longest session.
Topic Speed & Mood — See how fast and positive you are across topics like Testing, Planning, Documentation, and more. Bar colors shift from negative to positive sentiment.
Daily Goal Progress — Set daily and weekly word targets and track completion.
Activity Heatmap — GitHub-style hourly/yearly view of when you dictate most.
Peak Hours & Words Over Time — Visualize your productivity patterns and dictation volume trends.
Filler Words & Vocabulary Diversity — Monitor filler word usage and track your unique word count and richness score.
Recent History — Review and copy previous transcriptions.
Microphone Selection — Switch audio input devices directly from the UI.

Cosmic Visualization (macOS only)

A living 3D world that reacts to your voice in real time. As you dictate, a procedural planet grows with villagers, buildings, crops, and a tree whose leaves are your most-used words. A cosmic entity watches from a black hole in the sky — and if your sentiment turns negative, it attacks.

Hotkeys

Cmd+Ctrl+G — Open the Word Galaxy visualization

How it works

Sentiment-driven behavior — Your words are analyzed in real time. Positive speech keeps the world peaceful; negative sentiment triggers the cosmic entity to charge and fire lasers at your village.
Procedural planet — Villagers (farmers, scholars, builders, guards) and buildings populate a 3D planet that grows as you talk.
Word tree — Your top 500 words are assigned to leaves on a procedural tree that grows during the intro sequence.
Word nebula — Recent transcriptions float as text in a nebula cloud. Common words migrate from the nebula to the tree.
Seasons & day/night — A 15-second day/night cycle with shifting sky colors, dynamic lighting, and fireflies at night.
GLB export — Export generated 3D entities for use in external tools.

Requires macOS 14+ (Sonoma). The cosmic visualization is part of the native macOS app in macos-native/, built with Swift and Metal. Not available on Windows or Linux.

Install

Python CLI

pip install -e .
pip install -e ".[gemini,dev]"   # with Gemini + dev dependencies

Optionally set GEMINI_API_KEY in a .env file to enable cleanup/plan modes. You can copy the .env.example file and then add your key.

Windows Native App

cd windows-native
build.bat                        # or: dotnet build src/VibeToText/VibeToText.csproj

macOS Native App

cd macos-native
swift build

Requires macOS 14+ (Sonoma) and Swift 5.9+.

Platform Builds (Python app)

These scripts package the Python CLI into standalone executables via PyInstaller:

# Windows (from project root) → dist/vibetotext-engine.exe, dist/vibetotext-ui.exe
build_windows.bat

# macOS
bash packaging/macos/build_macos.sh

# Linux
bash packaging/linux/build_linux.sh

Executables will be in the dist/ folder. See packaging/ for platform-specific configs.

Usage

vibetotext              # Start with default hotkeys
vibetotext --model base # Use specific Whisper model

Start/Stop with History Application

start-all.sh

stop-all.sh

Name		Name	Last commit message	Last commit date
Latest commit History 145 Commits
.claude		.claude
.github/workflows		.github/workflows
VibeToText.Windows		VibeToText.Windows
build		build
docs		docs
history-app		history-app
macos-native		macos-native
native-app		native-app
packaging		packaging
scripts		scripts
src/vibetotext		src/vibetotext
vibe-swift		vibe-swift
visualizations		visualizations
windows-native		windows-native
.env.example		.env.example
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
LICENSE		LICENSE
README.md		README.md
build_windows.bat		build_windows.bat
coarse-cosmic-entity.glb		coarse-cosmic-entity.glb
cosmic-entity-reference.png		cosmic-entity-reference.png
dev.py		dev.py
generate-cosmic-mesh.mjs		generate-cosmic-mesh.mjs
generate-reference-image.mjs		generate-reference-image.mjs
generate-trellis-mesh.mjs		generate-trellis-mesh.mjs
package-lock.json		package-lock.json
package.json		package.json
pyproject.toml		pyproject.toml
render_sphere.py		render_sphere.py
restart.sh		restart.sh
setup_app.py		setup_app.py
setup_windows_path.py		setup_windows_path.py
start-all.sh		start-all.sh
stop-all.sh		stop-all.sh
vibetotext-startup.sh		vibetotext-startup.sh
vibetotext-stop.sh		vibetotext-stop.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

VibeToText

Components

Features

Analytics & Settings

Cosmic Visualization (macOS only)

Install

Python CLI

Windows Native App

macOS Native App

Platform Builds (Python app)

Usage

Start/Stop with History Application

About

Uh oh!

Releases 3

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

VibeToText

Components

Features

Analytics & Settings

Cosmic Visualization (macOS only)

Install

Python CLI

Windows Native App

macOS Native App

Platform Builds (Python app)

Usage

Start/Stop with History Application

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 3

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages