Voice Prompt

A lightweight cross-platform app that converts speech to text and processes it with AI. Perfect for crafting prompts for AI tools like Claude and ChatGPT - speak in your native language, get polished English output ready to use.

Installation

macOS (Recommended)

We strongly recommend using Homebrew for macOS. It handles installation, updates, and uninstallation automatically.

brew tap efeecllk/voice-prompt
brew install --cask voice-prompt

Works on both Apple Silicon (M1/M2/M3) and Intel Macs automatically.

Why Homebrew?

Automatic architecture detection (Apple Silicon vs Intel)
Easy updates with brew upgrade
Clean uninstall with brew uninstall
No manual DMG mounting needed

Note: Direct DMG downloads are available in Releases but are not recommended. Use Homebrew instead.

Windows

Download from Releases:

Voice.Prompt_x.x.x_x64.msi - Windows Installer (recommended)
Voice.Prompt_x.x.x_x64-setup.exe - NSIS Installer

Note: You may see a SmartScreen warning on first run. Click "More info" → "Run anyway".

Update

macOS:

brew update && brew upgrade --cask voice-prompt

Windows: Download the latest version from Releases.

Features

Core Features

Feature	Description
Menu Bar / System Tray	Lives in your menu bar (macOS) or system tray (Windows), always one click away
Voice Recording	One-click recording with visual feedback and waveform animation
Speech-to-Text	Powered by OpenAI Whisper API - industry-leading accuracy
Multi-Language	20+ source languages including auto-detect
Global Shortcut	Trigger recording from any app (default: `Cmd/Ctrl + Shift + Space`)
Dark Mode	Follows your system theme automatically

Output Formats

Format	Description
Standard Translation	Clean, accurate English translation
Formal English	Professional business English
Casual English	Friendly, conversational tone
Ralph Wiggum Mode	Transforms simple requests into detailed Claude Code prompts
Custom Formats	Create your own AI processing templates

Organization

Feature	Description
History	Access your last 20 translations
Favorites	Star important outputs for quick access
My Prompts	Save outputs to your personal prompt library
One-Click Copy	Copy any text instantly to clipboard
Send to Terminal	Paste prompts directly into Ghostty, Warp, iTerm2, or Terminal.app

Feature Details

1. Voice Recording

Click the microphone button or use the global shortcut to start recording. The app shows:

Recording duration timer
Live audio waveform visualization
Clear stop button

Recording stops automatically when you click stop, and processing begins immediately.

2. Output Format Selection

Choose how your speech is processed:

Default Translation - Translates to natural English
Formal Business - Professional tone for work contexts
Casual Conversation - Relaxed, friendly output
Ralph Wiggum Mode - Special mode that expands simple voice requests into detailed, well-structured prompts for Claude Code

3. Custom Output Formats

Create your own processing templates:

Go to Settings
In "Create Your Own Output Format", click the microphone icon
Describe what you want in your voice (e.g., "Convert my speech into git commit messages")
The AI generates a custom format with name, description, and system prompt
Optionally select an emoji icon for easy recognition
Click Save

Your custom format appears in the Output Format dropdown and shows in the header when active.

4. My Prompts Library

Save and organize your favorite outputs:

From Favorites: Click the document icon on any favorite to save it
Access: Click the document icon in the header to view My Prompts
Edit: Click any prompt to edit its name, description, or content
Copy: Copy individual fields (name, description, content) with one click
Delete: Remove prompts you no longer need

5. History & Favorites

History Tab: Shows your last 20 recordings with source and output
Favorites Tab: Shows starred items for quick access
Star Button: Click to add/remove from favorites
Copy Button: Copy the output text instantly
Save to My Prompts: Save favorites to your prompt library

6. Global Shortcut

Trigger Voice Prompt from any application:

Shortcut	Platform
`Cmd + Shift + Space`	macOS (default)
`Ctrl + Shift + Space`	Windows (default)

Change in Settings → Global Shortcut. Available options:

Cmd/Ctrl + Shift + Space
Cmd/Ctrl + Option/Alt + Space
Cmd/Ctrl + Shift + .
Cmd/Ctrl + Option/Alt + V

7. Send to Terminal (macOS)

Paste generated prompts directly into your terminal app without manual copy-paste:

The app auto-detects running terminals (Ghostty, Warp, iTerm2, Terminal.app)
Click the arrow button on any result to send it to your terminal
Or enable Auto-paste in Settings to send automatically after generation
Optionally enable Auto-submit to also press Enter after pasting

Supported terminals: Ghostty, Warp, iTerm2, Terminal.app

Setup: Grant Accessibility permission on first use (System Settings > Privacy & Security > Accessibility).

8. Theme Support

System (default) - Follows your OS appearance
Light - Always light mode
Dark - Always dark mode

Supported Languages

Language	Code	Language	Code
Auto-detect	auto	Korean	ko
Turkish	tr	Chinese	zh
English	en	Arabic	ar
Spanish	es	Hindi	hi
French	fr	Dutch	nl
German	de	Polish	pl
Italian	it	Swedish	sv
Portuguese	pt	Danish	da
Russian	ru	Norwegian	no
Japanese	ja	Finnish	fi
Greek	el

Configuration

API Key

Your OpenAI API key is stored securely on your device.

Get your API key from platform.openai.com
Open Voice Prompt → Settings
Paste your API key
Click Save

API Costs

Service	Cost
Whisper (Speech-to-Text)	~$0.006/minute
GPT-4.1-nano (Processing)	~$0.0001/request

Estimated: ~$0.05/day for 50 translations

Build from Source

Prerequisites

Node.js 20+
pnpm 9+
Rust stable
macOS: Xcode Command Line Tools
Windows: Visual Studio Build Tools with C++ workload

Steps

# Clone the repository
git clone https://github.com/efeecllk/voice-prompt.git
cd voice-prompt

# Install dependencies
pnpm install

# Run in development
pnpm tauri dev

# Build for production
pnpm tauri build

Tech Stack

Layer	Technology
Framework	Tauri 2.0
Frontend	React 18 + TypeScript
Styling	Tailwind CSS
State	Zustand
Speech-to-Text	OpenAI Whisper
Text Processing	OpenAI GPT-4.1-nano

Troubleshooting

macOS: "App is damaged" or "Cannot be opened"

xattr -cr /Applications/Voice\ Prompt.app

Windows: SmartScreen Warning

Click "More info" → "Run anyway". This appears because the app isn't code-signed.

API Key Not Working

Check your API key at platform.openai.com
Ensure you have billing enabled
Try generating a new key

Recording Not Working

Grant microphone permission when prompted
Check System Settings → Privacy & Security → Microphone (macOS)
Check Settings → Privacy → Microphone (Windows)

Send to Terminal Not Working (macOS)

Grant Accessibility permission: System Settings → Privacy & Security → Accessibility → Add Voice Prompt
Restart the app after granting permission
Check that the correct terminal is selected in Settings → Target Terminal

License

MIT License - see LICENSE for details.

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

What's New in v0.3.0

Send to Terminal - Paste prompts directly into Ghostty, Warp, iTerm2, or Terminal.app
Auto-paste - Automatically send generated prompts to your terminal
Auto-submit - Optionally press Enter after pasting for hands-free workflow
Smart Terminal Detection - Auto-detects running terminals, prioritizes dev terminals
History Send - Send any history item directly to your terminal

What's New in v0.2.0

Windows Support - Full Windows 10+ support with native installer
Intel Mac Support - Now works on both Apple Silicon and Intel Macs
Custom Output Formats - Create your own AI processing templates with voice
My Prompts Library - Save and organize your favorite outputs
Favorites System - Star important translations for quick access

Made with Tauri

Name		Name	Last commit message	Last commit date
Latest commit History 90 Commits
.github/workflows		.github/workflows
assets		assets
docs/plans		docs/plans
scripts		scripts
src-tauri		src-tauri
src		src
.gitignore		.gitignore
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
SECURITY.md		SECURITY.md
index.html		index.html
package.json		package.json
pnpm-lock.yaml		pnpm-lock.yaml
postcss.config.js		postcss.config.js
tailwind.config.js		tailwind.config.js
tsconfig.json		tsconfig.json
tsconfig.node.json		tsconfig.node.json
vite.config.ts		vite.config.ts

Folders and files

Latest commit

History

Repository files navigation

Voice Prompt

Installation

macOS (Recommended)

Windows

Update

Features

Core Features

Output Formats

Organization

Feature Details

1. Voice Recording

2. Output Format Selection

3. Custom Output Formats

4. My Prompts Library

5. History & Favorites

6. Global Shortcut

7. Send to Terminal (macOS)

8. Theme Support

Supported Languages

Configuration

API Key

API Costs

Build from Source

Prerequisites

Steps

Tech Stack

Troubleshooting

macOS: "App is damaged" or "Cannot be opened"

Windows: SmartScreen Warning

API Key Not Working

Recording Not Working

Send to Terminal Not Working (macOS)

License

Contributing

What's New in v0.3.0

What's New in v0.2.0

About

Topics

Resources

License

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 4

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages