Gemini Live TTS Chrome Extension (Enhanced v2.4)

Major Update v2.4.0: Now supports 30 voices, TTS vs Native Audio switching, and Model Selection!

This extension enables you to use Google's Gemini Multimodal Live API (and TTS API) to read selected text on any webpage. It supports real-time streaming, image understanding (in Native Audio mode), and a wide range of natural voices.

✨ New Features in v2.4

🎙️ Two API Modes:
- Native Audio (Live API): Real-time, supports system prompts (translation, style change), understands images. expensive but powerful.
- Text-to-Speech (TTS): Cheaper, standard text reading. Ideal for long articles.
🤖 Model Selection: Choose between gemini-2.5-flash, gemini-2.5-pro and other preview models.
🗣️ 30+ Voices: Full support for all Gemini voices including Kore, Fenrir, Aoede, Charon, and more.
⚡ Fixed: Resurrected the extension after Google deprecated old ephemeral models.

🚀 Installation

Clone or download this repository.
Open Chrome and navigate to chrome://extensions.
Enable Developer Mode (top right).
Click Load unpacked and select the extension folder.

⚙️ Configuration

Get your API Key from Google AI Studio.
Click the extension icon and select Options.
Enter your API Key.
Select API Type (Native Audio or TTS).
Choose your favorite Voice and Model.
(Optional) Set a System Prompt (e.g., "Translate to Spanish and read").

📝 Usage

Select text on any webpage.
Right-click and choose "Transcribe with Gemini".
(Or click the extension icon to take a screenshot and have Gemini describe/read it - Native Audio only).

🛠️ Tech Stack

Chrome Extension Manifest V3
Gemini Live API (WebSocket)
Gemini TTS API (REST)
Native Audio / Web Audio API

Credits

Original extension by jansenmtan — Chrome Web Store · GitHub. Fixes and v2.0 updates by tomfalkenberg. Major v2.4 overhaul (UI, Voices, TTS support) by StrangeTeaCreature.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
CHANGELOG.md		CHANGELOG.md
LICENSE		LICENSE
README.md		README.md
audioStreamer.js		audioStreamer.js
background.js		background.js
content.js		content.js
defaultPrompt.js		defaultPrompt.js
errors.js		errors.js
example-image-1.png		example-image-1.png
example-image-2.png		example-image-2.png
example-playback-controls.png		example-playback-controls.png
example-text.png		example-text.png
faq.html		faq.html
icon-16.png		icon-16.png
icon-32.png		icon-32.png
manifest.json		manifest.json
offscreen.html		offscreen.html
offscreen.js		offscreen.js
onboarding.html		onboarding.html
options.html		options.html
options.js		options.js
page_action-16.png		page_action-16.png
page_action-32.png		page_action-32.png
popup.html		popup.html
popup.js		popup.js
request.raw		request.raw
screenshotSelection.js		screenshotSelection.js
silence.raw		silence.raw
styles.css		styles.css

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Gemini Live TTS Chrome Extension (Enhanced v2.4)

✨ New Features in v2.4

🚀 Installation

⚙️ Configuration

📝 Usage

🛠️ Tech Stack

Credits

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Gemini Live TTS Chrome Extension (Enhanced v2.4)

✨ New Features in v2.4

🚀 Installation

⚙️ Configuration

📝 Usage

🛠️ Tech Stack

Credits

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages