EchoVeil Server

Companion server for the EchoVeil Chrome extension. Runs locally and performs real-time AI voice conversion using Seed-VC and optional audio-driven face animation using MuseTalk.

The extension captures tab audio, streams it to this server over WebSocket, and plays back the converted voice in real time. All processing happens on your machine — nothing leaves localhost.

Requirements

GPU: NVIDIA with CUDA (Windows) or Apple Silicon (macOS). 4+ GB VRAM recommended (8+ GB if using face animation).
Disk: ~5 GB for voice conversion, ~9 GB with face animation models.

Install (Windows)

Download the latest installer from Releases and run it.

The installer will:

Install an embedded Python 3.12
Download Seed-VC and MuseTalk source code
Download MuseTalk model weights (~4 GB)
Install PyTorch with CUDA and all dependencies
Register the echoveil:// URI protocol so the extension can auto-launch the server

After installation, the server can be launched from the Start Menu or automatically by the extension.

Install (macOS)

There is no packaged installer for macOS yet. The server runs fine on Apple Silicon (M1/M2/M3/M4) via MPS — you just need to set it up manually.

# Install Python 3.12 (if you don't have it)
brew install python@3.12

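# Clone and set up
git clone https://github.com/mexican75/echoveil-server.git
cd echoveil-server

# Create and activate a virtual environment
# (Homebrew's Python blocks system-wide pip installs)
python3.12 -m venv .venv
source .venv/bin/activate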

# Install PyTorch (MPS support is included by default)
pip3 install torch torchaudio torchvision

# Clone seed-vc
git clone --depth 1 https://github.com/Plachtaa/seed-vc.git seed-vc

# Clone MuseTalk (optional — for face animation)
git clone --depth 1 https://github.com/TMElyralab/MuseTalk.git musetalk

# Download MuseTalk model weights (optional — ~4 GB)
python3 download_musetalk_models.py

# Install remaining dependencies
pip3 install -r requirements.txt

# Patch BigVGAN
python3 patch_bigvgan.py

# Run the server
python3 main.py

The extension cannot auto-launch the server on macOS (no echoveil:// URI protocol registered). You need to start it manually before using the extension. The server will auto-shut down after 10 minutes of inactivity.

Intel Macs fall back to CPU inference, which will be too slow for real-time use.
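
To check ahead of time which backend PyTorch would pick on your machine, a small script along these lines may help. It is only a sketch: `pick_device` is a hypothetical helper name, not a function from this repository, and the server's own device selection may differ.

```python
def pick_device() -> str:
    """Return "cuda", "mps", or "cpu" depending on what PyTorch reports."""
    try:
        import torch
    except ImportError:
        # PyTorch is not installed yet, so no GPU backend can be used.
        return "cpu"
    if torch.cuda.is_available():
        return "cuda"
    # torch.backends.mps is missing on very old PyTorch builds.
    mps = getattr(torch.backends, "mps", None)
    if mps is not None and mps.is_available():
        return "mps"
    return "cpu"


print(f"PyTorch would use: {pick_device()}")
```

If this prints `cpu` on a Mac, expect conversion to lag behind real time, as noted above.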

How It Works

The extension detects the server isn't running and opens echoveil://launch
The system tray launcher starts and loads ML models (~30s on first run, cached after)
Audio streams over WebSocket at ws://127.0.0.1:8765/ws/convert
The server auto-shuts down after 10 minutes of inactivity (configurable via IDLE_TIMEOUT_MINUTES env var, 0 to disable)
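
The idle shutdown described in the last step can be pictured with a small watchdog. This is a minimal sketch under stated assumptions, not the server's actual code: `IdleWatchdog`, `touch`, and `expired` are hypothetical names, and only the 10-minute default and the "0 disables" rule come from this README.

```python
import time


class IdleWatchdog:
    """Remembers the last activity time and reports when the timeout has passed."""

    def __init__(self, timeout_minutes: float = 10.0, clock=time.monotonic):
        self.timeout_s = timeout_minutes * 60.0
        self._clock = clock  # injectable so the logic can be tested without waiting
        self._last_activity = clock()

    def touch(self) -> None:
        """Call whenever a request or an audio chunk arrives."""
        self._last_activity = self._clock()

    def expired(self) -> bool:
        """True once the server has been idle for the full timeout."""
        if self.timeout_s <= 0:  # 0 disables auto-shutdown
            return False
        return self._clock() - self._last_activity >= self.timeout_s
```

A server loop would call `touch()` on every incoming message and poll `expired()` periodically to decide when to exit.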

API

Endpoint	Method	Description
`/status`	GET	Server status, loaded reference, and avatar info
`/reference`	POST	Upload reference voice audio (WAV, MP3, FLAC, etc.)
`/avatar`	POST	Upload face image for lip-sync animation (JPG, PNG, WebP)
`/settings`	GET	Current face animation settings
`/settings`	PATCH	Update face animation settings (`face_fps`, `use_tiny_vae`)
`/ws/convert`	WS	Real-time PCM voice conversion stream
`/ws/face`	WS	Real-time face animation frames (JPEG binary)

The server binds to 127.0.0.1 only — it is not accessible from other machines.
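
As an illustration of the REST endpoints in the table, the sketch below builds requests with the Python standard library. Several details are assumptions rather than documented behavior: that the REST endpoints share port 8765 with the WebSocket URL, that `/settings` accepts a JSON body, and that `15` is a sensible `face_fps` value. The helper names are hypothetical.

```python
import json
import urllib.request

# Assumption: REST endpoints are served on the same host and port as the WebSocket.
BASE_URL = "http://127.0.0.1:8765"


def status_request() -> urllib.request.Request:
    """Build a GET request for /status."""
    return urllib.request.Request(f"{BASE_URL}/status", method="GET")


def settings_patch_request(**changes) -> urllib.request.Request:
    """Build a PATCH request for /settings (JSON body is an assumption)."""
    body = json.dumps(changes).encode("utf-8")
    return urllib.request.Request(
        f"{BASE_URL}/settings",
        data=body,
        method="PATCH",
        headers={"Content-Type": "application/json"},
    )


def send(request: urllib.request.Request, timeout: float = 5.0) -> str:
    """Send a request to the local server and return the raw response text."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.read().decode("utf-8")


# With the server running:
#   print(send(status_request()))
#   print(send(settings_patch_request(face_fps=15)))
```

Building the request separately from sending it keeps the example inspectable without a running server.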

Developer Setup

On Windows, setup.bat handles everything (Python, PyTorch CUDA, seed-vc, MuseTalk, deps). On macOS, follow the install instructions above.

# Run the server directly:
python main.py

# Or with the system tray launcher (Windows):
python tray_launcher.py

Project Structure

main.py                    — FastAPI server (REST + WebSocket endpoints)
voice_converter.py         — Seed-VC wrapper with streaming buffers + SOLA crossfade
face_animator.py           — MuseTalk wrapper for audio-driven face animation
dashboard.py               — Rich console dashboard for server status monitoring
tray_launcher.py           — System tray icon, starts/stops the server
launch.vbs                 — Hidden-console VBS launcher (full GPU priority)
patch_bigvgan.py           — Patches BigVGAN for newer huggingface_hub versions
download_musetalk_models.py — Downloads MuseTalk model weights from HuggingFace
setup.bat                  — Windows one-click setup
start.bat                  — Windows server launcher (console)
installer/                 — Inno Setup installer script
.github/workflows/         — CI: builds the Windows installer on tag push
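
For readers unfamiliar with the SOLA crossfade mentioned for voice_converter.py, here is a generic pure-Python sketch of the technique (synchronized overlap-add). It is not the code in this repository: the function names are hypothetical, the fade is linear, and it assumes each chunk holds at least `search + 2 * len(prev_tail)` samples.

```python
import math


def best_offset(prev_tail, chunk, search):
    """Return the shift into `chunk` whose start best matches `prev_tail`."""
    n = len(prev_tail)
    best, best_score = 0, -math.inf
    for shift in range(search + 1):
        window = chunk[shift:shift + n]
        dot = sum(a * b for a, b in zip(prev_tail, window))
        energy = sum(b * b for b in window)
        score = dot / math.sqrt(energy + 1e-8)  # normalized cross-correlation
        if score > best_score:
            best, best_score = shift, score
    return best


def sola_join(prev_tail, chunk, search):
    """Crossfade `prev_tail` into `chunk` at the best-aligned offset.

    Returns (samples to emit now, tail to hold back for the next call).
    """
    n = len(prev_tail)
    shift = best_offset(prev_tail, chunk, search)
    aligned = chunk[shift:]
    faded = []
    for i in range(n):
        w = i / n  # linear fade-in weight for the new chunk
        faded.append((1.0 - w) * prev_tail[i] + w * aligned[i])
    body = aligned[n:len(aligned) - n]
    new_tail = aligned[len(aligned) - n:]
    return faded + body, new_tail
```

Aligning the chunks before fading is what avoids the phase cancellation that a fixed-position crossfade can produce at chunk boundaries.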

Configuration

Env Variable	Default	Description
`IDLE_TIMEOUT_MINUTES`	`10`	Auto-shutdown after N minutes of inactivity. `0` to disable.
`CORS_ORIGINS`	(empty)	Comma-separated additional CORS origins. Chrome extension origins are allowed by default.
`SEED_VC_ROOT`	`./seed-vc`	Path to the Seed-VC source directory.
`MUSETALK_ROOT`	`./musetalk`	Path to the MuseTalk source directory.
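
The sketch below shows one way these variables and their defaults could be read in Python. `ServerConfig` and `load_config` are hypothetical names used for illustration; the server's actual parsing may differ.

```python
import os
from dataclasses import dataclass
from pathlib import Path


@dataclass(frozen=True)
class ServerConfig:
    idle_timeout_minutes: float
    cors_origins: tuple
    seed_vc_root: Path
    musetalk_root: Path


def load_config(env=None) -> ServerConfig:
    """Read the documented variables from `env`, falling back to their defaults."""
    env = os.environ if env is None else env
    raw_origins = env.get("CORS_ORIGINS", "")
    # Split the comma-separated list and drop empty entries.
    origins = tuple(o.strip() for o in raw_origins.split(",") if o.strip())
    return ServerConfig(
        idle_timeout_minutes=float(env.get("IDLE_TIMEOUT_MINUTES", "10")),
        cors_origins=origins,
        seed_vc_root=Path(env.get("SEED_VC_ROOT", "./seed-vc")),
        musetalk_root=Path(env.get("MUSETALK_ROOT", "./musetalk")),
    )
```

For example, running the server with `IDLE_TIMEOUT_MINUTES=0` set in the environment would yield `idle_timeout_minutes == 0.0`, which the table above defines as "never shut down".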

License

GPL-3.0-or-later
