🔊 Text-prompted Generative Audio Model - With the ability to clone voices
Updated Aug 24, 2025 - Jupyter Notebook
A comprehensive ComfyUI integration for Microsoft's VibeVoice text-to-speech model, enabling high-quality single and multi-speaker voice synthesis directly within your ComfyUI workflows.
Implementation of RIFT-SVC, a singing voice conversion model based on Rectified Flow Transformer.
AI voice cloning desktop application that runs locally on your computer at no cost
An all-in-one OpenAI multi-model chatbot and multi-tool AI application for iOS, built with Objective-C. In addition to near-perfect memory recall, the app can produce transcriptions, analyze attached files, generate images and videos, and process image edits. Responses can be spoken with ElevenLabs voices, and users can clone their own voices as well.
Run XTTS with Docker/Podman for voice fine-tuning in Gradio's Web UI
Synthesize studio-quality AI voiceovers, clone voices in seconds, and translate subtitles in 200+ languages, all fully offline on your device.