🎤💬 Full example of implementing ChatGPT's realtime voice from scratch with VAD + STT + LLM + TTS technology stack within almost one file!
-
Updated
Oct 29, 2025 - TypeScript
🎤💬 Full example of implementing ChatGPT's realtime voice from scratch with VAD + STT + LLM + TTS technology stack within almost one file!
Real-time voice agents with parallel async background sub-agents — conversations continue naturally while tasks run • Join the builders → https://discord.gg/mqxKaN3UKC
LiveKit voice app validation skill. Use when building, debugging, or declaring working any LiveKit voice agent, Agents UI app, or React/Next.js LiveKit project. Enforces evidence-based validation before reporting a session, token endpoint, worker, transcript, or end-to-end voice interaction as complete.
Open-source realtime voice agent server in Go with WebRTC (WHIP), barge-in, streaming STT/LLM/TTS pipelines, plugin system, multi-language SDKs, SIP telephony, ESP32 support & fully local mode.
An AI-powered object detection system using YOLOv8 to identify and locate graffiti across various contexts including walls, buildings, over-bridges, vehicles, and other surfaces.
Real-time hand sign recognition using LSTM-based models for sequence detection from video frames.
Voice agent prototype for structured clinical interviewing, with VAD-based interruption handling, modular ASR/LLM/TTS backends, and dialogue workflow control.
LiveKit Agents UI demo showing a voice AI assistant that schedules roof inspections using real-time voice interaction, visualizers, and booking workflow.
A real-time (<500ms) voice AI concierge built with Next.js, FastAPI, and Gemini 2.5 Flash Lite. Features local RAG (ChromaDB) for policy retrieval, Tool Calling for live booking, and event-driven CRM logging to Google Sheets.
Traffyx-AI — Traffic Forecasting & Urban Mobility Intelligence System Applied machine learning system for traffic prediction, congestion analysis, and real-world spatiotemporal data modeling.
Production-ready real-time voice AI pipeline integrating Twilio Media Streams, streaming ASR (Deepgram), LLM reasoning, and live analytics dashboard. Designed for ultra-low latency conversational intelligence in call center and healthcare environments.
Real-time face verification system using MediaPipe Face Mesh and landmark-based geometric feature extraction for improved accuracy and robustness.
High-performance async Python backend for real-time AI conversations with Quart, Supabase, and OpenAI.
Example apps showcase what can be build with the Livepeer BYOC workflow.
Most AI tools help you after the call. PitchPulse helps you during it — guiding discovery and building your pitch in real time, so you can close while others guess.
A face recognition system implemented using Principal Component Analysis (PCA) and Artificial Neural Networks (ANN) that extracts eigenfaces for dimensionality reduction and performs identity classification using a neural network model.
VoxGuard is a real-time multimodal scam detection system for live calls, built with Gemini Live API, Rust WASM audio streaming, and psychological manipulation scoring.
Add a description, image, and links to the realtime-ai topic page so that developers can more easily learn about it.
To associate your repository with the realtime-ai topic, visit your repo's landing page and select "manage topics."