Skip to content

hanifabd/voice-activity-detection-client-side

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Real-time Voice Activity Detection (VAD) Study Cases

This project focuses on Real-time Voice Activity Detection (VAD), enabling systems to detect when a user is speaking and when they are silent. By leveraging VAD, applications can achieve smarter, more human-like interactions through real-time speech analysis.

🔹 Key Features:

  • Real-time VAD – Detects voice activity with low latency.
  • Agent Interruption – Voice bots can gracefully stop talking when the user starts speaking.
  • Seamless Integration – Works with speech-to-text pipelines for live transcription.
  • Lightweight & Efficient – Optimized for real-time use cases.

🔹 Example Use Cases:

  • Simple Voice Bot: A conversational AI agent that listens and responds naturally by detecting user speech in real time.
  • Live Transcription (Real-time Transcription): Automatically captures speech-to-text while ignoring silence and background noise, making transcription more accurate and efficient.

Research and Prove of Concept - State of the Art 1

soa1-img

About

A real-time Voice Activity Detection (VAD) project that enables natural voice bot interactions with agent interruption and supports live transcription. Built with efficient speech processing and extended through Silero VAD implementation.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors