speech-detection

Star

Here are 35 public repositories matching this topic...

smacke / ffsubsync

Sponsor

Star

Automagically synchronize subtitles with video.

Updated Jun 7, 2026
Python

ina-foss / inaSpeechSegmenter

Star

CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.

Updated Mar 12, 2026
Python

gkonovalov / android-vad

Star

Android Voice Activity Detection (VAD) library. Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.

Updated Jul 15, 2025
C

filippogiruzzi / voice_activity_detection

Star

Voice Activity Detection based on Deep Learning & TensorFlow

python machine-learning deep-neural-networks deep-learning time-series speech pytorch artificial-intelligence speech-recognition vad resnet deeplearning time-series-classification voice-activity-detection librispeech speech-detection librispeech-dataset mfcc-features

Updated May 29, 2026
Python

gtreshchev / RuntimeSpeechRecognizer

Star

Cross-platform, real-time, offline speech recognition plugin for Unreal Engine. Based on Whisper OpenAI technology, whisper.cpp.

voice-recognition speech-recognition openai unreal-engine ue4 speech-to-text whisper speech-processing audio-processing unreal-engine-4 ue4-plugin speech-detection whis ue5 unreal-engine-5 ue5-plugin whisper-cpp whisper-ai

Updated Feb 23, 2025
C++

tympanix / subsync

Star

Synchronize your subtitles using machine learning

machine-learning neural-network delay subtitles subtitle fix mfcc shift subsync speech-detection shift-subtitle

Updated Sep 18, 2023
Python

edusense / edusense

Star

EduSense: Practical Classroom Sensing at Scale

audio teachers classroom tracking machine-learning computer-vision pedagogy instructors posture gaze sensing speech-detection hand-raise

Updated Oct 28, 2024
Python

baochuquan / ios-vad

Star

iOS Voice Activity Detection (VAD). Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.

Updated Nov 14, 2024
Swift

bbc / bbc-speech-segmenter

Star

A complete speech segmentation system using Kaldi and x-vectors for voice activity detection (VAD) and speaker diarisation.

automatic-speech-recognition speech-to-text voice-activity-detection speech-detection x-vectors endpoint-detection

Updated Jun 17, 2024
Shell

sepnic / litevad

Star

Speech-end detection library, based on WebRTC's VAD engine

webrtc voice-activity-detection speech-detection

Updated May 10, 2025
C

wavekat / wavekat-vad

Star

Voice Activity Detection library for Rust with a unified trait interface over multiple backends (WebRTC VAD, Silero). Includes vad-lab, a web-based tool for live experimentation and comparison.

audio rust vad audio-processing voice-activity-detection webrtc-vad speech-detection silero-vad ten-vad wavekat fireredvad

Updated Jun 4, 2026
Rust

PranavPutsa1006 / Speaker-Diarization

Star

Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python

deep-learning neural-networks speech-to-text mfcc speaker-diarization spectral-clustering voice-activity-detection speech-segmentation speech-detection speech-transcription embeddings-extraction

Updated Jun 18, 2023
Jupyter Notebook

PINTO0309 / VSDLM

Sponsor

Star

Visual only speech detection by lip movement. There are countless situations where you can't hear the audio, and it's really frustrating.

pytorch tensorrt onnx speech-detection posture-recognition

Updated Apr 17, 2026
Python

wavekat / wavekat-lab

Star

Developer experimentation tools for the WaveKat libraries. Includes vad-lab, a web-based tool for testing and comparing VAD backends side by side.

audio rust voice developer-tools vad audio-processing speech-detection voice-ai wavekat

Updated Jun 4, 2026
Jupyter Notebook

isbendiyarovanezrin / SpeechDetection

Star

Speech Detection 💬

vanilla-javascript web-speech-api speech-recognition javascript30 speech-to-text speech-detection

Updated Mar 22, 2022
CSS

pocketpiglet / pocketpiglet-ios

Star

PocketPiglet for iOS

game ios qt multimedia qml voice qt5 animations pet vad talking voice-activity-detection speech-detection

Updated Nov 29, 2022
QML

sepnic / vadrecorder

Star

VadRecorder based webrtc's VAD engine and vo-aac encoder, recording valid speech and discarding silence/noise data

webrtc audio-recorder audio-encoder speech-detection

Updated Jun 21, 2024
C++

AmosLoVerde / autospeechcut

Star

CLI Python basata su AI per rimuovere automaticamente silenzi e segmenti senza parlato dai video, utilizzando Silero VAD e FFmpeg.

python automation ffmpeg mp4 video-processing vad video-editing audio-processing cli-tool voice-activity-detection speech-detection silence-removal silero-vad

Updated Mar 21, 2026
Python

rogerchappel / bargekit

Star

Local-first VAD, barge-in, and turn-taking primitives for interruptible voice agents.

duplex microphone vad agents voice-ui speech-detection local-first turn-taking voice-agent barge-in echo-guard

Updated Jun 9, 2026
JavaScript

Otosaku / NeMoVAD-iOS

Star

Swift library for Voice Activity Detection (VAD) using NVIDIA NeMo MarbleNet model converted to CoreML. Detect speech segments in real-time on iOS/macOS with high accuracy and low latency.

macos swift ios real-time speech-recognition vad spm audio-processing swift-package voice-activity-detection coreml speech-detection on-device-ml nvidia-nemo marblenet

Updated Jun 4, 2026
Swift

Improve this page

Add a description, image, and links to the speech-detection topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the speech-detection topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

speech-detection

Here are 35 public repositories matching this topic...

smacke / ffsubsync

ina-foss / inaSpeechSegmenter

gkonovalov / android-vad

filippogiruzzi / voice_activity_detection

gtreshchev / RuntimeSpeechRecognizer

tympanix / subsync

edusense / edusense

baochuquan / ios-vad

bbc / bbc-speech-segmenter

sepnic / litevad

wavekat / wavekat-vad

PranavPutsa1006 / Speaker-Diarization

PINTO0309 / VSDLM

wavekat / wavekat-lab

isbendiyarovanezrin / SpeechDetection

pocketpiglet / pocketpiglet-ios

sepnic / vadrecorder

AmosLoVerde / autospeechcut

rogerchappel / bargekit

Otosaku / NeMoVAD-iOS

Improve this page

Add this topic to your repo