RealtimeSTT

A robust, efficient, low-latency speech-to-text library

This is an exact mirror of the RealtimeSTT project, hosted at https://github.com/KoljaB/RealtimeSTT. SourceForge is not affiliated with RealtimeSTT.

Add a Review

Downloads: 2 This Week

Last Update: 2026-05-31

Download

Get an email when there's a new version of RealtimeSTT

Windows Mac Linux BSD ChromeOS

RealtimeSTT is a Python-based realtime speech-to-text engine emphasizing low latency, wake-word detection, voice activity detection, and automatic speech segmentation. It provides asynchronous callbacks, nanosecond-precision timestamps, and CLI tools, suitable for building voice assistants, meeting transcribers, or live caption systems.

Features

Real-time transcription via microphone
Wake-word and voice-activity detection
Asynchronous callback architecture
Nanosecond timing metadata
CLI and server modes with VAD filters
Low-latency suitable for live apps

Project Samples

Project Activity

See All Activity >

License

MIT License

Follow RealtimeSTT

RealtimeSTT Web Site

Other Useful Business Software

The AI-powered unified PSA-RMM platform for modern MSPs.

Trusted PSA-RMM partner of MSPs worldwide

SuperOps.ai is the only PSA-RMM platform powered by intelligent automation and thoughtfully crafted for the new-age MSP. The platform also helps MSPs manage their projects, clients, and IT documents from a single place.

Learn More

Rate This Project

User Reviews

Be the first to post a review of RealtimeSTT!

Additional Project Details

Programming Language

Python

Related Categories

Python Speech to Text Software

Registered

2025-07-03

Similar Business Software

Google Cloud Speech-to-Text

Google Cloud’s Speech API processes more than 1 billion voice minutes per month with close to human levels of understanding for many commonly spoken languages. Powered by the best of Google's AI research and technology, Google Cloud's Speech-to-Text API helps you accurately transcribe speech...

See Software
Picovoice

Picovoice is the first and only ubiquitous on-device voice AI platform. Picovoice offers speech-to-text, voice search, wake word, Speech-to-Intent (intent detection) and voice activity detection engines. Its stack can run on anything from embedded devices to web browsers, providing an immersive...

See Software
GPT‑Realtime‑Whisper

GPT-Realtime-Whisper is OpenAI’s streaming transcription model built for low-latency speech-to-text experiences in live products. It transcribes audio as people speak, helping voice-enabled apps feel faster, more responsive, and more natural, from captions that appear in the moment to meeting...

See Software
Speechmatics

Best-in-Market Speech-to-Text & Voice AI for Enterprises. Speechmatics delivers industry-leading Speech-to-Text and Voice AI for enterprises needing unrivaled accuracy, security, and flexibility. Our enterprise-grade APIs provide real-time and batch transcription with exceptional...

See Software
Voxtral Transcribe 2

Voxtral Transcribe 2 is a next-generation family of speech-to-text models from Mistral AI that delivers ultra-low-latency, high-quality audio transcription and speaker diarization with broad language support. The suite includes Voxtral Mini Transcribe V2, optimized for batch transcription with...

See Software
Inworld Realtime STT

Inworld Realtime STT is a realtime streaming STT API that understands users beyond their words. It combines low-latency speech recognition with voice profiling, extracting emotion, vocal style, accent, age, and pitch directly from raw audio so downstream LLMs and TTS systems can respond with...

See Software

Report inappropriate content

The AI-powered unified PSA-RMM platform for modern MSPs.

Trusted PSA-RMM partner of MSPs worldwide

Learn More

Recommended Projects

WhisperX
Automatic Speech Recognition with Word-level Timestamps
Whisper
Robust Speech Recognition via Large-Scale Weak Supervision
Handy STT
A free, open source, and extensible speech-to-text application
TTS Voice Wizard
Speech to Text to Speech, sends text as OSC messages
CMU Sphinx
Speech Recognition Toolkit