faster_whisper GUI with PySide6
Updated Dec 8, 2024 - Python
Veldra: talk an agent into existence, then watch it grow. A self-hostable, local-first agent platform: describe what you need in plain language and it compiles a working agent (tools, MCP, RAG, teams). The more you use it, the better it gets: agents learn from your feedback and reshape as you talk.
VOXD is speech-to-text, voice-typing, and dictation software for Linux. It is open-source, free-of-charge, user-friendly software intended for as many Linux distros as possible.
Telegram bridge for the Pi coding agent — continue sessions from your phone with voice, images, and handback
Instant dictation app for Mac
Fix the 2-5 second push-to-talk activation delay on macOS. Keeps microphone hardware awake for instant voice transcription with AirPods, Bluetooth headsets, and built-in mic. Works with SuperWhisper, WhisperFlow, Wispr Flow, and any push-to-talk app on Apple Silicon (M1/M2/M3/M4).
NotesGPT 📋✨ seamlessly converts your voice notes into organized summaries and clear action items using AI 🤖.
GPU-accelerated local voice transcriber with Whisper.cpp and Gemini API refinement for developers
The first Minecraft AI that doesn't just talk—it lives in your world. High-performance Gemini-driven cognitive companion for Fabric 1.21.1.
An open-source, educational app for speech-to-text & text translation that runs entirely in your browser: record or upload audio, transcribe with Whisper-style models, translate into many languages, & export results—without sending audio or transcripts to an app server for processing. Heavy ML runs in Web Workers via Transformers.js & ONNX Runtime
🎙️ VaibVoice – Real-time AI voice transcription with smart formatting and full customization.
Turn your Android phone into an MCP (Model Context Protocol) server. AI agents and desktop scripts can call your phone for live data and actions over LAN.
Local dual-chat workspace for comparing ChatGPT and Ollama, storing side-by-side replies, and building training-oriented datasets.
WhatsApp channel plugin for Claude Code — connect your WhatsApp to an AI agent. QR scan, voice transcription, access control, media support.
Telegram client for OpenCode with multi-user RBAC, task queues, sandboxing, 40+ commands, and voice support. Run AI coding tasks from your phone.
Browser-based audio transcription using Whisper and Transformers.js, storing results in Backblaze B2 Cloud Storage.
Desktop application for automatic audio and video file transcription using the OpenAI Whisper / faster-whisper model. Written by Artificial Intelligence
A lightweight voice transcription bot for Telegram.
AI-based RTP listener
AI Meeting Logger - Automate Work Hours with Voice AI