Voice-Pro is the best gradio WebUI for transcription, translation and text-to-speech. It can be easily installed with one click. Create a virtual environment using Miniconda, running completely separate from the Windows system (fully portable). Supports real-time transcription and translation, as well as batch mode.
Features
- YouTube Downloader: You can download YouTube videos and extract the audio (mp3, wav, flac)
- Vocal Remover: Use MDX-Net supported in UVR5 and the Demucs engine developed by Meta for voice separation
- STT: Supports speech-to-text conversion with Whisper, Faster-Whisper, and whisper-timestamped
- Translator: Google Translator. Short text translation, subtitle file translation
- TTS: Text to Speech. Edge-TTS. E2 and F5-TTS that support zero-shot voice cloning
- We provide Celeb voices for free. Try creating your own podcast. You can check it in the F5-TTS tab
