Voice-Pro is a Gradio WebUI for transcription, translation, and text-to-speech. It installs with one click: the installer creates a virtual environment using Miniconda that runs completely separate from the Windows system, so the setup is fully portable. It supports real-time transcription and translation as well as batch mode.

Features

  • YouTube Downloader: Download YouTube videos and extract the audio (mp3, wav, flac)
  • Vocal Remover: Voice separation using MDX-Net (as supported in UVR5) and the Demucs engine developed by Meta
  • STT: Speech-to-text conversion with Whisper, Faster-Whisper, and whisper-timestamped
  • Translator: Google Translator for short text translation and subtitle file translation
  • TTS: Text-to-speech with Edge-TTS, plus E2 and F5-TTS, which support zero-shot voice cloning
  • Celeb voices: Provided for free in the F5-TTS tab. Try creating your own podcast
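
The STT and subtitle-translation features above suggest a pipeline in which timestamped speech segments are written out as a subtitle file. The following is a minimal sketch of that step, not Voice-Pro's own code. It assumes segments shaped like those returned by Whisper-style engines (dictionaries with `start`, `end`, and `text` keys); the function names `format_timestamp` and `segments_to_srt` and the optional `translate` callback are hypothetical and introduced only for illustration.

```python
def format_timestamp(seconds: float) -> str:
    """Convert a time in seconds to the SRT timestamp form HH:MM:SS,mmm."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def segments_to_srt(segments, translate=None):
    """Render timestamped segments as SRT text.

    segments:  iterable of dicts with 'start', 'end' (seconds) and 'text'.
    translate: optional callable applied to each segment's text
               (hypothetical hook; any str -> str function works).
    """
    blocks = []
    for index, seg in enumerate(segments, start=1):
        text = seg["text"].strip()
        if translate is not None:
            text = translate(text)
        start = format_timestamp(seg["start"])
        end = format_timestamp(seg["end"])
        # Each SRT cue is: index, time range, text, then a blank line.
        blocks.append(f"{index}\n{start} --> {end}\n{text}\n")
    return "\n".join(blocks)


if __name__ == "__main__":
    demo = [
        {"start": 0.0, "end": 1.25, "text": " Hello there."},
        {"start": 1.25, "end": 3.0, "text": " Welcome to the demo."},
    ]
    print(segments_to_srt(demo))
```

Passing a translation function as `translate` keeps the timing untouched while replacing only the text, which is the usual way a subtitle file is translated without re-running transcription.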

Categories

Text to Speech

License

MIT License

User Ratings

5 stars: 1
4 stars: 0
3 stars: 0
2 stars: 0
1 star: 0

ease: 5 / 5
features: 5 / 5
design: 5 / 5
support: 5 / 5

User Reviews

  • Tried Voice-Pro on my RTX 3080 desktop. The quality is truly excellent, and it includes voice cloning capabilities using F5-TTS and CosyVoice. The installation was very simple, and the usage is quite intuitive, so I think it's worth a try. Before installing this project, I checked their YouTube demo video, and I was able to achieve the same results on my desktop as shown in the demo. It offers transcription, translation, Edge-TTS, and Kokoro through the Gradio WebUI. It's a great tool for YouTube creators. I hope you find it helpful.

Additional Project Details

Operating Systems

Linux, Mac, Windows

Programming Language

Python

Related Categories

Python Text to Speech Software

Registered

2024-11-27