Best Open Source Speech Software 2025

Speech Software


Browse free open source Speech software and projects below. Use the toggles on the left to filter open source Speech software by OS, license, language, programming language, and project status.

1

eSpeak: speech synthesis

Text to Speech engine for English and many other languages. Compact size with clear but artificial pronunciation. Available as a command-line program with many options, a shared library for Linux, and a Windows SAPI5 version.

40 Reviews

Downloads: 2,162 This Week

Last Update: 2021-11-17
See Project
2

Buzz

Transcribe and translate audio offline on your personal computer

Buzz transcribes and translates audio to text offline using OpenAI's Whisper. Import audio and video files into Buzz and export them as TXT, SRT, or VTT files. Buzz supports Whisper, Whisper.cpp, Faster Whisper, Whisper-compatible models from the Hugging Face repository, and the OpenAI Whisper API.

Get Linux versions from:
- https://flathub.org/apps/io.github.chidiwilliams.Buzz
- https://snapcraft.io/buzz

Home page of Buzz: https://github.com/chidiwilliams/buzz

Note for Windows: the app is not signed, so you will get a warning when you install it. Select More info -> Run anyway.

Downloads: 3,540 This Week

Last Update: 1 day ago
See Project
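
The Buzz entry above mentions exporting transcripts as SRT files. As a rough illustration of what such an export involves, the sketch below turns timed transcript segments into SRT text. It is not taken from Buzz's source code: the function names and the (start, end, text) tuple layout are assumptions made for this example.

```python
def format_srt_timestamp(seconds: float) -> str:
    """Render an offset in seconds as HH:MM:SS,mmm, the SRT timestamp form."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def segments_to_srt(segments):
    """Build SRT text from (start_seconds, end_seconds, text) tuples.

    The tuple layout is a hypothetical stand-in for whatever a
    transcription engine returns.
    """
    blocks = []
    for index, (start, end, text) in enumerate(segments, start=1):
        blocks.append(
            f"{index}\n"
            f"{format_srt_timestamp(start)} --> {format_srt_timestamp(end)}\n"
            f"{text.strip()}\n"
        )
    # Each cue ends with a newline; joining with another newline leaves
    # the blank line that separates cues in an SRT file.
    return "\n".join(blocks)


if __name__ == "__main__":
    print(segments_to_srt([(0.0, 1.5, "Hello"), (1.5, 3.0, "world")]))
```

VTT output would follow the same pattern with a `WEBVTT` header line and a period instead of a comma before the milliseconds.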
3

eGuideDog free software for the blind

eGuideDog project develops free software for the blind. Currently, we focus on WebSpeech, Ekho TTS and WebAnywhere.

16 Reviews

Downloads: 166 This Week

Last Update: 2025-12-18
See Project
4

Open JTalk

Open JTalk is a Japanese text-to-speech synthesis system. This software is released under the Modified BSD license.

Downloads: 748 This Week

Last Update: 2018-12-25
See Project
5

DeepSpeech

Open source embedded speech-to-text engine

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine that can run in real time on devices ranging from a Raspberry Pi 4 to high-power GPU servers. It uses a model trained by machine learning techniques based on Baidu's Deep Speech research paper, and the implementation is built on Google's TensorFlow. A pre-trained English model, along with other material needed for inference, can be downloaded from the DeepSpeech releases page by following the instructions in the usage docs.

Downloads: 21 This Week

Last Update: 2021-04-08
See Project
6

SpeechRecognition

Speech recognition module for Python

A library for performing speech recognition, with support for several engines and APIs, online and offline. It can recognize speech input from the microphone, transcribe an audio file, save audio data to an audio file, show extended recognition results, calibrate the recognizer energy threshold for ambient noise levels (see recognizer_instance.energy_threshold for details), and listen to a microphone in the background, among other recognizer features. The easiest way to install it is with pip install SpeechRecognition. The library requires Python 2.6, 2.7, or Python 3.3+. PyAudio is required if and only if you want to use microphone input (Microphone); version 0.2.11+ is needed, as earlier versions have known memory management bugs when recording from microphones in certain situations. To hack on this library, first make sure you have all the requirements listed in the "Requirements" section.

Downloads: 12 This Week

Last Update: 2025-11-19
See Project
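
The SpeechRecognition entry above mentions calibrating an energy threshold against ambient noise. The sketch below shows the general idea using only the standard library: measure the energy of a few background-noise frames, then treat anything noticeably louder as speech. It is an illustration of the concept, not the library's implementation; the function names and the `margin` multiplier are hypothetical.

```python
import math


def rms(samples):
    """Root-mean-square energy of a sequence of integer PCM samples."""
    if not samples:
        return 0.0
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def calibrate_threshold(ambient_frames, margin=1.5):
    """Estimate a speech threshold from frames of background noise.

    `margin` is an assumed safety multiplier: the threshold is set to
    the mean ambient energy scaled up by this factor.
    """
    if not ambient_frames:
        raise ValueError("need at least one ambient frame")
    energies = [rms(frame) for frame in ambient_frames]
    return margin * (sum(energies) / len(energies))


def is_speech(frame, threshold):
    """Treat a frame as speech when its energy exceeds the threshold."""
    return rms(frame) > threshold


if __name__ == "__main__":
    quiet_room = [[10, -10, 10, -10], [20, -20, 20, -20]]
    threshold = calibrate_threshold(quiet_room, margin=2.0)
    print(threshold, is_speech([100, -100, 100, -100], threshold))
```

A fixed multiplier is the simplest choice; a real recognizer may instead adapt the threshold continuously as the noise level drifts.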
7

Simple TTS Reader

A small clipboard reader

Simple TTS Reader is a small utility that reads text from your clipboard using Microsoft Speech API. Whenever you copy any text, the app instantly converts it into spoken words. Select your preferred speech engine from those installed on your system, such as Microsoft Zira, and adjust speed and volume for personalized playback. The application can also be minimized to the system tray. Plus, it is free and comes with an intuitive interface that makes it accessible to everyone.

4 Reviews

Downloads: 70 This Week

Last Update: 2025-10-27
See Project
8

Coqui STT

The deep learning toolkit for speech-to-text

Coqui STT is a fast, open-source, multi-platform, deep-learning toolkit for training and deploying speech-to-text models. Coqui STT is battle-tested in both production and research, and it can return multiple possible transcripts, each with an associated confidence score. Coqui also offers text-to-speech: experience the immediacy of script-to-performance, with production times going from months to minutes. With Coqui, post is a pleasure: effortlessly clone the voices of your talent and have the clone handle the problems in post. With Coqui, dubbing is a delight: clone the voice of your talent into another language and let the clone do the dub. Cast from a wide selection of high-quality, directable, emotive voices or clone a voice to suit your needs.

Downloads: 3 This Week

Last Update: 2022-09-03
See Project
9

Voxal voice changer

Transform your voice in real time with Voxal Voice Changer

Voxal Voice Changer is a program that allows you to modify your voice by applying various effects (e.g. pitch change, echo, etc.) in real-time. Effects can be added in any sequence and in any combination, allowing you to distort your voice beyond recognition. Take your audio to the next level! Our powerful Voice Changer software lets you morph your voice in real-time with stunning AI-powered quality. Whether you're looking to have fun, protect your privacy, or create engaging content, we have the perfect voice for you. Audio can be captured from various sources, pre-listening is available, and the most popular audio formats are supported.

1 Review

Downloads: 12 This Week

Last Update: 2025-11-16
See Project
10

Speech Recognition in English & Polish

Speech recognition software for English & Polish languages

Software for speech recognition in English & Polish languages.

Basic versions of SkryBot:
1. SkryBot Home Speech (English Language) - https://sourceforge.net/projects/skrybotdomowy/files/ReleasesEnglish/InstalatorSkryBotHomeSpeechDemo-2.6.9.18117.exe/download
2. SkryBot DoMowy (Polish Language) - https://sourceforge.net/projects/skrybotdomowy/files/ReleasesPolish/InstalatorSkryBotDoMowyDemo-2.4.9.18117.exe/download

More help: https://sourceforge.net/p/skrybotdomowy/wiki/

Domain advanced versions (Polish Language):
1. SkryBot Prawo - for judicial professionals.
2. SkryBot Administracyjny - for civil and government administration.
3. SkryBot Medycyna Rodzinna - for physicians.

The professional version of SkryBot (commercial) offers:
1. Audio conversion and cutting sound files into smaller ones.
2. Searching for words or phrases in sound files (recognized by SkryBot).
3. Editing sound files and automatically cutting long silent parts from an audio file.

2 Reviews

Downloads: 6 This Week

Last Update: 2020-03-15
See Project
11

Epos TTS System

Epos is a language independent rule-driven Text-to-Speech (TTS) system

Epos is a language independent rule-driven Text-to-Speech (TTS) system primarily designed to serve as a research tool. Epos is (or tries to be) independent of the language processed, linguistic description method, and computing environment.

1 Review

Downloads: 9 This Week

Last Update: 2015-03-31
See Project
12

ILA - teachable voice assistant

ILA is a fully customizable and teachable voice assistant for Java

ILA stands for (kind of) intelligent, learning assistant. It is a speech recognition system, or voice assistant, very similar to Siri, Google Now and Cortana. ILA is fully customizable, and you can teach her/him/it new things yourself, such as executing system commands, opening web pages, programs and apps, or just some basic conversation :-) ILA runs on Java and is thus compatible with Windows, Mac and Linux. It is designed to integrate with your home environment so that you can, for example, build your own free and open Amazon Echo replacement ;-) Right now the key components of ILA are the open source speech recognizer CMU Sphinx-4, Google (Speech Recognition/Text-To-Speech) and MaryTTS (Text-To-Speech). The goal is to make ILA completely free of Google by improving all aspects of the open source systems. Since version 3.3 users can also write their own add-ons to extend ILA. ILA's successor is the SEPIA Framework: https://sepia-framework.github.io/ Hope you enjoy ILA - Florian

4 Reviews

Downloads: 3 This Week

Last Update: 2018-07-23
See Project
13

JuliusModels

Open source speech models for Julius in English and other languages.

Open source speech models for the Julius speech decoder. The project aims to give a wider community of speech recognition enthusiasts access to quality models, which they can use in their own projects on different OS platforms (Unix, Windows, etc.). All of the models are based on HTK modelling software and data sets freely available on the Internet.

Downloads: 11 This Week

Last Update: 2018-05-11
See Project
14

Text to Voice

'Text to Voice' or 'Text to Speech' is one of the coolest Firefox add-ons. It gives your browser the power of speech. Select text, click the button on the bottom right, and this add-on speaks the selected text for you. Isn't it brilliant? Moreover, the audio file can be downloaded as

Downloads: 8 This Week

Last Update: 2014-04-25
See Project
15

sourcesinc

Source code from the Research Institute for Signals, Systems and Computational Intelligence: http://fich.unl.edu.ar/sinc

Downloads: 8 This Week

Last Update: 2023-12-05
See Project
16

Text2Speech

Text2Speech is a small and easy to use Text To Speech (TTS) application written in C#. It uses the Microsoft .NET Framework 2.0 to run.

3 Reviews

Downloads: 2 This Week

Last Update: 2013-04-09
See Project
17

FM2TXT

Listens to radio with an RTL-SDR dongle, recognizes speech, and writes a text log file

Log speech from your favorite FM station to a text file using an rtl-sdr dongle and speech recognition. Cross-platform tool. Follow the README on the download page for Windows installation: https://sourceforge.net/projects/fm2txt-rtlsdr/files/ If you prefer the GitHub source to SourceForge: https://github.com/randaller/fm2txt For those who want to recognize audio from a sound card rather than from rtl-sdr (this allows transcribing NFM etc.): https://github.com/randaller/souncard2txt

Downloads: 4 This Week

Last Update: 2017-12-17
See Project
18

AhoTTS Multilingual, a Multilingual TTS

Text-to-Speech TTS for Basque, Spanish, Catalan, Galician and English

Text-to-Speech converter for Basque, Spanish, Catalan, Galician and English. It includes linguistic processing and built voices for all of these languages. Its acoustic engine is based on hts_engine and it uses a high quality vocoder called AhoCoder. Developed by Aholab Signal Processing Laboratory: https://aholab.ehu.es/aholab/ http://aholab.ehu.es/ahocoder/

1 Review

Downloads: 2 This Week

Last Update: 2019-11-29
See Project
19

Italian FESTIVAL

Development of the Italian Version of FESTIVAL Text to Speech synthesis system

Downloads: 2 This Week

Last Update: 2014-05-18
See Project
20

TTS_chung_dll

A simple DLL interface for TTS text to speech (SAPI 5.1)

TTS chung dll is a very simple DLL interface for TTS text to speech (SAPI5), written in FreeBASIC for use with any programming language. It comes with C++, LibertyBasic and FreeBASIC calling examples and accepts XML tags. Example of a call: myTTSinit() myTTSspeak(@"hello") myTTSclose()

Downloads: 2 This Week

Last Update: 2020-04-01
See Project
21

rsynth - Text-to-Speech (Formant Synth)

rsynth aims to provide basic text-to-speech capability on as many platforms and for as many spoken languages as possible by formant synthesis from an International Phonetic Alphabet representation.

Downloads: 2 This Week

Last Update: 2013-03-07
See Project
22

Sayz Me

Sayz Me is a text-to-speech application for Windows. Text can be typed in or read from clipboard. Words are highlighted when spoken. Select voice, adjust reading speed, voice pitch, font and color. Simple and easy to use.

2 Reviews

Downloads: 2 This Week

Last Update: 2013-04-11
See Project
23

Osmosis TTS

Text to Speech application with searching capabilities.

Osmosis TTS is a Text-to-Speech application with a built-in browser and the ability to easily search for terms in the text using configurable search providers, including search engines and dictionaries. It is particularly useful for language learning, as one can easily look up foreign words using external dictionary websites. Text is spoken when it is copied to the clipboard, and more text can be queued by copying it to the clipboard while Osmosis TTS is speaking. Text is displayed in a large window with configurable fonts and colors for users with low vision. Other features include saving text as a WAV file, MP3 encoding using the LAME mp3 encoder, maintaining a search history, and the ability to use and configure standard SAPI TTS voices.

Downloads: 1 This Week

Last Update: 2016-10-14
See Project
24

Russian Text-to-speech programs

Chitanie, Chtenie, Govorenie

Three Russian text-to-speech programs (Chitanie, Chtenie and Govorenie) that attempt to convert Russian text into Russian speech. For Windows; they may also work on Linux through Wine. If you want to donate: paypal.me/alkbab

Downloads: 1 This Week

Last Update: 2025-11-04
See Project
25

Siomdo TTS

Indonesian Text to Speech program

Siomdo is an Indonesian text-to-speech (TTS) program. It can read Indonesian text and speak it through the computer's speakers. It can also convert text into WAV files.

Downloads: 1 This Week

Last Update: 2023-04-19
See Project