Showing 165 open source projects for "arabic speech to text"

View related business solutions
  • Auth0 for AI Agents now in GA Icon
    Auth0 for AI Agents now in GA

    Ready to implement AI with confidence (without sacrificing security)?

    Connect your AI agents to apps and data more securely, give users control over the actions AI agents can perform and the data they can access, and enable human confirmation for critical agent actions.
    Start building today
  • Total Network Visibility for Network Engineers and IT Managers Icon
    Total Network Visibility for Network Engineers and IT Managers

    Network monitoring and troubleshooting is hard. TotalView makes it easy.

    This means every device on your network, and every interface on every device is automatically analyzed for performance, errors, QoS, and configuration.
    Learn More
  • 1
    SpeechRecognition

    SpeechRecognition

    Speech recognition module for Python

    Library for performing speech recognition, with support for several engines and APIs, online and offline. Recognize speech input from the microphone, transcribe an audio file, save audio data to an audio file. Show extended recognition results, calibrate the recognizer energy threshold for ambient noise levels (see recognizer_instance.energy_threshold for details). Listening to a microphone in the background, various other useful recognizer features. The easiest way to install this is using...
    Downloads: 12 This Week
    Last Update:
    See Project
  • 2
    Translate-Subtitle-File

    Translate-Subtitle-File

    Subtitle Creation Assistant

    ...You can configure your own API Key to use your own account's free quota, such as Tencent's free translation quota of 5 million characters per month, IBM's 500-minute speech-to-text free quota (tern. best The domain name has expired and I don't want to renew it.) Azure speech-to-text and DeepL free version have problems, it is normal to not use it, please wait for the next version to fix. Machine translation of subtitle files, use machine translation to process files.
    Downloads: 12 This Week
    Last Update:
    See Project
  • 3
    Koodo Reader

    Koodo Reader

    A modern ebook manager and reader with sync and backup

    ...Customize the source folder and synchronize among multiple devices using OneDrive, iCloud, Dropbox, etc. Single-column, two-column, or continuous scrolling layouts. Text-to-speech, translation, progress slider, touch screen support, batch import. Add bookmarks, notes, highlights to your books. Adjust font size, font family, line-spacing, paragraph spacing, background color, text color, margins, and brightness. Night mode and theme color. Text highlight, underline, boldness, italics and shadow. Adjust font size, font family, line-spacing, paragraph spacing, background color, text color, margins, and brightness.
    Downloads: 26 This Week
    Last Update:
    See Project
  • 4
    Moshi

    Moshi

    A speech-text foundation model for real time dialogue

    Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec. Mimi processes 24 kHz audio, down to a 12.5 Hz representation with a bandwidth of 1.1 kbps, in a fully streaming manner (latency of 80ms, the frame size), yet performs better than existing, non-streaming, codecs like SpeechTokenizer (50 Hz, 4kbps), or SemantiCodec (50 Hz, 1.3kbps).
    Downloads: 0 This Week
    Last Update:
    See Project
  • Winningbidder.com provides free online auction software for nonprofits. Icon
    Winningbidder.com provides free online auction software for nonprofits.

    Simple and Free Online Auction Software

    Our software is simple to use, allowing organizers to launch an auction in minutes. That doesn't mean it lacks features. Bidders enjoy max bidding, outbid notifications, and popcorn bidding.
    Learn More
  • 5

    Omilo - a text to speech application

    Omilo is a simple text to speech application

    Omilo is a simple text to speech application for Windows and Linux using Festival, Flite, Marytts and Piper voices.
    Leader badge
    Downloads: 3 This Week
    Last Update:
    See Project
  • 6

    Russian Text-to-speech programs

    читание, чтение, говорение

    For Windows (on Linux trought Wine can work) 3 russian text-to-speech programs (Chitanie, Chtenie and Govorenie). If you want donate. paypal.me/alkbab Читание, Чтение, Говорение есть программы пробующие преобразовать русский текст в русскую речь . Для Windows. На Linux через Wine... Кто хочет может пожертвовать paypal.me/alkbab
    Leader badge
    Downloads: 3 This Week
    Last Update:
    See Project
  • 7
    Buzz

    Buzz

    Transcribe and translate audio offline on your personal computer

    Buzz transcribes and translates audio to text offline using OpenAI's Whisper. Import audio and video files into Buzz and export them as TXT, SRT, or VTT files. Buzz supports Whisper, Whisper.cpp, Faster Whisper, Whisper-compatible models from the Hugging Face repository, and the OpenAI Whisper API. Get linux versions from: - https://flathub.org/apps/io.github.chidiwilliams.Buzz - https://snapcraft.io/buzz Home page of Buzz https://github.com/chidiwilliams/buzz Note for...
    Leader badge
    Downloads: 3,735 This Week
    Last Update:
    See Project
  • 8
    Simple TTS Reader

    Simple TTS Reader

    A small clipboard reader

    Simple TTS Reader is a small utility that reads text from your clipboard using Microsoft Speech API. Whenever you copy any text, the app instantly converts it into spoken words. Select your preferred speech engine from those installed on your system, such as Microsoft Zira, and adjust speed and volume for personalized playback. The application can also be minimized to the system tray.
    Leader badge
    Downloads: 92 This Week
    Last Update:
    See Project
  • 9
    Anx Reader

    Anx Reader

    Featuring powerful AI capabilities and supporting e-book formats

    ...It supports major formats (EPUB, MOBI, AZW3, FB2, TXT) and integrates powerful AI tools for summarizing and intelligent navigation via OpenAI, Claude, Gemini, and DeepSeek. Anx also syncs progress, notes, and highlights over WebDAV, and offers rich analytics—including heatmaps and exportable reading summaries. UI customization and text-to-speech enhance the reading experience.
    Downloads: 14 This Week
    Last Update:
    See Project
  • Notifii Track, a cloud-based software solution designed to streamline package management Icon
    Notifii Track, a cloud-based software solution designed to streamline package management

    We serve three primary verticals: offices and corporate buildings, multifamily housing and colleges and universities.

    Notifii Track is a cloud-based package tracking software for apartment offices, university mailrooms, and corporate mailrooms. Quickly and easily log packages as you receive them. Automatically alert the recipient via email and text message. Capture signature proof-of-pickup/delivery. Notifii Track saves you time (literally, just a few seconds to scan a package) and increases package accountability. Use Notifii Track through your web browser, or on any iOS/Android device. 30-day free trial available.
    Learn More
  • 10
    Podcastfy.ai

    Podcastfy.ai

    Transforming Multimodal Content into Captivating Multilingual Audio

    Podcastfy is an open-source Python package that transforms multi-modal content (text, images) into engaging, multi-lingual audio conversations using GenAI. Input content includes websites, PDFs, youtube videos as well as images. Unlike UI-based tools focused primarily on note-taking or research synthesis (e.g. NotebookLM), Podcastfy focuses on the programmatic and bespoke generation of engaging, conversational transcripts and audio from a multitude of multi-modal sources enabling...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11

    Image To Text tools

    ITTT is a Free tool designed to Scan and extract Text from Images.

    Image To Text Tools is a 100% Free user-friendly tool designed to Scan and extract containing text in images into editable text formats. Whether you need to extract text from scanned documents, photographs, or other image files, Image To Text Tools provides accurate and reliable Optical Character Recognition (OCR) capabilities to meet your needs.
    Downloads: 30 This Week
    Last Update:
    See Project
  • 12
    react-use

    react-use

    Component for React

    ...Tracks mouse hover state of some element. Display an element or video full-screen. Tracks location hash value. Tracks whether user is being inactive. Tracks an HTML element's intersection. Synthesizes speech from a text string. Tracks page navigation bar location state. Re-renders component, while tweening a number from 0 to 1. Tracks long press gesture of some element. Tracks state of a CSS media query. Tracks state of connected hardware devices. Returns a callback, which re-renders component when called. Tracks state of device's motion sensor. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    RemoteTTS

    RemoteTTS

    Tool to remotely activate Text-To-Speech (TTS) on a server

    The tool provides a simple TCP/UDP interface to let a remote machine perform TTS outputs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    eGuideDog free software for the blind
    eGuideDog project develops free software for the blind. Currently, we focus on WebSpeech, Ekho TTS and WebAnywhere.
    Leader badge
    Downloads: 141 This Week
    Last Update:
    See Project
  • 15
    Sintetizador GMS

    Sintetizador GMS

    Herramienta para Sintetizar voces en la PC

    Herramienta para escuchar documentos, puede copiar y pegar, guardar documentos y generar audio hablado.
    Leader badge
    Downloads: 2 This Week
    Last Update:
    See Project
  • 16
    htmid

    htmid

    Generative Music For Beginners and Everyone Else

    Generative music is a fascinating and innovative approach to music creation that involves creating procedurally generated music that evolves and changes over time. Whether you're a beginner or a seasoned musician, this guide will introduce you to the world of generative music and show you how to create your own live music performances. Generative music is music that is ever-changing and created in real-time. It can be created by anyone, with or without musical experience. Learn how to...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 17
    Tux Paint

    Tux Paint

    An award-winning drawing program for children of all ages

    Tux Paint is a free, award-winning drawing program originally created for children ages 3 to 12, but enjoyed by all! It combines an easy-to-use interface, fun sound effects, and an encouraging cartoon mascot who guides children as they use the program. You're presented with a blank canvas and a variety of drawing tools to help them be creative. Along with paintbrush, shapes and text, Tux Paint includes a "stamp" feature to add pre-drawn or photographic imagery to pictures, and a set of...
    Leader badge
    Downloads: 13,660 This Week
    Last Update:
    See Project
  • 18
    Time_limit

    Time_limit

    A windowed/full-screen countdown timer.

    A windowed / full-screen countdown timer. Colour and font size changes are used as warnings. Progress-bar gives a glance at the time stream. Three different modes are available: - time left; - time passed; - ordinary clock. When the time is over several possibilities are available: - to show the defined message; - to continue count the time; - to launch another application; - to close the count-down timer. Useful for speech, lecture or presentation timing. Colour / font...
    Downloads: 12 This Week
    Last Update:
    See Project
  • 19
    VATSG

    VATSG

    Video automatic transcribe and translated subtitle generator

    It generates srt format subtitle from videofile which can be any source language that whisper support , and then make translated subtitle file of your target language which deepl support. This is the subtitle generator(VATSG) which use [moviepy](https://github.com/Zulko/moviepy) to generate mp3 and then use [faster-whisper](https://github.com/guillaumekln/faster-whisper) to get text recognition and then use deepl-api to generate your target language subtitle file(srt format) If you...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 20
    Coqui STT

    Coqui STT

    The deep learning toolkit for speech-to-text

    ...With Coqui, dubbing is a delight. Effortlessly clone the voice of your talent into another language and let the clone do the dub. With text-to-speech, experience the immediacy of script-to-performance. Cast from a wide selection of high-quality, directable, emotive voices or clone a voice to suit your needs. With Coqui text-to-speech, production times go from months to minutes.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 21
    RHVoice

    RHVoice

    Free open source speech synthesizer for Russian and other languages

    RHVoice is a free and open-source multilingual speech synthesizer. Its developers hope to give more visually impaired people the ability to use a good free synthesis voice reading in their native language with their screen reader. We are especially interested in supporting those languages for which there are currently no good voices that could be used with a screen reader. The creator of RHVoice, Olga Yakovleva, is blind herself. Many of the contributors to the RHVoice project, both...
    Downloads: 22 This Week
    Last Update:
    See Project
  • 22
    Gnome Subtitles
    Gnome Subtitles is a subtitle editor for the GNOME desktop. It supports the most common text-based subtitle formats, video previewing, timings synchronization and subtitle translation.
    Leader badge
    Downloads: 16 This Week
    Last Update:
    See Project
  • 23

    AhoTTS - TTS for Basque and Spanish

    Text-to-Speech for Basque and Spanish

    Text-to-Speech conversor for Basque and Spanish. It includes linguistic processing and built voices for the languages aforementioned. Its acoustic engine is based on hts_engine and it uses a high quality vocoder called AhoCoder. Developed by Aholab Signal Processing Laboratory: https://aholab.ehu.es/aholab/ http://aholab.ehu.es/ahocoder/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    AugLy

    AugLy

    A data augmentations library for audio, image, text, and video

    ...We designed AugLy to include many specific data augmentations that users perform in real life on internet platforms like Facebook's -- for example making an image into a meme, overlaying text/emojis on images/videos, reposting a screenshot from social media. While AugLy contains more generic data augmentations as well, it will be particularly useful to you if you're working on a problem like copy detection, hate speech detection, etc.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Capture2Text

    Capture2Text

    Quickly OCR part of the screen and save resulting text to clipboard

    Capture2Text enables users to quickly OCR a portion of the screen using a keyboard shortcut. The resulting text will be saved to the clipboard by default. Supports 90+ languages including Chinese, English, French, German, Japanese, Korean, Russian, and Spanish. Portable and does not require installation. See http://capture2text.sourceforge.net for details.
    Leader badge
    Downloads: 2,668 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next