Offline speech recognition API for Android, iOS, Raspberry Pi
Robust Speech Recognition via Large-Scale Weak Supervision
A fast, local neural text to speech system
Speech-to-text, text-to-speech, and speaker recognition
Comprehensive Gradio WebUI for audio processing
Speech to Text to Speech, sends text as OSC messages
A deep learning toolkit for Text-to-Speech, battle-tested in research
A modern ebook manager and reader with sync and backup
State-of-the-art TTS model under 25MB
A free, open source, and extensible speech-to-text application
A robust, efficient, low-latency speech-to-text library
Speech recognition module for Python
Qwen3-omni is a natively end-to-end, omni-modal LLM
Transcribe any audio to text, translate and edit subtitles 100% locall
A generative speech model for daily dialogue
Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML
Anki flashcards on Android
Open source personal AI Assistant for Linux, Windows and Mac
State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX
Featuring powerful AI capabilities and supporting e-book formats
Examples and guides for using the Gemini API
A block-style editor with clean JSON output
Subtitle Creation Assistant
Models for the spaCy Natural Language Processing (NLP) library
Go efficient multilingual NLP and text segmentation