Clone a voice in 5 seconds to generate arbitrary speech in real-time
Instant voice cloning by MIT and MyShell. Audio foundation model
Visual inspection tool for .dsk Spectrum/Amstrad disk images
A sound cloning tool with a web interface, using your voice
Clone with Python! Data structures for double stranded DNA
Generate audiobooks from e-books, voice cloning & 1107+ languages
A simple, high-quality voice conversion tool focused on ease of use
1 min voice data can also be used to train a good TTS model
Industrial-level controllable zero-shot text-to-speech system
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Official PyTorch Implementation
Community-maintained approach to improving access to GitHub services
Git extension for versioning large files
Easy-to-use Speech Toolkit including Self-Supervised Learning model
Open-source framework for intelligent speech interaction
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
The official Python SDK for the ElevenLabs API
Multi-lingual large voice generation model, providing inference
Create deep copies (clones) of your objects
Video translation and dubbing tool powered by LLMs
MARS5 speech model (TTS) from CAMB.AI
Real-time voice interactive digital human
Comprehensive Gradio WebUI for audio processing
A deep learning toolkit for Text-to-Speech, battle-tested in research
Foundational model for human-like, expressive TTS