RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs

Python 70,820 7,731 Updated Jan 4, 2026

upbit / GPT-SoVITS

Forked from RVC-Boss/GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 17 1 Updated Apr 27, 2024

X-PLUG / mPLUG-DocOwl

mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding

Python 2,266 132 Updated May 30, 2025

chen700564 / RGB

Python 354 36 Updated May 17, 2024

vocodedev / vocode-core

🤖 Build voice-based LLM agents. Modular + open source.

Python 3,673 646 Updated Nov 15, 2024

speechbrain / speechbrain

A PyTorch-based Speech Toolkit

Python 10,997 1,620 Updated Jan 1, 2026

jiaaro / pydub

Manipulate audio with a simple and easy high level interface

Python 9,695 1,124 Updated Jul 26, 2025

coqui-ai / xtts-streaming-server

Python 357 107 Updated Jun 26, 2024

FreedomIntelligence / AceGPT

Python 127 10 Updated Mar 3, 2024

coqui-ai / TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 44,100 5,884 Updated Aug 16, 2024

WhisperSpeech / WhisperSpeech

An Open Source text-to-speech system built by inverting Whisper.

Jupyter Notebook 4,550 263 Updated Dec 14, 2025

kpu / kenlm

KenLM: Faster and Smaller Language Model Queries

C++ 2,714 534 Updated Mar 30, 2025

facebookresearch / fastText

Library for fast text representation and classification.

HTML 26,466 4,815 Updated Mar 22, 2024

sgl-project / sglang

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 22,114 3,933 Updated Jan 4, 2026

wenet-e2e / wetts

Production First and Production Ready End-to-End Text-to-Speech Toolkit

Python 411 61 Updated Nov 20, 2025

PlayVoice / vits_chinese

Best practice TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Support ONNX streaming out!

Python 1,224 177 Updated Feb 5, 2024

innnky / emotional-vits

无需情感标注的情感可控语音合成模型，基于VITS

Jupyter Notebook 1,397 169 Updated Mar 30, 2023

ZuodaoTech / everyone-can-use-english

人人都能用英语

TypeScript 33,172 4,681 Updated Nov 25, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

vshanyiao

Block or report vshanyiao

Stars

nipponjo / tts-arabic-pytorch

sierra-research / tau-bench

lipku / LiveTalking

aiortc / aiortc

kestra-io / kestra

HLTSingapore / Emotional-Speech-Data

beyondExp / B-Llama3-o

2DIPW / gpt_sovits_infer_with_emotion

2noise / ChatTTS

metavoiceio / metavoice-src

livekit / agents

yamahigashi / Wav2Vec2FBX

infiniflow / ragflow