Stars
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models across text, vision, audio, and multimodal tasks, for both inference and training.
Hybrid Flow Matching and GAN with Multi-Resolution Network for Few-Step High-Fidelity Audio Generation
ASP.NET Core is a cross-platform .NET framework for building modern cloud-based web applications on Windows, Mac, or Linux.
An easy way to perform background job processing in .NET and .NET Core applications. No Windows Service or separate process required
Open-source web application framework for ASP.NET Core! Offers an opinionated architecture to build enterprise software solutions with best practices on top of the .NET. Provides the fundamental in…
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching
Fun-ASR is an end-to-end speech recognition large model launched by Tongyi Lab.
AI speech solutions for tasks such as ASR, vocal extraction, accompaniment extraction, audio denoising, and enhancement. Supports models such as paraformer, sensevoice, fireredasr, zipformer, moonsh…
Noise suppression using deep filtering
C# wrapper for kaldi-native-fbank, used to extract audio features for speech recognition (ASR) tasks
An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.
[NeurIPS 2025] PyTorch implementation of [ThinkSound], a unified framework for generating audio from any modality, guided by Chain-of-Thought (CoT) reasoning.
MiMo: Unlocking the Reasoning Potential of Language Model – From Pretraining to Posttraining
Python tool for converting files and office documents to Markdown.
A Django content management system focused on flexibility and user experience
Specification and documentation for the Model Context Protocol
Wrapper around the Google SentencePiece tokenizer. Used to tokenize text for language models and other NLP tasks.
C++ wrapper for SentencePiece with C-ABI for C# P/Invoke integration
A Conversational Speech Generation Model
💫 Industrial-strength Natural Language Processing (NLP) in Python
🚀 Catalyst is a C# Natural Language Processing library built for speed. Inspired by spaCy's design, it brings pre-trained models, out-of-the-box support for training word and document embeddings, a…
Thai natural language processing in Python

