MLX-Audio

MLX-Audio is a speech library built on Apple’s MLX framework and optimized for Apple Silicon machines (M-series Macs). It focuses on text-to-speech and speech-to-speech workflows, with APIs and a command-line interface that make it easy to generate high-quality audio from text. Because it uses MLX and targets Apple Silicon, inference is fast and can take advantage of hardware acceleration and quantization for efficient on-device performance. The project provides a straightforward CLI (mlx_audio.tts.generate) as well as a Python API for programmatic generation of audio, including parameters for voice choice, speed, language hints, output format, and sample rate. It includes examples such as audiobook generation to demonstrate long-form synthesis and joined audio segments. On top of that, MLX-Audio offers a modern web interface powered by FastAPI, with real-time waveform and 3D visualizations, file upload, and audio management.

Features

Apple-Silicon-optimized speech library built on MLX for fast local inference
Command-line and Python APIs for TTS with controls for voice, speed, language, and output format
Web interface with live waveform and 3D audio visualization plus file upload and playback
Support for multi-language text-to-speech and transcription scenarios
Quantization options and efficient model loading for better performance on resource-limited devices
Built-in examples for tasks like audiobook creation and integrated FastAPI server for HTTP access

Project Samples

Project Activity

See All Activity >

License

MIT License

Follow MLX-Audio

MLX-Audio Web Site

Other Useful Business Software

MyQ Print Management Software

SAVE TIME WITH PERSONALIZED PRINT SOLUTIONS

Boost your digital or traditional workplace with MyQ’s secure print and scan solutions that respect your time and help you focus on what you do best.

Learn More

Rate This Project

User Reviews

Be the first to post a review of MLX-Audio!

Additional Project Details

Operating Systems

Linux, Mac, Windows

Programming Language

Python

Related Categories

Python Text to Speech Software

Registered

2025-11-28

Similar Business Software

Google Cloud Speech-to-Text

Google Cloud’s Speech API processes more than 1 billion voice minutes per month with close to human levels of understanding for many commonly spoken languages. Powered by the best of Google's AI research and technology, Google Cloud's Speech-to-Text API helps you accurately transcribe speech...

See Software
Murf AI

Murf API is an advanced text-to-speech (TTS) solution that transforms written text into natural, lifelike voiceovers with remarkable accuracy and ease. It empowers developers and businesses with a suite of sophisticated features, including pitch and speed modulation, audio duration adjustments,...

See Software
Piper TTS

Piper is a fast, local neural text-to-speech (TTS) system optimized for devices like the Raspberry Pi 4, designed to deliver high-quality speech synthesis without relying on cloud services. It utilizes neural network models trained with VITS and exported to ONNX Runtime, enabling efficient and...

See Software
TheTechBrain AI

A comprehensive suite of AI-powered solutions designed to enhance productivity and streamline workflows. Available as a convenient app on both iOS and the Google Play Store, Smart AI Tools offers a wide range of features and capabilities. Here's what you can expect: AI Templates: Access a...

See Software
BeyondWords

BeyondWords is the AI voice platform that brings frictionless audio publishing to writers, newsrooms, and businesses. Every user gets access to 550+ lifelike AI voices across 140+ language locales, and there's the option to commission custom voices. Users can sync their CMS using the API, RSS...

See Software
TTSynth

TTSynth is a free online TTS maker. Type or paste your text into the TTS maker input box to start the conversion process using TTS AI. Choose the language and voice from our TTS online options for the desired accent and tone. Click 'generate' to create the speech and download the TTS MP3 file....

See Software

Report inappropriate content