
-
Intuitive Creation. Inc
- Vancouver, BC, Canada
-
02:19
(UTC -07:00) - https://benjame.github.io/
🤖 Artificial Intelligence
DaVinci toolkit aims at high-quality multimedia content creation which plays an important role in modern work and life. The targeted features can include both low-level image and video enhancement …
Port of OpenAI's Whisper model in C/C++
High-Resolution Image Synthesis with Latent Diffusion Models
✨✨Latest Advances on Multimodal Large Language Models
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…
A paper and project list about the cutting edge Speech Synthesis, Text-to-Speech (TTS), Singing Voice Synthesis (SVS), Voice Conversion (VC), Singing Voice Conversion (SVC), and related interesting…
Accepted as [NeurIPS 2024] Spotlight Presentation Paper
Materials for the Learn PyTorch for Deep Learning: Zero to Mastery course.
faster_whisper GUI with PySide6
TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting.
InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models
Zero-1-to-3: Zero-shot One Image to 3D Object (ICCV 2023)
JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
A generative speech model for daily dialogue.
Instant voice cloning by MIT and MyShell. Audio foundation model.
Robust Speech Recognition via Large-Scale Weak Supervision
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
A modular graph-based Retrieval-Augmented Generation (RAG) system
Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…
Easily train a good VC model with voice data <= 10 mins!
A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.