Skip to content
View benjame's full-sized avatar
:octocat:
Developing a Creativity LLM Application
:octocat:
Developing a Creativity LLM Application

Block or report benjame

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

🤖 Artificial Intelligence

27 repositories

DaVinci toolkit aims at high-quality multimedia content creation which plays an important role in modern work and life. The targeted features can include both low-level image and video enhancement …

141 16 Updated Aug 9, 2022

Port of OpenAI's Whisper model in C/C++

C++ 40,207 4,268 Updated May 23, 2025

High-Resolution Image Synthesis with Latent Diffusion Models

Python 41,018 5,249 Updated Oct 10, 2024

✨✨Latest Advances on Multimodal Large Language Models

15,293 989 Updated May 15, 2025

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 9,088 718 Updated Apr 12, 2025

A paper and project list about the cutting edge Speech Synthesis, Text-to-Speech (TTS), Singing Voice Synthesis (SVS), Voice Conversion (VC), Singing Voice Conversion (SVC), and related interesting…

429 31 Updated Sep 28, 2022

Accepted as [NeurIPS 2024] Spotlight Presentation Paper

Jupyter Notebook 6,292 635 Updated Sep 26, 2024

Materials for the Learn PyTorch for Deep Learning: Zero to Mastery course.

Jupyter Notebook 13,845 3,876 Updated Jan 7, 2025

faster_whisper GUI with PySide6

Python 2,412 141 Updated Dec 8, 2024

Build AI Agents, Visually

TypeScript 38,958 20,109 Updated May 25, 2025

TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting.

Python 4,718 440 Updated May 23, 2025

InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models

Python 3,866 423 Updated Jan 3, 2025

Zero-1-to-3: Zero-shot One Image to 3D Object (ICCV 2023)

Python 2,892 207 Updated Dec 5, 2023

JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf

Python 24,150 2,020 Updated Sep 26, 2024

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 46,746 5,146 Updated Apr 25, 2025

A generative speech model for daily dialogue.

Python 36,328 3,926 Updated May 23, 2025

Instant voice cloning by MIT and MyShell. Audio foundation model.

Python 32,371 3,352 Updated Apr 19, 2025

Robust Speech Recognition via Large-Scale Weak Supervision

Python 82,197 9,914 Updated May 13, 2025

LLM inference in C/C++

C++ 80,825 11,895 Updated May 25, 2025

Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.

Python 32,004 4,302 Updated May 25, 2025

Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.

TypeScript 98,257 27,200 Updated May 23, 2025

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Python 175,563 45,732 Updated May 25, 2025

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 25,384 2,582 Updated May 23, 2025

Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…

Jupyter Notebook 17,351 2,489 Updated May 22, 2025

A Fast TTS Engine

Python 501 38 Updated Jan 23, 2025

Easily train a good VC model with voice data <= 10 mins!

Python 29,556 4,140 Updated Nov 24, 2024

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.

Python 2,770 369 Updated May 23, 2025