Port of OpenAI's Whisper model in C/C++
ImageBind One Embedding Space to Bind Them All
PyTorch code and models for VJEPA2 self-supervised learning from video
Training Large Language Model to Reason in a Continuous Latent Space
CLIP, Predict the most relevant text snippet given an image
Physical Symbolic Optimization
Multi-Modal Neural Networks for Semantic Search, based on Mid-Fusion
Create videos with Stable Diffusion
Implementation of Video Diffusion Models
An open-source toolkit for monitoring Language Learning Models (LLMs)
PyTorch code and models for V-JEPA self-supervised learning from video
Topic Modelling for Humans
A Hyperparameter Tuning Library for Keras
Library of self-supervised methods for visual representation
InvokeAI is a leading creative engine for Stable Diffusion models
ESP32 Camera motion capture application to record JPEGs to SD card
Medical imaging toolkit for deep learning
Open Source Differentiable Computer Vision Library
PyTorch version of Stable Baselines
A Rust machine learning framework
Integrate cutting-edge LLM technology quickly and easily into your app
Superfast AI decision making and processing of multi-modal data
A distributed system for embedding-based vector retrieval
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
Deep Learning Visualization Toolkit