PyTorch code and models for VJEPA2 self-supervised learning from video
CLIP, Predict the most relevant text snippet given an image
Physical Symbolic Optimization
Multi-Modal Neural Networks for Semantic Search, based on Mid-Fusion
Implementation of Video Diffusion Models
Create videos with Stable Diffusion
An open-source toolkit for monitoring Language Learning Models (LLMs)
PyTorch code and models for V-JEPA self-supervised learning from video
Topic Modelling for Humans
A Hyperparameter Tuning Library for Keras
Library of self-supervised methods for visual representation
InvokeAI is a leading creative engine for Stable Diffusion models
Medical imaging toolkit for deep learning
PyTorch version of Stable Baselines
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
Superfast AI decision making and processing of multi-modal data
Implementation of Recurrent Interface Network (RIN)
A fast library for AutoML and tuning
A toolkit to optimize ML models for deployment for Keras & TensorFlow
Diffusion Transformer with Fine-Grained Chinese Understanding
Language modeling in a sentence representation space
Implementation of the Surya Foundation Model for Heliophysics
Proofs, cases, concept supplements, and reference explanations
Generate 3D objects conditioned on text or images
Di♪♪Rhythm: Blazingly Fast & Simple End-to-End Song Generation