Lists (8)
Sort Name ascending (A-Z)
Stars
Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!
Fast and memory-efficient exact attention
Official repository of DARE: dLLM Alignment and Reinforcement Executor
Based on Nano-vLLM, a simple replication of vLLM with self-contained paged attention and flash attention implementation
SkyRL: A Modular Full-stack RL Library for LLMs
An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models
OpenTinker is an RL-as-a-Service infrastructure for foundation models
Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, 20+ clouds, or on-prem).
A simple, performant and scalable Jax LLM!
🚀 Efficient implementations of state-of-the-art linear attention models
Official code of paper "DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution"
NUS CS5242 Neural Networks and Deep Learning, Xavier Bresson, 2025
Standardized environment infrastructure for Agentic AI development.
Training VLM agents with multi-turn reinforcement learning
Multi-Turn RL Training System with AgentTrainer for Language Model Game Reinforcement Learning
Home for "How To Scale Your Model", a short blog-style textbook about scaling LLMs on TPUs
Chai-1, SOTA model for biomolecular structure prediction
CUDA Tile IR is an MLIR-based intermediate representation and compiler infrastructure for CUDA kernel optimization, focusing on tile-based computation patterns and optimizations targeting NVIDIA te…
Sharp Monocular View Synthesis in Less Than a Second
tukuaiai / vibe-coding-cn
Forked from EnzeD/vibe-coding我的开发经验+提示词库=vibecoding工作站;My development experience + prompt dictionary = Vibecoding workstation;ניסיון הפיתוח שלי + מילון פרומפטים = תחנת עבודה Vibecoding;私の開発経験 + プロンプト辞書 = Vibecoding ワークステーション;나…
VLA-0: Building State-of-the-Art VLAs with Zero Modification
JAX bindings for the flash-attention3 kernels
A blazingly fast, open-source application server with type-safe APIs, built-in WebAssembly runtime, realtime, auth, and admin UI built on Rust, SQLite & Wasmtime.