Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!

Python · 8,124 stars · 645 forks · Updated Jan 8, 2026

tokenbender / mHC-manifold-constrained-hyper-connections

implementations and experimentation on mHC by deepseek - https://arxiv.org/abs/2512.24880

Python · 191 stars · 14 forks · Updated Jan 4, 2026

tile-ai / TileRT

Tile-Based Runtime for Ultra-Low-Latency LLM Inference

Python · 516 stars · 25 forks · Updated Dec 23, 2025

0xPlaygrounds / rig

⚙️🦀 Build modular and scalable LLM Applications in Rust

Rust · 5,407 stars · 619 forks · Updated Jan 7, 2026

zlab-princeton / llm-pruning-collection

A collection of LLM pruning implementations, training code for GPUs and TPUs, and evaluation scripts.

Python · 49 stars · 6 forks · Updated Dec 19, 2025

yanngrecque

LLM

RiddleHe / llm-interp

tursodatabase / agentfs

microsoft / typescript-go

mll-lab-nu / VAGEN

AI-Hypercomputer / maxtext

skypilot-org / skypilot

Wenyueh / MinivLLM

kyutai-labs / jax-flash-attn3

yjyddq / DARE

togethercomputer / flash-attention-3

OpenPipe / ART

tokenbender / mHC-manifold-constrained-hyper-connections

tile-ai / TileRT

0xPlaygrounds / rig

zlab-princeton / llm-pruning-collection