Skip to content
View yanngrecque's full-sized avatar

Block or report yanngrecque

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!

Python 8,095 644 Updated Jan 3, 2026

🐹 Deep clean and optimize your Mac.

Shell 24,788 658 Updated Jan 3, 2026

Fast and memory-efficient exact attention

Python 27 1 Updated Dec 2, 2024

Official repository of DARE: dLLM Alignment and Reinforcement Executor

Python 148 2 Updated Dec 27, 2025

Based on Nano-vLLM, a simple replication of vLLM with self-contained paged attention and flash attention implementation

Python 136 8 Updated Dec 31, 2025

SkyRL: A Modular Full-stack RL Library for LLMs

Python 1,420 211 Updated Jan 3, 2026

A Gym for Agentic LLMs

Python 411 27 Updated Dec 31, 2025

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 2,564 190 Updated Jan 3, 2026

OpenTinker is an RL-as-a-Service infrastructure for foundation models

Python 492 38 Updated Dec 31, 2025

Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, 20+ clouds, or on-prem).

Python 9,161 896 Updated Jan 3, 2026

A simple, performant and scalable Jax LLM!

Python 2,068 444 Updated Jan 3, 2026

🚀 Efficient implementations of state-of-the-art linear attention models

Python 4,171 348 Updated Dec 30, 2025

Official code of paper "DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution"

Python 122 7 Updated Feb 14, 2025

NUS CS5242 Neural Networks and Deep Learning, Xavier Bresson, 2025

Jupyter Notebook 404 103 Updated Apr 19, 2025

Standardized environment infrastructure for Agentic AI development.

Python 222 24 Updated Dec 31, 2025
Python 658 69 Updated Jan 4, 2026

Training VLM agents with multi-turn reinforcement learning

Python 363 40 Updated Jan 2, 2026

Multi-Turn RL Training System with AgentTrainer for Language Model Game Reinforcement Learning

Python 56 10 Updated Dec 18, 2025

Home for "How To Scale Your Model", a short blog-style textbook about scaling LLMs on TPUs

HTML 797 116 Updated Jan 3, 2026

Chai-1, SOTA model for biomolecular structure prediction

Python 1,847 256 Updated Dec 3, 2025

CUDA Tile IR is an MLIR-based intermediate representation and compiler infrastructure for CUDA kernel optimization, focusing on tile-based computation patterns and optimizations targeting NVIDIA te…

MLIR 743 46 Updated Dec 20, 2025

TUI for the Slurm Workload Manager

Rust 386 20 Updated Dec 23, 2025

Sharp Monocular View Synthesis in Less Than a Second

Python 6,405 410 Updated Dec 19, 2025

Where the GPU for the Web work happens!

Bikeshed 5,276 356 Updated Jan 2, 2026

我的开发经验+提示词库=vibecoding工作站;My development experience + prompt dictionary = Vibecoding workstation;ניסיון הפיתוח שלי + מילון פרומפטים = תחנת עבודה Vibecoding;私の開発経験 + プロンプト辞書 = Vibecoding ワークステーション;나…

Python 5,966 631 Updated Jan 4, 2026

VLA-0: Building State-of-the-Art VLAs with Zero Modification

Python 416 21 Updated Dec 9, 2025

JAX bindings for the flash-attention3 kernels

C++ 18 3 Updated Jan 2, 2026

A blazingly fast, open-source application server with type-safe APIs, built-in WebAssembly runtime, realtime, auth, and admin UI built on Rust, SQLite & Wasmtime.

Rust 4,344 123 Updated Jan 2, 2026

Explanation to key concepts in ML

8,251 673 Updated Jun 30, 2025
Next