Skip to content
View yanngrecque's full-sized avatar

Block or report yanngrecque

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

LLM

15 repositories

A collection of lightweight interpretability scripts to understand how LLMs think

Python 88 7 Updated Jan 1, 2026

The filesystem for agents.

Rust 1,580 87 Updated Jan 7, 2026

Staging repo for development of native port of TypeScript

Go 23,678 782 Updated Jan 8, 2026

Training VLM agents with multi-turn reinforcement learning

Python 364 40 Updated Jan 2, 2026

A simple, performant and scalable Jax LLM!

Python 2,072 447 Updated Jan 8, 2026

Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, 20+ clouds, or on-prem).

Python 9,178 898 Updated Jan 8, 2026

Based on Nano-vLLM, a simple replication of vLLM with self-contained paged attention and flash attention implementation

Python 176 13 Updated Jan 8, 2026

JAX bindings for the flash-attention3 kernels

C++ 18 3 Updated Jan 2, 2026

Official repository of DARE: dLLM Alignment and Reinforcement Executor

Python 150 2 Updated Jan 7, 2026

Fast and memory-efficient exact attention

Python 27 1 Updated Dec 2, 2024

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!

Python 8,124 645 Updated Jan 8, 2026

implementations and experimentation on mHC by deepseek - https://arxiv.org/abs/2512.24880

Python 191 14 Updated Jan 4, 2026

Tile-Based Runtime for Ultra-Low-Latency LLM Inference

Python 516 25 Updated Dec 23, 2025

⚙️🦀 Build modular and scalable LLM Applications in Rust

Rust 5,407 619 Updated Jan 7, 2026

A collection of various llm pruning implementations, training code for GPUs & TPUs, and evaluation script.

Python 49 6 Updated Dec 19, 2025