yanngrecque

yanngrecque

1 follower · 0 following

Lists (8)

Sort

Stars

OpenPipe / ART

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!

Python 8,095 644 Updated Jan 3, 2026

tw93 / Mole

🐹 Deep clean and optimize your Mac.

Shell 24,788 658 Updated Jan 3, 2026

togethercomputer / flash-attention-3

Forked from Dao-AILab/flash-attention

Fast and memory-efficient exact attention

Python 27 1 Updated Dec 2, 2024

yjyddq / DARE

Official repository of DARE: dLLM Alignment and Reinforcement Executor

Python 148 2 Updated Dec 27, 2025

Wenyueh / MinivLLM

Based on Nano-vLLM, a simple replication of vLLM with self-contained paged attention and flash attention implementation

Python 136 8 Updated Dec 31, 2025

NovaSky-AI / SkyRL

SkyRL: A Modular Full-stack RL Library for LLMs

Python 1,420 211 Updated Jan 3, 2026

axon-rl / gem

A Gym for Agentic LLMs

Python 411 27 Updated Dec 31, 2025

alibaba / ROLL

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 2,564 190 Updated Jan 3, 2026

open-tinker / OpenTinker

OpenTinker is an RL-as-a-Service infrastructure for foundation models

Python 492 38 Updated Dec 31, 2025

skypilot-org / skypilot

Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, 20+ clouds, or on-prem).

Python 9,161 896 Updated Jan 3, 2026

AI-Hypercomputer / maxtext

A simple, performant and scalable Jax LLM!

Python 2,068 444 Updated Jan 3, 2026

fla-org / flash-linear-attention

🚀 Efficient implementations of state-of-the-art linear attention models

Python 4,171 348 Updated Dec 30, 2025

yueyang130 / DeeR-VLA

Official code of paper "DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution"

Python 122 7 Updated Feb 14, 2025

xbresson / CS5242_2025

NUS CS5242 Neural Networks and Deep Learning, Xavier Bresson, 2025

Jupyter Notebook 404 103 Updated Apr 19, 2025

inclusionAI / AEnvironment

Standardized environment infrastructure for Agentic AI development.

Python 222 24 Updated Dec 31, 2025

radixark / miles

Python 658 69 Updated Jan 4, 2026

mll-lab-nu / VAGEN

Training VLM agents with multi-turn reinforcement learning

Python 363 40 Updated Jan 2, 2026

lmgame-org / GRL

Multi-Turn RL Training System with AgentTrainer for Language Model Game Reinforcement Learning

Python 56 10 Updated Dec 18, 2025

jax-ml / scaling-book

Home for "How To Scale Your Model", a short blog-style textbook about scaling LLMs on TPUs

HTML 797 116 Updated Jan 3, 2026

chaidiscovery / chai-lab

Chai-1, SOTA model for biomolecular structure prediction

Python 1,847 256 Updated Dec 3, 2025

NVIDIA / cuda-tile

CUDA Tile IR is an MLIR-based intermediate representation and compiler infrastructure for CUDA kernel optimization, focusing on tile-based computation patterns and optimizations targeting NVIDIA te…

MLIR 743 46 Updated Dec 20, 2025

kabouzeid / turm

TUI for the Slurm Workload Manager

Rust 386 20 Updated Dec 23, 2025

apple / ml-sharp

Sharp Monocular View Synthesis in Less Than a Second

Python 6,405 410 Updated Dec 19, 2025

gpuweb / gpuweb

Where the GPU for the Web work happens!

Bikeshed 5,276 356 Updated Jan 2, 2026

stanford-cs336 / spring2025-lectures

Python 2,325 502 Updated Dec 3, 2025

tukuaiai / vibe-coding-cn

Forked from EnzeD/vibe-coding

我的开发经验+提示词库=vibecoding工作站；My development experience + prompt dictionary = Vibecoding workstation；ניסיון הפיתוח שלי + מילון פרומפטים = תחנת עבודה Vibecoding；私の開発経験 + プロンプト辞書 = Vibecoding ワークステーション；나…

Python 5,966 631 Updated Jan 4, 2026

yanngrecque

Lists (8)

Firebase

FM

LLM

ML

Protein

python

Triton

VLA

Stars