
A reactive notebook for Python — run reproducible experiments, query with SQL, execute as a script, deploy as an app, and version with git. Stored as pure Python. All in a modern, AI-native editor.

Python 18,268 855 Updated Jan 8, 2026

Pie: Programmable LLM Serving

Rust 83 11 Updated Jan 8, 2026

Open-source implementation of AlphaEvolve

Python 5,061 787 Updated Dec 24, 2025

Tempo is a system for declarative, efficient, end-to-end compiled dynamic deep learning

Python 28 3 Updated Oct 21, 2025

LLMServingSim: A HW/SW Co-Simulation Infrastructure for LLM Inference Serving at Scale

Python 170 32 Updated Jul 18, 2025

🌎💪 BrowserGym, a Gym environment for web task automation

Python 1,070 143 Updated Jan 8, 2026

verl: Volcano Engine Reinforcement Learning for LLMs

Python 18,147 2,979 Updated Jan 8, 2026

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,566 2,013 Updated Nov 1, 2025

A library for advanced large language model reasoning

Python 2,319 204 Updated Jun 10, 2025

[DEPRECATED] Moved to ROCm/rocm-systems repo

C++ 31 24 Updated Jan 6, 2026

Causal depthwise conv1d in CUDA, with a PyTorch interface

Cuda 694 147 Updated Dec 23, 2025

Deep learning at the speed of light.

Rust 2,673 181 Updated Jan 8, 2026

OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models

Python 1,831 136 Updated Jan 17, 2025

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 26,065 1,832 Updated Oct 13, 2025

The Lean version manager

Rust 461 48 Updated Oct 6, 2025

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 4,544 497 Updated Jan 8, 2026

Machine Learning Engineering Open Book

Python 16,163 992 Updated Dec 20, 2025

Tile primitives for speedy kernels

Cuda 3,038 221 Updated Jan 8, 2026

A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code.

Python 451 27 Updated Mar 10, 2025

DeepEP: an efficient expert-parallel communication library

Cuda 8,872 1,054 Updated Dec 29, 2025

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 11,962 926 Updated Dec 15, 2025

Minimalistic large language model 3D-parallelism training

Python 2,410 264 Updated Dec 11, 2025

MoBA: Mixture of Block Attention for Long-Context LLMs

Python 2,032 129 Updated Apr 3, 2025

Minimalistic 4D-parallelism distributed training framework for educational purposes

Python 1,939 152 Updated Aug 26, 2025

CVPR and NeurIPS poster examples and templates. May we have in-person poster session soon!

1,848 163 Updated May 9, 2023

A PyTorch native platform for training generative AI models

Python 4,942 662 Updated Jan 8, 2026

Tips for Writing a Research Paper using LaTeX

TeX 3,647 405 Updated May 4, 2023

Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", ACL 2024

Python 351 33 Updated May 3, 2025