Skip to content
View jiaau's full-sized avatar
  • China
  • 08:32 (UTC +08:00)

Highlights

  • Pro

Block or report jiaau

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Implementing scalable LLMs in pure JAX (no third-party libraries)

Python 31 1 Updated Jan 2, 2026

OpenTUI is a library for building terminal user interfaces (TUIs)

TypeScript 6,884 265 Updated Jan 2, 2026

Professional Antigravity Account Manager & Switcher. One-click seamless account switching for Antigravity Tools. Built with Tauri v2 + React (Rust).专业的 Antigravity 账号管理与切换工具。为 Antigravity 提供一键无缝账号切…

Rust 7,186 880 Updated Jan 3, 2026

A general and accurate MACs / FLOPs profiler for PyTorch models

Python 633 43 Updated Jul 29, 2025

A template for modern C++ projects using CMake, Clang-Format, CI, unit testing and more, with support for downstream inclusion.

CMake 1,875 220 Updated Dec 26, 2025

🚀 Kick-start your C++! A template for modern C++ projects using CMake, CI, code coverage, clang-format, reproducible dependency management and much more.

CMake 5,257 477 Updated Mar 12, 2025

git worktrees + tmux windows for zero-friction parallel dev

Rust 514 22 Updated Jan 3, 2026

Minimal reproduction of OneRec

Python 810 117 Updated Dec 17, 2025

Pipeline Parallelism Emulation and Visualization

Python 74 7 Updated Jun 12, 2025

Accelerating MoE with IO and Tile-aware Optimizations

Python 503 36 Updated Dec 25, 2025

An implementation of the Debug Adapter Protocol for Python

Python 2,304 183 Updated Dec 15, 2025

Muon is an optimizer for hidden layers in neural networks

Python 2,160 104 Updated Nov 23, 2025
Python 35 2 Updated Mar 7, 2025

Tiny-FSDP, a minimalistic re-implementation of the PyTorch FSDP

Python 93 8 Updated Aug 20, 2025

[HPCA 2026] A GPU-optimized system for efficient long-context LLMs decoding with low-bit KV cache.

C++ 73 6 Updated Dec 18, 2025

A course of learning LLM inference serving on Apple Silicon for systems engineers: build a tiny vLLM + Qwen.

Python 3,609 249 Updated Dec 18, 2025

Ring attention implementation with flash attention

Python 957 91 Updated Sep 10, 2025

A cinematic Git commit replay tool for the terminal, turning your Git history into a living, animated story.

Rust 3,861 86 Updated Jan 1, 2026

Remote vanilla PDB (over TCP sockets).

Python 303 29 Updated Oct 24, 2022

📰 Must-read papers and blogs on Speculative Decoding ⚡️

1,066 55 Updated Dec 22, 2025

A tiny debugger implement the GDB Remote Serial Protocol. Can work on i386, x86_64, ARM and PowerPC.

C 175 41 Updated Aug 16, 2022

An early research stage expert-parallel load balancer for MoE models based on linear programming.

Python 481 27 Updated Nov 19, 2025
C++ 213 6 Updated Nov 19, 2025

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …

Python 11,945 1,099 Updated Jan 3, 2026

Ideas for projects related to Tinker

139 7 Updated Nov 6, 2025

Tutorial Exercises and Code for GPU Communications Tutorial at HOT Interconnects 2025

C++ 25 2 Updated Oct 22, 2025

Speed Always Wins: A Survey on Efficient Architectures for Large Language Models

381 31 Updated Nov 11, 2025

Sampling profiler for Python programs

Rust 14,787 493 Updated Dec 15, 2025

Allow torch tensor memory to be released and resumed later

Python 196 32 Updated Dec 2, 2025
Next