Teng Ma stmatengss

♨️

Focusing

PhD, Tsinghua (16~21); Postdoc, Alibaba (21~23); Staff Engineer, Alibaba (23~present)

157 followers · 72 following

Alibaba Group
Beijing, China
stmatengss.github.io
https://scholar.google.com/citations?user=8zXo0KMAAAAJ
in/teng-ma-69a0a8115

Achievements

x3 x2

Achievements

x3 x2

Organizations

Lists (1)

Sort

LLM

1 repository

Starred repositories

3 stars written in Cuda

Clear filter

karpathy / llm.c

LLM training in simple, raw C/CUDA

Cuda 28,505 3,344 Updated Jun 26, 2025

thu-ml / SageAttention

[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves speedup of 2-5x compared to FlashAttention, without losing end-to-end metrics across language, image, and video models.

Cuda 2,974 299 Updated Dec 22, 2025

gpufs / gpufs

GPUfs - File system support for NVIDIA GPUs

Cuda 99 42 Updated Nov 26, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Teng Ma stmatengss

Achievements

Achievements

Organizations

Block or report stmatengss

Lists (1)

LLM

Starred repositories

karpathy / llm.c

thu-ml / SageAttention

gpufs / gpufs

Starred topics

delta-stepping