Skip to content
View mars1248's full-sized avatar

Block or report mars1248

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Efficient Triton Kernels for LLM Training

Python 5,119 339 Updated May 30, 2025

PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)

C++ 22,816 5,738 Updated May 30, 2025

使用Flutter编写的移动端

Dart 193 38 Updated Apr 11, 2022

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 38,650 4,401 Updated May 31, 2025

A machine learning compiler for GPUs, CPUs, and ML accelerators

C++ 3,204 564 Updated May 31, 2025

Enabling PyTorch on XLA Devices (e.g. Google TPU)

Python 2,612 530 Updated May 31, 2025

A machine learning compiler for GPUs, CPUs, and ML accelerators

C++ 2 1 Updated Feb 18, 2025

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 90,429 24,297 Updated May 31, 2025

A blazingly fast multi-language serialization framework powered by JIT and zero-copy.

Java 3,292 276 Updated May 30, 2025

LLM inference in C/C++

C++ 81,142 11,966 Updated May 31, 2025

DLRover: An Automatic Distributed Deep Learning System

Python 1,475 181 Updated May 30, 2025

An Open Source Machine Learning Framework for Everyone

C++ 190,185 74,689 Updated May 31, 2025

A unified framework for privacy-preserving data analysis and machine learning

Python 2,452 440 Updated May 21, 2025

DeepRec is a high-performance recommendation deep learning framework based on TensorFlow. It is hosted in incubation in LF AI & Data Foundation.

C++ 1,102 361 Updated Jan 21, 2025

Mobius is an AI infrastructure platform for distributed online learning, including online sample processing, training and serving.

Java 97 14 Updated Jun 21, 2024