Skip to content
View hanwenyuan0907's full-sized avatar

Block or report hanwenyuan0907

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 4,365 335 Updated Dec 29, 2025

Solve Visual Understanding with Reinforced VLMs

Python 5,788 376 Updated Oct 21, 2025

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.

Python 50,318 4,155 Updated Jan 4, 2026

Witness the aha moment of VLM with less than $3.

Python 4,018 289 Updated May 19, 2025

[ICCV 2025] LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning

Python 2,111 82 Updated Dec 12, 2025

🧑‍🚀 全世界最好的LLM资料总结(多模态生成、Agent、辅助编程、AI审稿、数据处理、模型训练、模型推理、o1 模型、MCP、小语言模型、视觉语言模型) | Summary of the world's best LLM resources.

7,177 699 Updated Jan 3, 2026

Accepted by IJCAI-24 Survey Track

Python 226 8 Updated Aug 25, 2024

主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题

HTML 11,615 1,172 Updated Apr 30, 2025

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 8,044 702 Updated Feb 10, 2025

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 65,126 6,573 Updated Nov 11, 2025

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…

Jupyter Notebook 23,941 2,778 Updated Dec 11, 2025

LLM101n: Let's build a Storyteller

36,072 1,965 Updated Aug 1, 2024

Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Python 1,302 85 Updated Jul 14, 2024

MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone

Python 22,552 1,704 Updated Sep 24, 2025
Jupyter Notebook 6,157 1,627 Updated Jun 26, 2025

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Python 2,074 117 Updated Jul 29, 2024

Unified Speech Language Model for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"(ICLR 2024)

Python 151 12 Updated Sep 14, 2023

Adapt an LLM model to a Mixture-of-Experts model using Parameter Efficient finetuning (LoRA), injecting the LoRAs in the FFN.

Python 76 2 Updated Oct 21, 2025

Official inference library for Mistral models

Jupyter Notebook 10,609 1,001 Updated Nov 21, 2025

✨✨Latest Advances on Multimodal Large Language Models

17,107 1,101 Updated Dec 26, 2025

💬 An extensive collection of exceptional resources dedicated to the captivating world of talking face synthesis! ⭐ If you find this repo useful, please give it a star! 🤩

1,416 79 Updated Nov 6, 2025

[NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other models

Python 412 26 Updated Jun 25, 2025

该仓库主要记录 LLMs 算法工程师相关的顶会论文研读笔记(多模态、PEFT、小样本QA问答、RAG、LMMs可解释性、Agents、CoT)

369 18 Updated Mar 29, 2024

Large Language Model Text Generation Inference

Python 10,717 1,249 Updated Dec 19, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 64,853 7,870 Updated Jan 4, 2026

ChatGLM3 series: Open Bilingual Chat LLMs | 开源双语对话语言模型

Python 13,750 1,609 Updated Jan 13, 2025

The goal of this library is to generate more helpful exception messages for matrix algebra expressions for numpy, pytorch, jax, tensorflow, keras, fastai.

Jupyter Notebook 811 39 Updated Apr 7, 2022

An open-source educational chat model from ICALK, East China Normal University. 开源中英教育对话大模型。(通用基座模型,GPU部署,数据清理) 致敬: LLaMA, MOSS, BELLE, Ziya, vLLM

Jupyter Notebook 891 103 Updated Jul 18, 2025

基于ChatGLM2-6B进行微调,包括全参数、参数有效性、量化感知训练等,可实现指令微调、多轮对话微调等。

Python 26 1 Updated Jul 29, 2023
Next