Contexts Optical Compression

Python 21,750 1,958 Updated Oct 25, 2025

Splices the SmolVLM2 vision head onto the Qwen3-0.6B model and fine-tunes the combined model

Python 2 Updated Jul 28, 2025

Everything about the SmolLM and SmolVLM family of models

Python 3,529 250 Updated Nov 20, 2025

AI-Powered Photos App for the Decentralized Web 🌈💎✨

Go 39,051 2,188 Updated Dec 30, 2025

A CPU Realtime VLM in 500M. Surpassed Moondream2 and SmolVLM. Training from scratch with ease.

Python 243 23 Updated Apr 22, 2025

Align Anything: Training All-modality Model with Feedback

Python 4,615 507 Updated Nov 27, 2025

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 4,366 336 Updated Dec 29, 2025

YOLOv5 + DeepSORT: pedestrian and vehicle tracking, detection, and counting

Python 1,069 252 Updated May 27, 2021

Optimizing inference proxy for LLMs

Python 3,254 261 Updated Dec 25, 2025

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 8,044 702 Updated Feb 10, 2025

🔥 The Web Data API for AI - Turn entire websites into LLM-ready markdown or structured data

TypeScript 73,000 5,664 Updated Jan 5, 2026
Python 4,480 433 Updated Sep 14, 2025

Open-source evaluation toolkit for large multi-modality models (LMMs), supporting 220+ LMMs and 80+ benchmarks

Python 3,639 600 Updated Dec 30, 2025

MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone

Python 22,553 1,704 Updated Sep 24, 2025

Notes on the knowledge and interview questions relevant to large language model (LLM) algorithm (application) engineers

HTML 11,625 1,174 Updated Apr 30, 2025

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 26,016 1,827 Updated Oct 13, 2025

Augmentation pipeline for rendering synthetic paper printing, faxing, scanning and copy machine processes

Python 491 59 Updated Jul 20, 2025

TexTeller converts images to LaTeX formulas (image2latex, LaTeX OCR) with higher accuracy and stronger generalization, enabling it to cover most usage scenarios.

Python 694 69 Updated Aug 22, 2025

UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition

Python 441 37 Updated Sep 28, 2025

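This project shares the technical principles behind large models along with hands-on experience (large-model engineering and deploying large-model applications)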

HTML 22,633 2,643 Updated Dec 30, 2025

A Next-Generation Training Engine Built for Ultra-Large MoE Models

Python 5,041 395 Updated Jan 5, 2026

Ongoing research training transformer models at scale

Python 14,785 3,450 Updated Jan 5, 2026

Retrieval and Retrieval-augmented LLMs

Python 11,093 822 Updated Dec 15, 2025

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

72,300 8,306 Updated Dec 22, 2025

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Python 3,331 279 Updated May 4, 2024

Java Geometry Expert

Java 46 22 Updated Jul 5, 2025

Scalable data preprocessing and curation toolkit for LLMs

Python 1,327 205 Updated Jan 2, 2026

The state-of-the-art image restoration model without nonlinear activation functions.

Python 2,822 367 Updated Jul 3, 2024

A tool for manually annotating and ranking response data in the RLHF stage of large-model training.

Python 258 20 Updated Aug 1, 2023