OpenDeepWiki is the open-source version of the DeepWiki project, aiming to provide a powerful knowledge management and collaboration platform. The project is mainly developed using C# and TypeScrip…

C# 2,594 329 Updated Dec 20, 2025

HW-whistleblower / True-Story-of-Pangu

诺亚盘古大模型研发背后的真正的心酸与黑暗的故事。

11,368 1,346 Updated Jul 9, 2025

alibaba / ROLL

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 2,555 190 Updated Jan 1, 2026

microsoft / rStar

Python 1,376 120 Updated Sep 12, 2025

axolotl-ai-cloud / axolotl

Go ahead and axolotl questions

Python 11,017 1,227 Updated Jan 1, 2026

NVIDIA-NeMo / Skills

A project to improve skills of large language models

Python 729 135 Updated Jan 1, 2026

zwhe99 / DeepMath

A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning

Python 280 13 Updated Sep 25, 2025

ByteDance-Seed / Seed-Coder

Seed-Coder is a family of lightweight open-source code LLMs comprising base, instruct and reasoning models, developed by ByteDance Seed.

719 54 Updated Jun 6, 2025

QwenLM / Qwen3

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 25,970 1,823 Updated Oct 13, 2025

JoeYing1019 / UltraTool

[ACL2024] Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex Scenarios

Python 66 2 Updated Aug 5, 2025

ofirpress / self-ask

Code and data for "Measuring and Narrowing the Compositionality Gap in Language Models"

Jupyter Notebook 323 36 Updated Dec 28, 2023

ShishirPatil / gorilla

Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)

Python 12,656 1,303 Updated Dec 17, 2025

ucb-bar / hammer

Hammer: Highly Agile Masks Made Effortlessly from RTL

Python 308 71 Updated Oct 10, 2025

qiancheng0 / ToolRL

Python 404 32 Updated Oct 16, 2025

modelscope / MCPBench

The evaluation benchmark on MCP servers

Python 232 15 Updated Sep 3, 2025

modelcontextprotocol / servers

Model Context Protocol Servers

TypeScript 75,339 9,135 Updated Dec 19, 2025

openai / codex

Lightweight coding agent that runs in your terminal

Rust 55,048 7,025 Updated Jan 1, 2026

ComposioHQ / Composio-Function-Calling-Benchmark

Function Calling Benchmark & Testing

Jupyter Notebook 92 5 Updated Jul 10, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

rookielyb

Achievements

Achievements

Block or report rookielyb

Stars

FSoft-AI4Code / CodeWikiBench

aliyun / wuying-agentbay-sdk

commit-0 / commit0

zhenyuhe00 / SWE-Swiss

SkyworkAI / Skywork-OR1

snap-research / locomo

bytedance / trae-agent

developzir / gepa-mcp

MTU-Bench-Team / MTU-Bench

sierra-research / tau-bench

chenchen0103 / ACEBench

multi-swe-bench / multi-swe-bench

AIDotNet / OpenDeepWiki