Stars
DRANET is a Kubernetes Network Driver that uses Dynamic Resource Allocation (DRA) to deliver high-performance networking for demanding applications in Kubernetes.
Containerd snapshots quota NRI plugin, user can set every container ephemeral storage, but in ephemeral storage use full pod will not restart.
verl: Volcano Engine Reinforcement Learning for LLMs
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
Virtualized Elastic KV Cache for Dynamic GPU Sharing and Beyond
Kubernetes-native AI serving platform for scalable model serving.
A workload for deploying LLM inference services on Kubernetes
agent-sandbox enables easy management of isolated, stateful, singleton workloads, ideal for use cases like AI agent runtimes.
Burn is a next generation tensor library and Deep Learning Framework that doesn't compromise on flexibility, efficiency and portability.
Lightweight coding agent that runs in your terminal
💖🧸 Self hosted, you owned Grok Companion, a container of souls of waifu, cyber livings to bring them into our worlds, wishing to achieve Neuro-sama's altitude. Capable of realtime voice chat, Minec…
Build memory-native AI agents with Memory OS — an open-source framework for long-term memory, retrieval, and adaptive learning in large language models. Agent Memory | Memory System | Memory Manage…
The Cloud-Native API Gateway and AI Gateway
Command-line tools for managing OCI model artifacts, which are bundled based on Model Spec
DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.
A Kubernetes MCP (Model Control Protocol) server that enables interaction with Kubernetes clusters through MCP tools.
vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization
MCP Server for kubernetes management commands





