SqueezeAILab
SqueezeAI is part of Berkeley AI Research Lab at UC Berkeley focused on AI Systems research.
Popular repositories Loading
-
LLMCompiler
LLMCompiler Public[ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling
-
-
SqueezedAttention
SqueezedAttention PublicSQUEEZED ATTENTION: Accelerating Long Prompt LLM Inference
Repositories
Showing 10 of 10 repositories
- KVQuant Public
[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization
SqueezeAILab/KVQuant’s past year of commit activity
Top languages
Loading…
Most used topics
Loading…