Software Engineer, Google Brain
Large scale, distributed ML, RL systems
Stars
5
stars
written in Python
Clear filter
A library of reinforcement learning components and agents
TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.
Infrastructure for Machine Learning Guided Optimization (MLGO) in LLVM.
A new way to communicate with LLM by sharing a portion of your screen instead of typing.
csabakecskemeti / llm_qlora
Forked from georgesung/llm_qloraFine-tuning LLMs using QLoRA




