ormandi

Robert Ormandi ormandi

Software Engineer, Google Brain Large scale, distributed ML, RL systems

Achievements

5 stars written in Python

A library of reinforcement learning components and agents

Python 3,888 519 Updated Dec 25, 2025

TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.

Python 2,981 746 Updated Dec 7, 2025

Infrastructure for Machine Learning Guided Optimization (MLGO) in LLVM.

Python 740 112 Updated Dec 18, 2025

A new way to communicate with LLM by sharing a portion of your screen instead of typing.

Python 9 Updated Feb 14, 2024

Forked from georgesung/llm_qlora

Fine-tuning LLMs using QLoRA

Python 1 Updated Mar 12, 2025