Skip to content
View ormandi's full-sized avatar

Block or report ormandi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
5 stars written in Python
Clear filter

A library of reinforcement learning components and agents

Python 3,888 519 Updated Dec 25, 2025

TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.

Python 2,981 746 Updated Dec 7, 2025

Infrastructure for Machine Learning Guided Optimization (MLGO) in LLVM.

Python 740 112 Updated Dec 18, 2025

A new way to communicate with LLM by sharing a portion of your screen instead of typing.

Python 9 Updated Feb 14, 2024

Fine-tuning LLMs using QLoRA

Python 1 Updated Mar 12, 2025