A set of Docker images for training and serving models in TensorFlow
Powering Amazon custom machine learning chips
A unified framework for scalable computing
OpenMMLab Model Deployment Framework
Unified Model Serving Framework
Neural Network Compression Framework for enhanced OpenVINO
Superduper: Integrate AI models and machine learning workflows
A lightweight vision library for performing large object detection
A high-performance ML model serving framework, offers dynamic batching
Trainable models and NN optimization tools
A Pythonic framework to simplify AI service building
Everything you need to build state-of-the-art foundation models
A library for accelerating Transformer models on NVIDIA GPUs
Framework for Accelerating LLM Generation with Multiple Decoding Heads
An MLOps framework to package, deploy, monitor and manage models
LLMFlows - Simple, Explicit and Transparent LLM Apps
FlashInfer: Kernel Library for LLM Serving
Efficient few-shot learning with Sentence Transformers
Framework that is dedicated to making neural data processing
Implementation of "Tree of Thoughts
Implementation of model parallel autoregressive transformers on GPUs
Sequence-to-sequence framework, focused on Neural Machine Translation
A computer vision framework to create and deploy apps in minutes
OpenMMLab Video Perception Toolbox