From da98a88f2d67af47f0604faf0b7d3bd595485fd5 Mon Sep 17 00:00:00 2001
From: robert-cronin <robert.owen.cronin@gmail.com>
Date: Tue, 25 Mar 2025 05:17:29 +0000
Subject: [PATCH] Add Kaito

Signed-off-by: robert-cronin <robert.owen.cronin@gmail.com>
---
 README.md | 1 +
 1 file changed, 1 insertion(+)

diff --git a/README.md b/README.md
index 18bab4f..c897416 100644
--- a/README.md
+++ b/README.md
@@ -138,6 +138,7 @@ An awesome & curated list of the best LLMOps tools for developers.
 | [ray-llm](https://github.com/ray-project/ray-llm)                          | LLMs on Ray - RayLLM                                                                                                                                                                                                                                                                                                                               | ![GitHub Badge](https://img.shields.io/github/stars/ray-project/ray-llm.svg?style=flat-square)            |
 | [Xinference](https://github.com/xorbitsai/inference)                       | Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop. | ![GitHub Badge](https://img.shields.io/github/stars/xorbitsai/inference.svg?style=flat-square)            |
 | [KubeAI](https://github.com/substratusai/kubeai)                       | Deploy and scale machine learning models on Kubernetes. Built for LLMs, embeddings, and speech-to-text. | ![GitHub Badge](https://img.shields.io/github/stars/substratusai/kubeai.svg?style=flat-square)             |
+| [Kaito](https://github.com/kaito-project/kaito)                            | A Kubernetes operator that simplifies serving and tuning large AI models (e.g. Falcon or phi-3) using container images and GPU auto-provisioning. Includes an OpenAI-compatible server for inference and preset configurations for popular runtimes such as vLLM and transformers.                                                                 | ![GitHub Badge](https://img.shields.io/github/stars/kaito-project/kaito.svg?style=flat-square)            |
 
 **[⬆ back to ToC](#table-of-contents)**