From da98a88f2d67af47f0604faf0b7d3bd595485fd5 Mon Sep 17 00:00:00 2001 From: robert-cronin Date: Tue, 25 Mar 2025 05:17:29 +0000 Subject: [PATCH] Add Kaito Signed-off-by: robert-cronin --- README.md | 1 + 1 file changed, 1 insertion(+) diff --git a/README.md b/README.md index 18bab4f..c897416 100644 --- a/README.md +++ b/README.md @@ -138,6 +138,7 @@ An awesome & curated list of the best LLMOps tools for developers. | [ray-llm](https://github.com/ray-project/ray-llm) | LLMs on Ray - RayLLM | ![GitHub Badge](https://img.shields.io/github/stars/ray-project/ray-llm.svg?style=flat-square) | | [Xinference](https://github.com/xorbitsai/inference) | Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop. | ![GitHub Badge](https://img.shields.io/github/stars/xorbitsai/inference.svg?style=flat-square) | | [KubeAI](https://github.com/substratusai/kubeai) | Deploy and scale machine learning models on Kubernetes. Built for LLMs, embeddings, and speech-to-text. | ![GitHub Badge](https://img.shields.io/github/stars/substratusai/kubeai.svg?style=flat-square) | +| [Kaito](https://github.com/kaito-project/kaito) | A Kubernetes operator that simplifies serving and tuning large AI models (e.g. Falcon or phi-3) using container images and GPU auto-provisioning. Includes an OpenAI-compatible server for inference and preset configurations for popular runtimes such as vLLM and transformers. | ![GitHub Badge](https://img.shields.io/github/stars/kaito-project/kaito.svg?style=flat-square) | **[⬆ back to ToC](#table-of-contents)**