Stars
llm-d is a Kubernetes-native high-performance distributed LLM inference framework
ArcticTraining is a framework designed to simplify and accelerate the post-training process for large language models (LLMs)
An external provider for Llama Stack allowing for the use of RamaLama for inference.
A high-throughput and memory-efficient inference and serving engine for LLMs
Composable building blocks to build Llama Apps
RamaLama is an open-source developer tool that simplifies the local serving of AI models from any source and facilitates their use for inference in production, all through the familiar language of …
This is the latest version of the internal repository from Pebble Technology providing the software to run on Pebble watches. Proprietary source code has been removed from this repository and it wi…
InstructLab Core package. Use this to chat with a model and execute the InstructLab workflow to train a model using custom taxonomy data.
LS3 is a local implementation of an block based storage API coined Local Simple Storage Solution
cdoern / common
Forked from containers/commonLocation for shared common files in github.com/containers repos.
Podman: A tool for managing OCI containers and pods.