Stars
SlowFast-LLaVA: A Strong Training-Free Baseline for Video Large Language Models
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Ovis2, InternVL3, Llava, GLM4…
gLLM: Global Balanced Pipeline Parallelism System for Distributed LLM Serving with Token Throttling
Supercharge Your LLM with the Fastest KV Cache Layer
RikkaHub is a Android APP that supports for multiple LLM providers.
Time Blindness: Why Video-Language Models Can't See What Humans Can?
The entrance repository of Markdown presentation ecosystem
An open-source AI agent that brings the power of Gemini directly into your terminal.
Jan is an open source alternative to ChatGPT that runs 100% offline on your computer
🐶 Kubernetes CLI To Manage Your Clusters In Style!
contaiNERD CTL - Docker-compatible CLI for containerd, with support for Compose, Rootless, eStargz, OCIcrypt, IPFS, ...
Hydragen: High-Throughput LLM Inference with Shared Prefixes
Get started with building Fullstack Agents using Gemini 2.5 and LangGraph
Protocol Buffers - Google's data interchange format
Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/
Label Studio is a multi-type data labeling and annotation tool with standardized output format
A powerful tool for creating fine-tuning datasets for LLM
[ICLR2025] DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator