Stars
Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends
OpenAI Guardrails Python (Preview)
Pretraining data reconstruction scripts for Apertus
Generate audiobooks from EPUBs, PDFs and text with synchronized captions.
Open-source benchmark evaluating the performance of web operators/agents
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
A library for making RepE control vectors
Official GitHub repo for the paper "Compression Represents Intelligence Linearly" [COLM 2024]
Democratizing Reinforcement Learning for LLMs
Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!
SkyRL: A Modular Full-stack RL Library for LLMs
Search-R1: An Efficient, Scalable RL Training Framework for LLMs with Interleaved Reasoning & Search Engine Calling, based on veRL
Minimal reproduction of DeepSeek R1-Zero
Scalable RL solution for advanced reasoning of language models
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
A lightweight LMM-based Document Parsing Model
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Solve Visual Understanding with Reinforced VLMs
Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.
Environments for LLM Reinforcement Learning
A playbook for systematically maximizing the performance of deep learning models.
🤗 smolagents: a barebones library for agents that think in code.
🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.
Inference and training library for high-quality TTS models.