Highlights
Starred repositories
A lightweight RAG agent for processing markdown documents
VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
Persona Vectors: Monitoring and Controlling Character Traits in Language Models
Environments for LLM Reinforcement Learning
UQLM: Uncertainty Quantification for Language Models, is a Python package for UQ-based LLM hallucination detection
Code and data to accompany Racing Thoughts by Lepori et al. 2025
PAIR.withgoogle.com and friend's work on interpretability methods
Reproducing Anthropic’s tracing-the-thoughts interpretability work on open models
General Reasoner: Advancing LLM Reasoning Across All Domains [NeurIPS25]
This is the repo of developing reasoning models in the specific domain of financial, aim to enhance models capabilities in handling financial reasoning tasks.
Just a plain, simple and elegant one-page theme for research/academia.
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
RLHF experiments on a single A100 40G GPU. Support PPO, GRPO, REINFORCE, RAFT, RLOO, ReMax, DeepSeek R1-Zero reproducing.
Fully open reproduction of DeepSeek-R1
Minimal reproduction of DeepSeek R1-Zero
Training Large Language Model to Reason in a Continuous Latent Space
🔍 Search-o1: Agentic Search-Enhanced Large Reasoning Models [EMNLP 2025]