OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.

Python 6,174 673 Updated Oct 17, 2025

yuchenlin / rebiber

A simple tool to update bib entries with their official information (e.g., DBLP or the ACL anthology).

Python 2,925 164 Updated Jul 9, 2025

opendilab / LightZero

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)

Python 1,442 173 Updated Oct 20, 2025

OpenBMB / RLPR

Extrapolating RLVR to General Domains without Verifiers

Python 174 8 Updated Aug 12, 2025

KellerJordan / Muon

Muon is an optimizer for hidden layers in neural networks

Python 1,909 89 Updated Jul 12, 2025

MoonshotAI / Moonlight

Muon is Scalable for LLM Training

1,336 69 Updated Aug 3, 2025

Aider-AI / aider

aider is AI pair programming in your terminal

Python 38,022 3,593 Updated Oct 5, 2025

trotsky1997 / openai_grading_fix

Python 6 Updated Feb 17, 2025

kanishkg / cognitive-behaviors

Python 210 12 Updated Mar 26, 2025

ruixin31 / Spurious_Rewards

Python 333 20 Updated Jul 29, 2025

open-thought / reasoning-gym

[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Python 1,193 98 Updated Oct 6, 2025

JiuhaiChen / BLIP3o

Official implementation of BLIP3o-Series

Python 1,523 68 Updated Oct 20, 2025

MiniMax-AI / One-RL-to-See-Them-All

The official repo of One RL to See Them All: Visual Triple Unified Reinforcement Learning

Python 318 15 Updated May 31, 2025

volcengine / veScale

A PyTorch Native LLM Training Framework

Python 875 51 Updated Sep 12, 2025

InternLM / InternBootcamp

Python 319 24 Updated Aug 29, 2025

OpenBMB / MiniCPM-V

MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone

Python 22,095 1,656 Updated Sep 24, 2025

ByteDance-Seed / Seed1.5-VL

Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.

Jupyter Notebook 1,464 58 Updated Jun 14, 2025

calubkk / RAAT

[ACL-2024] Enhancing Noise Robustness of Retrieval-Augmented Language Models with Adaptive Adversarial Training

Python 38 3 Updated Oct 28, 2024

NVIDIA-NeMo / RL

Scalable toolkit for efficient model reinforcement

Python 948 159 Updated Oct 20, 2025

DreamLM / Dream

Dream 7B, a large diffusion language model

Python 1,023 55 Updated Sep 26, 2025

Zefan Wang ZefanW

Stars

LengSicong / MMR1

ML-GSAI / LLaDA

TsinghuaC3I / Unify-Post-Training

RUCAIBox / Passk_Training

basusourya / mirostat

MoonshotAI / Kimi-K2

LeapLabTHU / Absolute-Zero-Reasoner

huggingface / Math-Verify

allenai / OLMoE

HW-whistleblower / True-Story-of-Pangu

open-compass / opencompass