+
Skip to content
View ZefanW's full-sized avatar

Block or report ZefanW

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

MMR1: Enhancing Multimodal Reasoning with Variance-Aware Sampling and Open Resources

Python 203 9 Updated Sep 26, 2025

Official PyTorch implementation for "Large Language Diffusion Models"

Python 3,078 206 Updated Oct 15, 2025

Towards a Unified View of Large Language Model Post-Training

Python 166 8 Updated Sep 8, 2025

The official repository of paper "Pass@k Training for Adaptively Balancing Exploration and Exploitation of Large Reasoning Models''

Python 92 4 Updated Aug 15, 2025

Code for the paper-"Mirostat: A Perplexity-Controlled Neural Text Decoding Algorithm" (https://arxiv.org/abs/2007.14966).

Jupyter Notebook 61 3 Updated Feb 7, 2022

Kimi K2 is the large language model series developed by Moonshot AI team

8,356 545 Updated Sep 11, 2025

Official Repository of Absolute Zero Reasoner

Python 1,711 285 Updated Aug 24, 2025
Python 971 46 Updated Jul 2, 2025

OLMoE: Open Mixture-of-Experts Language Models

Jupyter Notebook 886 81 Updated Sep 23, 2025

诺亚盘古大模型研发背后的真正的心酸与黑暗的故事。

11,384 1,371 Updated Jul 9, 2025

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Python 6,174 673 Updated Oct 17, 2025

A simple tool to update bib entries with their official information (e.g., DBLP or the ACL anthology).

Python 2,925 164 Updated Jul 9, 2025

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)

Python 1,442 173 Updated Oct 20, 2025

Extrapolating RLVR to General Domains without Verifiers

Python 174 8 Updated Aug 12, 2025

Muon is an optimizer for hidden layers in neural networks

Python 1,909 89 Updated Jul 12, 2025

Muon is Scalable for LLM Training

1,336 69 Updated Aug 3, 2025

aider is AI pair programming in your terminal

Python 38,022 3,593 Updated Oct 5, 2025
Python 6 Updated Feb 17, 2025
Python 333 20 Updated Jul 29, 2025

[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Python 1,193 98 Updated Oct 6, 2025

Official implementation of BLIP3o-Series

Python 1,523 68 Updated Oct 20, 2025

The official repo of One RL to See Them All: Visual Triple Unified Reinforcement Learning

Python 318 15 Updated May 31, 2025

A PyTorch Native LLM Training Framework

Python 875 51 Updated Sep 12, 2025
Python 319 24 Updated Aug 29, 2025

MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone

Python 22,095 1,656 Updated Sep 24, 2025

Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.

Jupyter Notebook 1,464 58 Updated Jun 14, 2025

[ACL-2024]Enhancing Noise Robustness of Retrieval-Augmented Language Models with Adaptive Adversarial Training

Python 38 3 Updated Oct 28, 2024

Scalable toolkit for efficient model reinforcement

Python 948 159 Updated Oct 20, 2025

Dream 7B, a large diffusion language model

Python 1,023 55 Updated Sep 26, 2025
Next
点击 这是indexloc提供的php浏览器服务,不要输入任何密码和下载