+
Skip to content
Change the repository type filter

All

    Repositories list

    • slime

      Public
      slime is an LLM post-training framework for RL Scaling.
      Python
      2212.2k6125Updated Oct 20, 2025Oct 20, 2025
    • A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
      Python
      2032.9k586Updated Oct 14, 2025Oct 14, 2025
    • AgentRL

      Public
      Python
      05310Updated Oct 14, 2025Oct 14, 2025
    • MobileRL

      Public
      Python
      21220Updated Oct 10, 2025Oct 10, 2025
    • DeepDive

      Public
      DeepDive: Advancing Deep Search Agents with Knowledge Graphs and Multi-Turn RL
      Python
      1718610Updated Oct 2, 2025Oct 2, 2025
    • TDRM

      Public
      Python
      1810Updated Sep 25, 2025Sep 25, 2025
    • ReST-RL

      Public
      Reinforcing LLM Reasoning through Self-Training and Value-Guided Decoding
      Python
      01000Updated Sep 18, 2025Sep 18, 2025
    • INFTY

      Public
      INFTY Engine: An Optimization Toolkit to Support Continual AI
      Python
      926300Updated Sep 13, 2025Sep 13, 2025
    • DataSciBench: An LLM Agent Benchmark for Data Science
      Python
      33500Updated Sep 1, 2025Sep 1, 2025
    • Python
      1624040Updated Aug 18, 2025Aug 18, 2025
    • SWE-Dev

      Public
      [ACL25' Findings] SWE-Dev is an SWE agent with a scalable test case construction pipeline.
      Python
      05510Updated Jul 21, 2025Jul 21, 2025
    • Typescript SDK for Z.ai - Not yet released.
      TypeScript
      0510Updated Jul 17, 2025Jul 17, 2025
    • BiPro

      Public
      code and data for Paper: BIPro: Zero-shot Chinese Poem Generation via Block Inverse Prompting Constrained Generation Framework(ACL 2025 main)
      Python
      0700Updated Jun 28, 2025Jun 28, 2025
    • [ICLR 2025] LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs
      Python
      1771.8k262Updated Jun 24, 2025Jun 24, 2025
    • TreeRL

      Public
      TreeRL: LLM Reinforcement Learning with On-Policy Tree Search in ACL'25
      Python
      57140Updated Jun 16, 2025Jun 16, 2025
    • WebRL

      Public
      Building Open LLM Web Agents with Self-Evolving Online Curriculum RL
      Python
      3046600Updated Jun 6, 2025Jun 6, 2025
    • Python
      1910Updated May 29, 2025May 29, 2025
    • code, data and model for Paper: AlignMMBench: Evaluating Chinese Multimodal Alignment in Large Vision-Language Models (ACL'25 main)
      Python
      1510Updated May 20, 2025May 20, 2025
    • CogKit

      Public
      Finetuning and inference tools for the CogView4 and CogVideoX model series.
      Python
      1197171Updated May 14, 2025May 14, 2025
    • Towards Large Multimodal Models as Visual Foundation Agents
      Python
      8240160Updated Apr 24, 2025Apr 24, 2025
    • Source code of paper: A Stronger Mixture of Low-Rank Experts for Fine-Tuning Foundation Models. (ICML 2025)
      Python
      03300Updated Apr 2, 2025Apr 2, 2025
    • Parameter-Efficient Fine-Tuning for Foundation Models
      39400Updated Mar 31, 2025Mar 31, 2025
    • WebGLM

      Public
      WebGLM: An Efficient Web-enhanced Question Answering System (KDD 2023)
      Python
      1361.6k511Updated Mar 25, 2025Mar 25, 2025
    • WhoIsWho

      Public
      KDD'23 Web-Scale Academic Name Disambiguation: the WhoIsWho Benchmark, Leaderboard, and Toolkit
      Python
      154360Updated Mar 19, 2025Mar 19, 2025
    • Jupyter Notebook
      11850Updated Feb 24, 2025Feb 24, 2025
    • Python
      0500Updated Feb 16, 2025Feb 16, 2025
    • T1

      Public
      RL Scaling and Test-Time Scaling (ICML'25)
      111100Updated Jan 23, 2025Jan 23, 2025
    • ReST-MCTS

      Public
      ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)
      Python
      51671230Updated Jan 20, 2025Jan 20, 2025
    • LongBench

      Public
      LongBench v2 and LongBench (ACL 25'&24')
      Python
      107994605Updated Jan 15, 2025Jan 15, 2025
    • LongCite

      Public
      LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA
      Python
      3250780Updated Dec 31, 2024Dec 31, 2024
    点击 这是indexloc提供的php浏览器服务,不要输入任何密码和下载