+
Skip to content
@PRIME-RL

PRIME-RL

Researching scalable (RL) methods on language models.

Pinned Loading

  1. SimpleVLA-RL SimpleVLA-RL Public

    Online RL with Simple Reward Enables Training VLA Models with Only One Trajectory

    Python 268 10

  2. Entropy-Mechanism-of-RL Entropy-Mechanism-of-RL Public

    The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.

    Python 236 8

  3. TTRL TTRL Public

    TTRL: Test-Time Reinforcement Learning

    Python 699 54

  4. PRIME PRIME Public

    Scalable RL solution for advanced reasoning of language models

    Python 1.6k 95

  5. ImplicitPRM ImplicitPRM Public

    Repo of paper "Free Process Rewards without Process Labels"

    Python 154 11

Repositories

Showing 5 of 5 repositories
  • Entropy-Mechanism-of-RL Public

    The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.

    PRIME-RL/Entropy-Mechanism-of-RL’s past year of commit activity
    Python 236 8 6 0 Updated Jun 26, 2025
  • TTRL Public

    TTRL: Test-Time Reinforcement Learning

    PRIME-RL/TTRL’s past year of commit activity
    Python 699 MIT 54 6 (1 issue needs help) 0 Updated Jun 26, 2025
  • SimpleVLA-RL Public

    Online RL with Simple Reward Enables Training VLA Models with Only One Trajectory

    PRIME-RL/SimpleVLA-RL’s past year of commit activity
    Python 268 MIT 10 8 1 Updated Jun 20, 2025
  • PRIME Public

    Scalable RL solution for advanced reasoning of language models

    PRIME-RL/PRIME’s past year of commit activity
    Python 1,647 Apache-2.0 95 6 1 Updated Mar 18, 2025
  • ImplicitPRM Public

    Repo of paper "Free Process Rewards without Process Labels"

    PRIME-RL/ImplicitPRM’s past year of commit activity
    Python 154 Apache-2.0 11 12 0 Updated Mar 14, 2025

Top languages

Python

Most used topics

Loading…

点击 这是indexloc提供的php浏览器服务,不要输入任何密码和下载