rlvr
Here are 10 public repositories matching this topic...
Official repository for "RLVR-World: Training World Models with Reinforcement Learning", https://arxiv.org/abs/2505.13934
-
Updated
Jun 9, 2025 - Python
🐝 SwarmBench: Benchmarking LLMs' Swarm Intelligence
-
Updated
May 21, 2025 - Python
grpo to train long form QA and instructions with long-form reward model
-
Updated
Jun 23, 2025 - Python
A curated collection of research papers on LLM Tool-Integrated Reasoning (TIR), where LLMs enhance reasoning by interacting with external tools such as calculators, search engines, and code interpreters.
-
Updated
Jul 3, 2025
A curated collection of papers combining Self-Supervised Learning (SSL) with Reinforcement Learning (RL) in the context of Large Language Models (LLMs), toward autonomous agents in the Era of Experience. Inspired by “Welcome to the Era of Experience” (Silver & Sutton, 2025).
-
Updated
Jun 23, 2025
A curated list of papers on implicit-reward reinforcement learning for LLMs — no human feedback, no gold answers, no verifiable rewards.
-
Updated
May 30, 2025
-
Updated
May 16, 2025
Improve this page
Add a description, image, and links to the rlvr topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the rlvr topic, visit your repo's landing page and select "manage topics."