Stars
Post-training with Tinker
A minimal GPU design in Verilog to learn how GPUs work from the ground up
Multi-Turn RL Training System with AgentTrainer for Language Model Game Reinforcement Learning
An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models