-
Notifications
You must be signed in to change notification settings - Fork 51
Pull requests: THUDM/slime
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[doc][script] Add example script and docs for DeepSeek-R1
#104
by zhuzilin
was merged Jul 25, 2025
Loading…
Tiny reduce num gpus for tests and add a train_async test
#103
by fzyzcjy
was merged Jul 25, 2025
Loading…
Refactor rollout generator to support abortable samples API
#101
by fzyzcjy
was merged Jul 25, 2025
Loading…
Super tiny rename rollout generator and extract function to avoid name conflicts in future refactor
#98
by fzyzcjy
was merged Jul 25, 2025
Loading…
refactor: extract weight updators from TrainRayActor
#91
by zhuzilin
was merged Jul 22, 2025
Loading…
refactor: optimize update_from_distributed to use fewer nccl group
#81
by zhuzilin
was merged Jul 19, 2025
Loading…
refactor: extract common data utils and add fsdp skeleton
#78
by zhuzilin
was merged Jul 18, 2025
Loading…
Previous Next
ProTip!
Exclude everything labeled
bug
with -label:bug.