Pinned Loading
-
verl-pipeline
verl-pipeline PublicForked from agentica-project/verl-pipeline
Async pipelined version of Verl
Python
-
Open-Llama
Open-Llama PublicThe complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.
-
huggingface/transformers
huggingface/transformers Public🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
-
baichuan-inc/Baichuan-7B
baichuan-inc/Baichuan-7B PublicA large-scale 7B pretraining language model developed by BaiChuan-Inc.
-
TITAN-RL
TITAN-RL PublicTITAN-RL is a distributed reinforcement learning framework that separates policy rollout, experience storage, and training into independent microservices. This design enables flexible scaling and e…
Python
-
If the problem persists, check the GitHub status page or contact support.