Stars
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal domains, for both inference and training.
Tensors and Dynamic neural networks in Python with strong GPU acceleration
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Development repository for the Triton language and compiler
verl: Volcano Engine Reinforcement Learning for LLMs
This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.
A minimal GPU design in Verilog to learn how GPUs work from the ground up
DeepEP: an efficient expert-parallel communication library
High-speed Large Language Model Serving for Local Deployment
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
Modeling, training, eval, and inference code for OLMo
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
Training and serving large-scale neural networks with auto parallelization.
A bidirectional pipeline parallelism algorithm for computation-communication overlap in DeepSeek V3/R1 training.
[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized attention achieving a 2-5x speedup over FlashAttention without degrading end-to-end metrics across language, image, and video models.
DLRover: An Automatic Distributed Deep Learning System
Distributed Compiler based on Triton for Parallel Systems
DeepRec is a high-performance deep learning framework for recommendation, based on TensorFlow. It is hosted in incubation by the LF AI & Data Foundation.
BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.
Examples of CUDA implementations using CUTLASS CuTe
Allows torch tensor memory to be released and resumed later
Implements Flash Attention using CuTe.
Pre-training code for CrystalCoder 7B LLM
Data preparation code for CrystalCoder 7B LLM