  • University of Michigan
  • Ann Arbor

Highlights

  • Pro

DocuSnap frontend built in Android Studio

Kotlin 1 Updated Jul 19, 2025

A toolchain for distributed system runtime checkers

Java 7 Updated May 20, 2025

Ultra and Unified CCL

C++ 422 26 Updated Jul 19, 2025

A curated reading list for machine learning reliability research and practice

17 1 Updated Jul 14, 2025

Mirage: Automatically Generating Fast GPU Kernels without Programming in Triton/CUDA

C++ 1,591 99 Updated Jul 19, 2025

Automatic AI-powered test suite generator

Python 81 9 Updated Jun 10, 2025

Artifact Evaluation Scripts and Workloads for TrainCheck (OSDI'25)

Python 2 Updated May 22, 2025

A Framework for Automated Validation of Deep Learning Training Tasks

Python 36 1 Updated Jul 19, 2025

ByteCheckpoint: A Unified Checkpointing Library for LFMs

Python 226 10 Updated Jul 10, 2025

JaxPP is a library for JAX that enables flexible MPMD pipeline parallelism for large-scale LLM training

Python 51 Updated Jul 8, 2025

Super-Efficient RLHF Training of LLMs with Parameter Reallocation

Python 306 20 Updated Apr 24, 2025

Collective communications library with various primitives for multi-machine training.

C++ 1,330 335 Updated Jul 16, 2025

Disseminated, Distributed OS for Hardware Resource Disaggregation. USENIX OSDI 2018 Best Paper.

C 494 75 Updated May 6, 2021

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 22,605 1,530 Updated Jun 26, 2025

You call this piece of junk operating system source code? Reading the Linux 0.11 core code like a novel

HTML 21,241 2,852 Updated Mar 22, 2025

Tile primitives for speedy kernels

Cuda 2,523 160 Updated Jul 15, 2025

DeepSeek-V3/R1 inference performance simulator

Jupyter Notebook 155 21 Updated Mar 27, 2025

VIP cheatsheet for Stanford's CME 295 Transformers and Large Language Models

2,204 295 Updated May 28, 2025

Create beautiful diagrams just by typing notation in plain text.

TypeScript 7,795 354 Updated Jul 10, 2025

NEO is an LLM inference engine built to ease the GPU memory crisis through CPU offloading

Python 44 12 Updated Jun 16, 2025

A Datacenter Scale Distributed Inference Serving Framework

Rust 4,508 491 Updated Jul 19, 2025

Automated Testing and Adaptive Detection of Slow Faults in Distributed Systems

Python 13 Updated Mar 6, 2025

Must-read papers on improving efficiency for LLM serving clusters

29 1 Updated May 28, 2025

📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥

1,610 62 Updated Jul 17, 2025

📰 Must-read papers on KV Cache Compression (constantly updating 🤗).

487 11 Updated Jun 26, 2025

Domain-specific language designed to streamline the development of high-performance GPU/CPU/accelerator kernels

C++ 1,448 119 Updated Jul 19, 2025

PyZMQ: Python bindings for zeromq

Python 3,931 649 Updated Jul 7, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 11,191 1,862 Updated Jul 19, 2025