Stars
A Python-embedded DSL that makes it easy to write fast, scalable ML kernels with minimal boilerplate.
An extremely fast Python package and project manager, written in Rust.
VS Code extension for syntax highlighting C++/CUDA/HIP code in PyTorch load_inline() strings; see the load_inline() sketch after this list.
RFC document, tooling and other content related to the array API standard; see the array API sketch after this list.
AGENTS.md — a simple, open format for guiding coding agents
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs.
🎡 Build Python wheels for all platforms with minimal configuration.
A next generation Python CMake adaptor and Python API for plugins
Tilus is a tile-level kernel programming language with explicit control over shared memory and registers.
A minimal example for deploying Apache TVM's Relax IR using the C++ API
JaxPP is a library for JAX that enables flexible MPMD pipeline parallelism for large-scale LLM training
Distributed Compiler based on Triton for Parallel Systems
A Datacenter Scale Distributed Inference Serving Framework
A bidirectional pipeline parallelism algorithm for computation-communication overlap in DeepSeek V3/R1 training.
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
FlashMLA: Efficient Multi-head Latent Attention Kernels
verl: Volcano Engine Reinforcement Learning for LLMs
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
TL2cgen (TreeLite 2 C GENerator) is a model compiler for decision tree models
Domain-specific language designed to streamline the development of high-performance GPU/CPU/accelerator kernels
Machine Learning Containers for NVIDIA Jetson and JetPack-L4T
A PyTorch native platform for training generative AI models
Modeling, training, eval, and inference code for OLMo
Fast, Flexible and Portable Structured Generation
CUDA Python: Performance meets Productivity (see the cuda-python sketch below)
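For the load_inline() highlighter entry above, here is a minimal sketch of the kind of inline C++ string that extension targets. The module name, function name, and source body are illustrative, not taken from the extension itself; only torch.utils.cpp_extension.load_inline and its cpp_sources/functions parameters are the real PyTorch API.

```python
import torch
from torch.utils.cpp_extension import load_inline

# Inline C++ source embedded in a Python string; the VS Code extension
# highlights strings like this one.
cpp_source = r"""
#include <torch/extension.h>

torch::Tensor add_one(torch::Tensor x) {
    return x + 1;
}
"""

# Compiles the snippet on first use (requires a working C++ toolchain).
ext = load_inline(
    name="add_one_ext",        # illustrative module name
    cpp_sources=cpp_source,
    functions=["add_one"],     # functions to expose to Python
)

print(ext.add_one(torch.zeros(3)))  # tensor([1., 1., 1.])
```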
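For the array API standard entry, a sketch of what array-library-agnostic code looks like under the standard, assuming an array object that implements __array_namespace__ (NumPy >= 2.0 does); the standardize function itself is just an illustration.

```python
import numpy as np  # NumPy >= 2.0 implements much of the array API standard


def standardize(x):
    # Fetch the array API namespace from the array itself, so the same
    # function can work across conforming libraries without importing any
    # of them directly.
    xp = x.__array_namespace__()
    return (x - xp.mean(x)) / xp.std(x)


print(standardize(np.asarray([1.0, 2.0, 3.0])))
```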
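For the CUDA Python entry, a minimal sketch using the low-level driver bindings from the cuda-python package, which return a status code alongside any outputs; the exact module layout has shifted between releases, so treat the import path as an assumption.

```python
# Low-level driver bindings from cuda-python (import path assumed; newer
# releases also expose these under cuda.bindings).
from cuda import cuda

# Initialize the driver API; calls return (status, *outputs) tuples.
(err,) = cuda.cuInit(0)
assert err == cuda.CUresult.CUDA_SUCCESS

err, count = cuda.cuDeviceGetCount()
assert err == cuda.CUresult.CUDA_SUCCESS
print(f"CUDA devices visible to the driver: {count}")
```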