amulil

Focusing

amulil

Focusing

庾信平生无萧瑟，暮年诗赋动江关。

15 followers · 35 following

Achievements

Highlights

Developer Program Member

Lists (11)

Sort

分布式

工具

80 repositories

扩散模型

语言模型

13 repositories

Stars

vllm-project / llm-compressor

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 2,115 263 Updated Oct 20, 2025

linkedin / fmchisel

fmchisel: Efficient Compression and Training Algorithms for Foundation Models

Python 66 7 Updated Oct 9, 2025

harvard-edge / cs249r_book

Introduction to Machine Learning Systems

Python 3,634 402 Updated Oct 20, 2025

hao-ai-lab / cse234-w25-PA

Python 41 63 Updated Mar 14, 2025

GeeeekExplorer / nano-vllm

Nano vLLM

Python 7,129 912 Updated Aug 31, 2025

changjonathanc / flex-nano-vllm

FlexAttention based, minimal vllm-style inference engine for fast Gemma 2 inference.

Python 298 16 Updated Aug 7, 2025

openai / harmony

Renderer for the harmony response format to be used with gpt-oss

Rust 3,911 214 Updated Aug 15, 2025

mirage-project / mirage

Mirage Persistent Kernel: Compiling LLMs into a MegaKernel

C++ 1,893 139 Updated Oct 20, 2025

amulil / blog

To record is already an act of resistance.

1 Updated Aug 3, 2025

RahulSChand / gpu_poor

Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization

JavaScript 1,375 84 Updated Dec 3, 2024

ScalingIntelligence / KernelBench

KernelBench: Can LLMs Write GPU Kernels? - Benchmark with Torch -> CUDA problems

Python 616 76 Updated Oct 10, 2025

sgl-project / genai-bench

Genai-bench is a powerful benchmark tool designed for comprehensive token-level performance evaluation of large language model (LLM) serving systems.

Python 220 26 Updated Oct 20, 2025

amulil / aiinfra.learn

Learn every thing about AI Infra.

1 Updated Jun 21, 2025

amulil / cleanvllm

A single-file educational implementation for understanding vLLM's core concepts and running LLM inference.

Python 23 3 Updated Jun 22, 2025

Infini-AI-Lab / Multiverse

Python 97 9 Updated Sep 13, 2025

huggingface / lerobot

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 18,268 2,797 Updated Oct 20, 2025

AsyncFuncAI / deepwiki-open

Open Source DeepWiki: AI-Powered Wiki Generator for GitHub/Gitlab/Bitbucket Repositories. Join the discord: https://discord.gg/gMwThUMeme

Python 11,304 1,200 Updated Oct 11, 2025

zhaochenyang20 / Awesome-ML-SYS-Tutorial

My learning notes/codes for ML SYS.

Python 3,910 234 Updated Oct 6, 2025

linkedin / Liger-Kernel

Efficient Triton Kernels for LLM Training

Python 5,756 418 Updated Oct 20, 2025

gpu-mode / lectures

Material for gpu-mode lectures

Jupyter Notebook 5,187 518 Updated Sep 23, 2025

LMCache / LMCache

Supercharge Your LLM with the Fastest KV Cache Layer

Python 5,626 651 Updated Oct 20, 2025

vllm-project / production-stack

vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization

Python 1,862 309 Updated Oct 13, 2025

huggingface / picotron

Minimalistic 4D-parallelism distributed training framework for education purpose

Python 1,862 138 Updated Aug 26, 2025

modelcontextprotocol / servers

Model Context Protocol Servers

TypeScript 70,862 8,457 Updated Oct 20, 2025

amulil / gpu.learn

Learn how to use GPUs to accelerate deep learning.

2 Updated Jun 18, 2025

Infatoshi / cuda-course

Cuda 1,629 290 Updated Oct 13, 2025

luhengshiwo / LLMForEverybody

每个人都能看懂的大模型知识分享，LLMs春/秋招大模型面试前必看，让你和面试官侃侃而谈

Jupyter Notebook 4,536 443 Updated Oct 13, 2025

aburkov / theLMbook

This is the official repository for The Hundred-Page Language Models Book by Andriy Burkov

Jupyter Notebook 1,954 322 Updated May 21, 2025

aburkov / theMLbook

The Python code to reproduce the illustrations from The Hundred-Page Machine Learning Book.

Python 1,988 587 Updated Jun 27, 2024

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 25,560 2,396 Updated Sep 8, 2025

amulil

Highlights

Lists (11)

AGI

Course

Deep Learning

NLP

Program

RL

Web

分布式

工具

扩散模型

语言模型

Stars