+
Skip to content
View amulil's full-sized avatar
:octocat:
Focusing
:octocat:
Focusing

Block or report amulil

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization

JavaScript 1,329 77 Updated Dec 3, 2024

KernelBench: Can LLMs Write GPU Kernels? - Benchmark with Torch -> CUDA problems

Python 480 51 Updated Jul 18, 2025

Genai-bench is a powerful benchmark tool designed for comprehensive token-level performance evaluation of large language model (LLM) serving systems.

Python 170 14 Updated Jul 14, 2025

Learn every thing about AI Infra.

1 Updated Jun 21, 2025

A single-file educational implementation for understanding vLLM's core concepts and running LLM inference.

Python 8 2 Updated Jun 22, 2025
Python 75 8 Updated Jul 10, 2025

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 15,964 2,177 Updated Jul 19, 2025

Open Source DeepWiki: AI-Powered Wiki Generator for GitHub/Gitlab/Bitbucket Repositories. Join the discord: https://discord.gg/gMwThUMeme

TypeScript 8,421 851 Updated Jul 18, 2025

My learning notes/codes for ML SYS.

Python 2,953 181 Updated Jul 19, 2025

Efficient Triton Kernels for LLM Training

Python 5,386 371 Updated Jul 19, 2025

Material for gpu-mode lectures

Jupyter Notebook 4,754 475 Updated Jun 18, 2025

Supercharge Your LLM with the Fastest KV Cache Layer

Python 3,245 363 Updated Jul 19, 2025

vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization

Python 1,514 230 Updated Jul 17, 2025

Minimalistic 4D-parallelism distributed training framework for education purpose

Python 1,599 111 Updated Jul 7, 2025

Model Context Protocol Servers

TypeScript 60,441 7,015 Updated Jul 18, 2025

Learn how to use GPUs to accelerate deep learning.

2 Updated Jun 18, 2025

每个人都能看懂的大模型知识分享,LLMs春/秋招大模型面试前必看,让你和面试官侃侃而谈

Jupyter Notebook 3,631 356 Updated Jun 7, 2025

This is the official repository for The Hundred-Page Language Models Book by Andriy Burkov

Jupyter Notebook 1,848 303 Updated May 21, 2025

The Python code to reproduce the illustrations from The Hundred-Page Machine Learning Book.

Python 1,959 582 Updated Jun 27, 2024

Fully open reproduction of DeepSeek-R1

Python 25,089 2,335 Updated Jul 17, 2025

s1: Simple test-time scaling

Python 6,503 754 Updated Jun 25, 2025
Python 1 Updated Jan 16, 2025

Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.

Python 26,334 4,680 Updated Aug 18, 2024

Python code for "Probabilistic Machine learning" book by Kevin Murphy

Jupyter Notebook 6,834 1,566 Updated Nov 26, 2024

the rl algos implementation inspired by cleanrl

Python 2 Updated Jan 15, 2023

Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization

JavaScript 4,883 502 Updated Jun 6, 2025

nanoGPT style version of Llama 3.1

Python 1,400 85 Updated Aug 8, 2024

Coding a Multimodal (Vision) Language Model from scratch in PyTorch with full explanation: https://www.youtube.com/watch?v=vAmKB7iPkWw

Python 504 80 Updated Dec 6, 2024

🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)

JavaScript 6,460 661 Updated Jul 4, 2025
Next
点击 这是indexloc提供的php浏览器服务,不要输入任何密码和下载