Stars
Complete solutions to the Programming Massively Parallel Processors Edition 4
A reimplementation of Stable Diffusion 3.5 in pure PyTorch
Benchmarking Vision-Language Models on OCR tasks in Dynamic Video Environments
Subtitle Videos and add text motion graphics - https://www.supertranslate.ai/
Machine Learning Engineering Open Book
A blog where I write about research papers and blog posts I read.
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
What would you do with 1000 H100s...
A comprehensive deep dive into the world of tokens
Better Aligning Text-to-Image Models with Human Preference. ICCV 2023
Yet another PyTorch implementation of Stable Diffusion (probably easy to read)
A collection of most of my competitive programming templates!
Running large language models on a single GPU for throughput-oriented scenarios.
An unnecessarily tiny implementation of GPT-2 in NumPy.
An autoregressive character-level language model for making more things
Train to 94% on CIFAR-10 in <6.3 seconds on a single A100. Or ~95.79% in ~110 seconds (or less!)
[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
Website containing illustrations about Machine Learning theory!
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
Distributed SQL database in Rust, written as an educational project
Machine Learning Interviews from FAANG, Snapchat, LinkedIn. I have offers from Snapchat, Coupang, Stitchfix etc. Blog: mlengineer.io.