TII · Abu Dhabi · @akanyaani · in/akanyaani
Stars
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.
Free online textbook of Jupyter notebooks for fast.ai Computational Linear Algebra course
huggingface / yourbench (forked from sumukshashidhar/yourbench): 🤗 Benchmark Large Language Models Reliably On Your Data
Minimalistic 4D-parallelism distributed training framework for educational purposes
Ongoing research on training transformer models at scale
A TTS model capable of generating ultra-realistic dialogue in one pass.
Development repository for the Triton language and compiler
Official implementation of the paper: "ZClip: Adaptive Spike Mitigation for LLM Pre-Training".
Official implementation of the paper: "A Refined Analysis of Massive Activations in LLMs".
Official implementation of the "Variance control via weight rescaling in LLM pretraining" paper.
A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
Annotations of the interesting ML papers I read
Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.
Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)
Efficient Triton Kernels for LLM Training
A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API
Write scalable load tests in plain Python 🚗💨
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization (a short illustrative sketch of the core merge step follows this list).
A simplified LLaMA implementation for training and inference tasks.
Accessible large language models via k-bit quantization for PyTorch.
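As referenced in the BPE entry above, here is a minimal sketch of one BPE training step on a toy byte-level corpus. The helper names (most_frequent_pair, merge) and the example string are hypothetical illustrations of the technique, not the minbpe implementation itself.

```python
# A minimal sketch of one BPE training step, assuming byte-level
# token ids on a toy string; illustrative only, not minbpe's code.
from collections import Counter

def most_frequent_pair(ids):
    """Count adjacent id pairs and return the most common one."""
    pairs = Counter(zip(ids, ids[1:]))
    return max(pairs, key=pairs.get)

def merge(ids, pair, new_id):
    """Replace every occurrence of `pair` in `ids` with `new_id`."""
    out, i = [], 0
    while i < len(ids):
        if i + 1 < len(ids) and (ids[i], ids[i + 1]) == pair:
            out.append(new_id)
            i += 2
        else:
            out.append(ids[i])
            i += 1
    return out

# Usage: start from raw UTF-8 bytes and apply a single merge,
# assigning the new token the first id beyond the byte range (256).
ids = list("aaabdaaabac".encode("utf-8"))
pair = most_frequent_pair(ids)   # most frequent adjacent pair
ids = merge(ids, pair, 256)      # sequence shrinks, vocab grows by one
```

Training a full tokenizer just repeats this step: each iteration records the chosen pair as a merge rule and assigns the next free id, until the target vocabulary size is reached.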