+
Skip to content
View akanyaani's full-sized avatar
💭
I may be slow to respond.
💭
I may be slow to respond.

Block or report akanyaani

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 4 Updated Jul 16, 2025

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 40,381 4,578 Updated Oct 14, 2025

DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.

Python 17,508 2,184 Updated Oct 14, 2025
Python 7 2 Updated Jun 3, 2025

Free online textbook of Jupyter notebooks for fast.ai Computational Linear Algebra course

Jupyter Notebook 10,621 2,661 Updated Apr 16, 2024
MDX 593 204 Updated Oct 12, 2025

🤗 Benchmark Large Language Models Reliably On Your Data

HTML 404 36 Updated Oct 2, 2025

Minimalistic 4D-parallelism distributed training framework for education purpose

Python 1,847 137 Updated Aug 26, 2025

Ongoing research training transformer models at scale

Python 13,832 3,153 Updated Oct 14, 2025

A TTS model capable of generating ultra-realistic dialogue in one pass.

Python 18,584 1,599 Updated Jul 6, 2025

Development repository for the Triton language and compiler

MLIR 17,222 2,302 Updated Oct 14, 2025

Official implementation of the paper: "ZClip: Adaptive Spike Mitigation for LLM Pre-Training".

Python 135 10 Updated Sep 8, 2025

Official implementation of the paper: "A Refined Analysis of Massive Activations in LLMs".

Python 10 3 Updated May 21, 2025

Official implementation of the "Variance control via weight rescaling in LLM pretraining" paper.

Python 5 Updated Jun 29, 2025

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 9,370 947 Updated Oct 14, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 5,791 713 Updated Oct 14, 2025

Annotations of the interesting ML papers I read

260 26 Updated Oct 13, 2025

Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.

Python 30,254 3,581 Updated Oct 14, 2025

Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)

Python 9,221 385 Updated Aug 12, 2025

Efficient Triton Kernels for LLM Training

Python 5,744 416 Updated Oct 14, 2025

A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API

Jupyter Notebook 12,917 1,933 Updated Aug 8, 2024

The Tensor (or Array)

Python 448 44 Updated Aug 12, 2024

Animation engine for explanatory math videos

Python 81,192 6,898 Updated Oct 14, 2025

LLM training in simple, raw C/CUDA

Cuda 27,836 3,224 Updated Jun 26, 2025

PyTorch native post-training library

Python 5,535 678 Updated Oct 14, 2025

Write scalable load tests in plain Python 🚗💨

Python 26,930 3,120 Updated Oct 14, 2025

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 10,007 959 Updated Jul 1, 2024

A simplified LLAMA implementation for training and inference tasks.

Python 33 3 Updated Jul 9, 2025
Python 546 43 Updated Dec 16, 2024

Accessible large language models via k-bit quantization for PyTorch.

Python 7,653 791 Updated Oct 2, 2025
Next
点击 这是indexloc提供的php浏览器服务,不要输入任何密码和下载