+
Skip to content
View sbordt's full-sized avatar

Block or report sbordt

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Ongoing research training transformer models at scale

Python 13,885 3,165 Updated Oct 19, 2025

AllenAI's post-training codebase

Python 3,257 450 Updated Oct 19, 2025

Matplotlib styles for scientific plotting

Python 8,295 770 Updated May 13, 2025

Code for the Paper: "STAMP Your Content: Proving Dataset Membership via Watermarked Rephrasings"

Python 6 2 Updated Jul 11, 2025

Code release to accompany the paper "Persistent Pre-training Poisoning of LLMs"

Python 8 Updated Dec 10, 2024

[NeurIPS D&B '25] The one-stop repository for large language model (LLM) unlearning. Supports TOFU, MUSE, WMDP, and many unlearning methods with easily feature extensibility.

Python 394 100 Updated Oct 7, 2025

Open-source framework for the research and development of foundation models.

HTML 503 50 Updated Oct 20, 2025

PyTorch building blocks for the OLMo ecosystem

Python 306 57 Updated Oct 18, 2025

Reproducible, flexible LLM evaluations

Python 257 49 Updated Oct 18, 2025

The code for creating the iGSM datasets in papers "Physics of Language Models Part 2.1, Grade-School Math and the Hidden Reasoning Process" (arxiv 2407.20311) and "Physics of Language Models Part 2…

Python 78 6 Updated Jan 12, 2025

A lightweight suffix-sorting library

C 393 88 Updated Mar 25, 2020
Python 72 11 Updated Aug 7, 2025

What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets

Python 223 21 Updated Nov 16, 2024

View model summaries in PyTorch!

Python 2,868 132 Updated Oct 13, 2025

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 166 15 Updated Jun 27, 2025

A library for unit scaling in PyTorch

Jupyter Notebook 131 11 Updated Jul 11, 2025

An extremely fast Python package and project manager, written in Rust.

Rust 70,350 2,132 Updated Oct 20, 2025

Train transformer language models with reinforcement learning.

Python 15,946 2,241 Updated Oct 19, 2025

Fully open reproduction of DeepSeek-R1

Python 25,559 2,396 Updated Sep 8, 2025

A comprehensive repository of reasoning tasks for LLMs (and beyond)

JavaScript 450 57 Updated Sep 27, 2024

Minimal reproduction of DeepSeek R1-Zero

Python 12,274 1,512 Updated Apr 24, 2025

DataComp for Language Models

HTML 1,378 126 Updated Sep 9, 2025

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 40,441 4,584 Updated Oct 15, 2025
JavaScript 28 Updated Feb 11, 2025

maximal update parametrization (µP)

Jupyter Notebook 1,611 106 Updated Jul 17, 2024

How much can we forget about Data Contamination? (ICML 2025)

Jupyter Notebook 6 2 Updated Oct 16, 2025

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,729 274 Updated Jul 18, 2025

The Paper List on Data Contamination for Large Language Models Evaluation.

101 4 Updated Sep 1, 2025

A toolkit for quantitative evaluation of data attribution methods.

Jupyter Notebook 53 Updated Jul 14, 2025
Next
点击 这是indexloc提供的php浏览器服务,不要输入任何密码和下载