Stars
Ongoing research training transformer models at scale
Matplotlib styles for scientific plotting
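A minimal sketch of how such scientific Matplotlib styles are typically applied (assumes the scienceplots package; the "no-latex" variant is used here so the snippet runs without a LaTeX installation — the data and file name are illustrative):

```python
import matplotlib.pyplot as plt
import numpy as np
import scienceplots  # importing registers the styles with Matplotlib

# "science" is the base preset; "ieee" is another documented option
plt.style.use(["science", "no-latex"])

x = np.linspace(0, 2 * np.pi, 200)
fig, ax = plt.subplots()
ax.plot(x, np.sin(x), label="sin(x)")
ax.set_xlabel("x")
ax.set_ylabel("y")
ax.legend()
fig.savefig("example.pdf")
```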
Code for the Paper: "STAMP Your Content: Proving Dataset Membership via Watermarked Rephrasings"
Code release to accompany the paper "Persistent Pre-training Poisoning of LLMs"
[NeurIPS D&B '25] The one-stop repository for large language model (LLM) unlearning. Supports TOFU, MUSE, WMDP, and many unlearning methods with easy feature extensibility.
Open-source framework for the research and development of foundation models.
PyTorch building blocks for the OLMo ecosystem
The code for creating the iGSM datasets in the papers "Physics of Language Models Part 2.1, Grade-School Math and the Hidden Reasoning Process" (arXiv 2407.20311) and "Physics of Language Models Part 2…
What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets
EleutherAI/nanoGPT-mup
Forked from karpathy/nanoGPT. The simplest, fastest repository for training/finetuning medium-sized GPTs.
A library for unit scaling in PyTorch
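A concept sketch of the unit-scaling idea in plain PyTorch, not the library's own API: weights are drawn with unit variance and the matrix product is rescaled by 1/sqrt(fan_in) so activations stay near unit variance at initialization. The function and shapes below are illustrative assumptions:

```python
import math
import torch

def unit_scaled_linear(x: torch.Tensor, weight: torch.Tensor) -> torch.Tensor:
    # x: (batch, fan_in); weight: (fan_out, fan_in) with ~unit-variance entries
    fan_in = weight.shape[1]
    return (x @ weight.t()) / math.sqrt(fan_in)

x = torch.randn(32, 1024)
w = torch.randn(512, 1024)      # plain unit-variance init, no extra scaling
y = unit_scaled_linear(x, w)
print(y.std())                  # roughly 1.0 at initialization
```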
An extremely fast Python package and project manager, written in Rust.
Train transformer language models with reinforcement learning.
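A minimal supervised fine-tuning sketch with trl, following its documented quickstart pattern (assumes a recent trl release; the model checkpoint, dataset, and step count are placeholder choices, not tied to any starred repo):

```python
from datasets import load_dataset
from trl import SFTConfig, SFTTrainer

# illustrative dataset and model; swap in your own
dataset = load_dataset("trl-lib/Capybara", split="train")

trainer = SFTTrainer(
    model="Qwen/Qwen2.5-0.5B",
    args=SFTConfig(output_dir="sft-out", max_steps=100),
    train_dataset=dataset,
)
trainer.train()
```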
Fully open reproduction of DeepSeek-R1
A comprehensive repository of reasoning tasks for LLMs (and beyond)
Minimal reproduction of DeepSeek R1-Zero
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
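A minimal sketch of wrapping a PyTorch model with DeepSpeed's engine via its documented `deepspeed.initialize` entry point (the model, config values, and loss are illustrative; in practice this is launched with the `deepspeed` launcher across ranks):

```python
import torch
import deepspeed

model = torch.nn.Linear(1024, 1024)

ds_config = {
    "train_micro_batch_size_per_gpu": 8,
    "optimizer": {"type": "Adam", "params": {"lr": 1e-4}},
    "zero_optimization": {"stage": 1},
}

# initialize returns (engine, optimizer, dataloader, lr_scheduler)
engine, optimizer, _, _ = deepspeed.initialize(
    model=model,
    model_parameters=model.parameters(),
    config=ds_config,
)

x = torch.randn(8, 1024).to(engine.device)
loss = engine(x).pow(2).mean()
engine.backward(loss)  # the engine handles loss scaling / gradient partitioning
engine.step()
```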
How much can we forget about Data Contamination? (ICML 2025)
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
The paper list on data contamination for large language model evaluation.
A toolkit for quantitative evaluation of data attribution methods.