Highlights
Starred repositories
UQLM (Uncertainty Quantification for Language Models) is a Python package for UQ-based LLM hallucination detection
Code and data to accompany "Racing Thoughts" by Lepori et al., 2025
PAIR.withgoogle.com and friends' work on interpretability methods
Reproducing Anthropic’s tracing-the-thoughts interpretability work on open models
General Reasoner: Advancing LLM Reasoning Across All Domains
Repository for developing reasoning models in the financial domain, aiming to enhance model capabilities on financial reasoning tasks.
Just a plain, simple and elegant one-page theme for research/academia.
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
RLHF experiments on a single A100 40G GPU. Supports PPO, GRPO, REINFORCE, RAFT, RLOO, ReMax, and DeepSeek R1-Zero reproduction.
Fully open reproduction of DeepSeek-R1
Minimal reproduction of DeepSeek R1-Zero
Training Large Language Model to Reason in a Continuous Latent Space
Search-o1: Agentic Search-Enhanced Large Reasoning Models
A series of technical reports on Slow Thinking with LLMs
🤗 smolagents: a barebones library for agents that think in code.
[ICLR 2025] Monet: Mixture of Monosemantic Experts for Transformers
FeatureAlignment = Alignment + Mechanistic Interpretability
Official repository for ACL 2025 paper "Model Extrapolation Expedites Alignment"
Efficient Triton Kernels for LLM Training
Arena-Hard-Auto: an automatic LLM benchmark.