+
Skip to content
View gagan3012's full-sized avatar
🎯
Focusing
🎯
Focusing

Sponsors

@comet-ml

Organizations

@conda-forge @UBC-NLP @EddieHubCommunity @openwater-fall2020 @ubcdsc @jupyter-naas @accelerateplus

Block or report gagan3012

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Digital Mind Extension

JavaScript 4,169 709 Updated Jul 13, 2025

UQLM: Uncertainty Quantification for Language Models, is a Python package for UQ-based LLM hallucination detection

Python 777 76 Updated Jul 11, 2025

Code and data to accompany Racing Thoughts by Lepori et al. 2025

Jupyter Notebook 3 1 Updated Feb 8, 2025

PAIR.withgoogle.com and friend's work on interpretability methods

JavaScript 194 32 Updated Jul 8, 2025

Reproducing Anthropic’s tracing-the-thoughts interpretability work on open models

Jupyter Notebook 4 1 Updated Apr 28, 2025
Python 156 10 Updated Apr 17, 2025

General Reasoner: Advancing LLM Reasoning Across All Domains

Python 148 7 Updated Jun 10, 2025

smolbox of recipies

Python 28 4 Updated Apr 23, 2025

This is the repo of developing reasoning models in the specific domain of financial, aim to enhance models capabilities in handling financial reasoning tasks.

Jupyter Notebook 59 10 Updated Jun 23, 2025

Just a plain, simple and elegant one-page theme for research/academia.

HTML 111 96 Updated May 29, 2024

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 7,717 676 Updated Feb 10, 2025

RLHF experiments on a single A100 40G GPU. Support PPO, GRPO, REINFORCE, RAFT, RLOO, ReMax, DeepSeek R1-Zero reproducing.

Python 66 11 Updated Feb 19, 2025

Simple RL training for reasoning

Python 3,676 274 Updated Apr 10, 2025

Fully open reproduction of DeepSeek-R1

Python 25,019 2,330 Updated Jul 10, 2025

AllenAI's post-training codebase

Python 3,060 414 Updated Jul 13, 2025

Minimal reproduction of DeepSeek R1-Zero

Python 12,002 1,491 Updated Apr 24, 2025

Training Large Language Model to Reason in a Continuous Latent Space

Python 1,185 109 Updated Jan 24, 2025

Search-o1: Agentic Search-Enhanced Large Reasoning Models

Python 980 89 Updated May 13, 2025

A series of technical report on Slow Thinking with LLM

Python 708 38 Updated Jun 9, 2025
Python 54 Updated Mar 14, 2025

🤗 smolagents: a barebones library for agents that think in code.

Python 21,169 1,849 Updated Jul 11, 2025

Deep Reasoning Translation (DRT) Project

225 9 Updated May 27, 2025

[ICLR 2025] Monet: Mixture of Monosemantic Experts for Transformers

Python 68 3 Updated Jun 23, 2025

FeatureAlignment = Alignment + Mechanistic Interpretability

Python 28 1 Updated Mar 8, 2025
Python 1 Updated Dec 28, 2024

Official repository for ACL 2025 paper "Model Extrapolation Expedites Alignment"

Python 74 3 Updated May 20, 2025

Efficient Triton Kernels for LLM Training

Python 5,356 368 Updated Jul 11, 2025

Arena-Hard-Auto: An automatic LLM benchmark.

Python 866 109 Updated Jun 21, 2025
Next
点击 这是indexloc提供的php浏览器服务,不要输入任何密码和下载