+
Skip to content
View gagan3012's full-sized avatar
🎯
Focusing
🎯
Focusing

Sponsors

@comet-ml

Organizations

@conda-forge @UBC-NLP @EddieHubCommunity @openwater-fall2020 @ubcdsc @jupyter-naas @accelerateplus

Block or report gagan3012

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results
TypeScript 6 2 Updated Oct 10, 2025

A lightweight RAG agent for processing markdown documents

Python 4 1 Updated Apr 8, 2025

Nano vLLM

Python 7,050 902 Updated Aug 31, 2025

VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning

Python 1,725 176 Updated Oct 9, 2025

Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.

Python 39,126 5,203 Updated Oct 12, 2025
Jupyter Notebook 183 24 Updated Oct 12, 2025

Async RL Training at Scale

Python 698 112 Updated Oct 13, 2025

Persona Vectors: Monitoring and Controlling Character Traits in Language Models

Python 258 54 Updated Jul 30, 2025

Environments for LLM Reinforcement Learning

Python 3,284 388 Updated Oct 12, 2025

Deep learning at the speed of light.

Rust 2,556 166 Updated Oct 13, 2025

Digital Mind Extension

JavaScript 6,537 1,007 Updated Jul 20, 2025

UQLM: Uncertainty Quantification for Language Models, is a Python package for UQ-based LLM hallucination detection

Python 1,053 110 Updated Oct 13, 2025

Code and data to accompany Racing Thoughts by Lepori et al. 2025

Jupyter Notebook 3 1 Updated Feb 8, 2025

PAIR.withgoogle.com and friend's work on interpretability methods

JavaScript 204 34 Updated Sep 23, 2025

Reproducing Anthropic’s tracing-the-thoughts interpretability work on open models

Jupyter Notebook 4 1 Updated Apr 28, 2025
Python 156 9 Updated Apr 17, 2025

General Reasoner: Advancing LLM Reasoning Across All Domains [NeurIPS25]

Python 183 10 Updated Jun 10, 2025

smolbox of recipies

Python 28 4 Updated Apr 23, 2025

This is the repo of developing reasoning models in the specific domain of financial, aim to enhance models capabilities in handling financial reasoning tasks.

Jupyter Notebook 66 12 Updated Jun 23, 2025

Just a plain, simple and elegant one-page theme for research/academia.

HTML 120 97 Updated May 29, 2024

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 7,890 693 Updated Feb 10, 2025

RLHF experiments on a single A100 40G GPU. Support PPO, GRPO, REINFORCE, RAFT, RLOO, ReMax, DeepSeek R1-Zero reproducing.

Python 73 11 Updated Feb 19, 2025

Simple RL training for reasoning

Python 3,763 280 Updated Aug 3, 2025

Fully open reproduction of DeepSeek-R1

Python 25,536 2,398 Updated Sep 8, 2025

AllenAI's post-training codebase

Python 3,240 447 Updated Oct 13, 2025

Minimal reproduction of DeepSeek R1-Zero

Python 12,258 1,510 Updated Apr 24, 2025

Training Large Language Model to Reason in a Continuous Latent Space

Python 1,287 130 Updated Aug 12, 2025

🔍 Search-o1: Agentic Search-Enhanced Large Reasoning Models [EMNLP 2025]

Python 1,063 94 Updated Aug 21, 2025
Next
点击 这是indexloc提供的php浏览器服务,不要输入任何密码和下载