+
Skip to content
View kykim0's full-sized avatar

Organizations

@sisl @JuliaPOMDP @StanfordVL

Block or report kykim0

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Scalable toolkit for efficient model reinforcement

Python 515 73 Updated Jul 17, 2025

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 21,544 2,644 Updated Jul 3, 2025

Code for "Learning Diverse Attacks on Large Language Models for Robust Red-Teaming and Safety Tuning", ICLR 2025.

Python 7 3 Updated Jun 30, 2025
Python 37 1 Updated Apr 16, 2025

SkyRL: A Modular Full-stack RL Library for LLMs

Python 609 59 Updated Jul 17, 2025

Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.

Python 307 44 Updated Jun 23, 2025

Darwin Gödel Machine: Open-Ended Evolution of Self-Improving Agents

Python 1,520 322 Updated Jun 12, 2025

Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. arXiv:2408.07666.

468 23 Updated Jul 12, 2025

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 2,865 218 Updated Jul 11, 2025

A reading list on LLM based Synthetic Data Generation 🔥

1,345 78 Updated Jun 5, 2025

Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models

519 18 Updated Jun 30, 2025

Democratizing Reinforcement Learning for LLMs

Jupyter Notebook 3,816 357 Updated Jul 17, 2025

Official Repository of Absolute Zero Reasoner

Python 1,608 272 Updated Jul 1, 2025

L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning

Jupyter Notebook 228 28 Updated May 14, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 54,461 6,675 Updated Jul 14, 2025

Sky-T1: Train your own O1 preview model within $450

Python 3,305 333 Updated Jul 12, 2025

Recent research papers about Foundation Models for Combinatorial Optimization

326 23 Updated Jul 12, 2025

This is the reading list for the survey "A Survey on the Optimization of LLM-based Agents ". We will keep adding papers and improving the list. Any suggestions and PRs are welcome!

124 13 Updated Jul 6, 2025

GenAI for Optimization and Decision Intelligence

Python 467 78 Updated Apr 29, 2025

GPU-optimized version of the MuJoCo physics simulator, designed for NVIDIA hardware.

Python 631 59 Updated Jul 16, 2025

A collection of agents that use Large Language Models (LLMs) to perform tasks common on our day to day jobs in cyber security.

Jupyter Notebook 137 18 Updated May 7, 2024

An open-source, code-first Python toolkit for building, evaluating, and deploying sophisticated AI agents with flexibility and control.

Python 10,998 1,464 Updated Jul 17, 2025

Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are commi…

Python 6,625 641 Updated Jul 17, 2025

Mirror of official Lustre development repository http://git.whamcloud.com/

C 194 63 Updated Jul 14, 2025

Code for the Fractured Entangled Representation Hypothesis position paper!

Jupyter Notebook 135 14 Updated May 20, 2025
Jupyter Notebook 209 21 Updated Jun 21, 2025

This repository contains related work, benchmarks and datasets for the paper "Large Language Models in Finance (FinLLMs)", currently under review.

262 46 Updated Apr 10, 2025

Repository hosting code for "Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations" (https://arxiv.org/abs/2402.17152).

Python 1,260 241 Updated Jul 16, 2025

An open protocol enabling communication and interoperability between opaque agentic applications.

TypeScript 18,406 1,834 Updated Jul 16, 2025

Ceph is a distributed object, block, and file storage platform

C++ 15,257 6,108 Updated Jul 17, 2025
Next
点击 这是indexloc提供的php浏览器服务,不要输入任何密码和下载