kykim0

kykim0

Learn. Understand. Serve.

14 followers · 9 following

@google
Bay Area / Seoul

Achievements

Organizations

Starred repositories

NVIDIA-NeMo / RL

Scalable toolkit for efficient model reinforcement

Python 515 73 Updated Jul 17, 2025

microsoft / unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 21,544 2,644 Updated Jul 3, 2025

GFNOrg / red-teaming

Code for "Learning Diverse Attacks on Large Language Models for Robust Red-Teaming and Safety Tuning", ICLR 2025.

Python 7 3 Updated Jun 30, 2025

UCSB-NLP-Chang / ThinkPrune

Python 37 1 Updated Apr 16, 2025

NovaSky-AI / SkyRL

SkyRL: A Modular Full-stack RL Library for LLMs

Python 609 59 Updated Jul 17, 2025

SakanaAI / RLT

Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.

Python 307 44 Updated Jun 23, 2025

jennyzzt / dgm

Darwin Gödel Machine: Open-Ended Evolution of Self-Improving Agents

Python 1,520 322 Updated Jun 12, 2025

EnnengYang / Awesome-Model-Merging-Methods-Theories-Applications

Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. arXiv:2408.07666.

468 23 Updated Jul 12, 2025

PeterGriffinJin / Search-R1

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 2,865 218 Updated Jul 11, 2025

wasiahmad / Awesome-LLM-Synthetic-Data

A reading list on LLM based Synthetic Data Generation 🔥

1,345 78 Updated Jun 5, 2025

Eclipsess / Awesome-Efficient-Reasoning-LLMs

Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models

519 18 Updated Jun 30, 2025

agentica-project / rllm

Democratizing Reinforcement Learning for LLMs

Jupyter Notebook 3,816 357 Updated Jul 17, 2025

LeapLabTHU / Absolute-Zero-Reasoner

Official Repository of Absolute Zero Reasoner

Python 1,608 272 Updated Jul 1, 2025

cmu-l3 / l1

L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning

Jupyter Notebook 228 28 Updated May 14, 2025

hiyouga / LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 54,461 6,675 Updated Jul 14, 2025

NovaSky-AI / SkyThought

Sky-T1: Train your own O1 preview model within $450

Python 3,305 333 Updated Jul 12, 2025

ai4co / awesome-fm4co

Recent research papers about Foundation Models for Combinatorial Optimization

326 23 Updated Jul 12, 2025

YoungDubbyDu / Awesome-LLM-Agent-Optimization-Papers

This is the reading list for the survey "A Survey on the Optimization of LLM-based Agents ". We will keep adding papers and improving the list. Any suggestions and PRs are welcome!

124 13 Updated Jul 6, 2025

microsoft / OptiGuide

GenAI for Optimization and Decision Intelligence

Python 467 78 Updated Apr 29, 2025

google-deepmind / mujoco_warp

GPU-optimized version of the MuJoCo physics simulator, designed for NVIDIA hardware.

Python 631 59 Updated Jul 16, 2025

NVISOsecurity / cyber-security-llm-agents

A collection of agents that use Large Language Models (LLMs) to perform tasks common on our day to day jobs in cyber security.

Jupyter Notebook 137 18 Updated May 7, 2024

google / adk-python

An open-source, code-first Python toolkit for building, evaluating, and deploying sophisticated AI agents with flexibility and control.

Python 10,998 1,464 Updated Jul 17, 2025

microsoft / RD-Agent

Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are commi…

Python 6,625 641 Updated Jul 17, 2025

lustre / lustre-release

Mirror of official Lustre development repository http://git.whamcloud.com/

C 194 63 Updated Jul 14, 2025

akarshkumar0101 / fer

Code for the Fractured Entangled Representation Hypothesis position paper!

Jupyter Notebook 135 14 Updated May 20, 2025

google-deepmind / alphaevolve_results

Jupyter Notebook 209 21 Updated Jun 21, 2025

adlnlp / FinLLMs

This repository contains related work, benchmarks and datasets for the paper "Large Language Models in Finance (FinLLMs)", currently under review.

262 46 Updated Apr 10, 2025

meta-recsys / generative-recommenders

Repository hosting code for "Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations" (https://arxiv.org/abs/2402.17152).

Python 1,260 241 Updated Jul 16, 2025

a2aproject / A2A

An open protocol enabling communication and interoperability between opaque agentic applications.

TypeScript 18,406 1,834 Updated Jul 16, 2025

ceph / ceph

Ceph is a distributed object, block, and file storage platform

C++ 15,257 6,108 Updated Jul 17, 2025

kykim0

Organizations

Starred repositories

synthetic-data-generation