Highlights
- Pro
Stars
A curated list of behavioral foundation model (BFM) papers, articles, tutorials, slides and projects
[ICLR 2025 Oral] PyTorch code for the paper "Open-World Reinforcement Learning over Long Short-Term Imagination"
[CVPR 2025] UniScene: Unified Occupancy-centric Driving Scene Generation
Implementations of Intention-conditioned Flow Occupancy Models (InFOM)
[ICML'25] The PyTorch implementation of paper: "AdaWorld: Learning Adaptable World Models with Latent Actions".
A framework for training world models with virtual environments, complete with annotated environment dataset (RetroAct), exploration agent (AutoExplore Agent), and GenieRedux-G - an implementation …
RLeXplore provides stable baselines of exploration methods in reinforcement learning, such as intrinsic curiosity module (ICM), random network distillation (RND) and rewarding impact-driven explora…
Accelerating Research in Plasticity-Motivated Deep Reinforcement Learning.
[ICCV2025] A Token-level Text Image Foundation Model for Document Understanding
A curated list of Diffusion Model in RL resources (continually updated)
Code from the Deep Reinforcement Learning in Action book from Manning, Inc
Access latex source of any arxiv.org paper directly on overleaf
[ICLR 2025 Oral] PyTorch code for the paper "Open-World Reinforcement Learning over Long Short-Term Imagination"
Make Zotero effective for us LaTeX holdouts
Using advances in generative modeling to learn reward functions from unlabeled videos.
A vue-based project page template for academic papers. (in development) https://junyaohu.github.io/academic-project-page-template-vue
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
[ICLR 24 Oral/Outstanding Paper Honorable Mention Award 🎉]
Odyssey: Empowering Minecraft Agents with Open-World Skills
Official repository for "Temporal Disentanglement of Representations for Improved Generalisation in Reinforcement Learning".
[ECCV 2024] 💐Official implementation of the paper "Diffusion Reward: Learning Rewards via Conditional Video Diffusion"
PPO x Family DRL Tutorial Course(决策智能入门级公开课:8节课帮你盘清算法理论,理顺代码逻辑,玩转决策AI应用实践 )
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites