-
-
-
pecan_human_AI_coordination Public
Human-AI coordination experiments on Overcooked
-
mbpo_pytorch_offline Public
MBPO (paper: When to trust your model: Model-based policy optimization) in offline RL settings
-
Basic reinforcement learning algorithms. Including:DQN,Double DQN, Dueling DQN, SARSA, REINFORCE, baseline-REINFORCE, Actor-Critic,DDPG,DDPG for discrete action space, A2C, A3C, TD3, SAC, TRPO