
yhsung/Lab6-DQN-DDPG

 
 


Lab6-DQN-DDPG

  • Deadline: 12/09 (Wed) 12:00 p.m.
  • (No demo)

OpenAI Gym LunarLander-v2

Command usage

               [--capacity CAPACITY] [--batch_size BATCH_SIZE] [--lr LR]
               [--eps_decay EPS_DECAY] [--eps_min EPS_MIN] [--gamma GAMMA]
               [--freq FREQ] [--target_freq TARGET_FREQ] [--test_only]
               [--render] [--test_epsilon TEST_EPSILON] [-d DEVICE] [-m MODEL]
               [--logdir LOGDIR] [--seed SEED]

DLP DDQN Lab

optional arguments:
  -h, --help            show this help message and exit
  --warmup WARMUP       number of warmup steps (default: 10000)
  --episode EPISODE     upper limit of training episodes (default: 1200)
  --capacity CAPACITY   capacity of replay buffer (default: 10000)
  --batch_size BATCH_SIZE
                        mini batch size extract from replay buffer (default:
                        128)
  --lr LR               learning rate (default: 0.0005)
  --eps_decay EPS_DECAY
                        epsilon decay rate (default: 0.995)
  --eps_min EPS_MIN     lower bound of epsilon (default: 0.01)
  --gamma GAMMA         gamma for update Q value (default: 0.99)
  --freq FREQ           interval to update behavior network (default: 4)
  --target_freq TARGET_FREQ
                        interval to update target network (default: 1000)
  --test_only           conduct test only runs (default: False)
  --render              render display (default: False)
  --test_epsilon TEST_EPSILON
                        test epsilon (default: 0.001)
  -d DEVICE, --device DEVICE
                        device used for training / testing (default: cuda)
  -m MODEL, --model MODEL
                        path to pretrained model / model save path (default:
                        models/ddqn-2020-11-30-23-51-10.pth)
  --logdir LOGDIR       path to tensorboard log (default:
                        log/ddqn/2020-11-30-23-51-10)
  --seed SEED           random seed (default: 2021111)
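
The help text above lists the exploration and update-interval options but does not say how they are applied. The following is a minimal sketch of one common reading, not the repository's actual code. It assumes that epsilon starts at 1.0 and is multiplied by `--eps_decay` once per episode until it reaches `--eps_min`, and that no network updates happen during the `--warmup` steps. The function names `epsilon_schedule` and `should_update` are hypothetical and exist only for illustration.

```python
def epsilon_schedule(episodes, eps_start=1.0, eps_decay=0.995, eps_min=0.01):
    """Return the epsilon used in each episode.

    Assumption: multiplicative decay applied once per episode,
    starting from eps_start and clamped at eps_min.
    """
    eps = eps_start
    values = []
    for _ in range(episodes):
        values.append(eps)
        eps = max(eps_min, eps * eps_decay)
    return values


def should_update(total_steps, warmup=10000, freq=4, target_freq=1000):
    """Return (update_behavior, update_target) for a global step count.

    Assumption: no updates during warmup; afterwards the behavior
    network is updated every `freq` steps and the target network is
    synchronized every `target_freq` steps.
    """
    if total_steps < warmup:
        return False, False
    return total_steps % freq == 0, total_steps % target_freq == 0


if __name__ == "__main__":
    schedule = epsilon_schedule(1200)
    print(f"epsilon in first episode: {schedule[0]:.3f}")
    print(f"epsilon in last episode:  {schedule[-1]:.3f}")
    print("updates at step 12000:", should_update(12000))
```

Under these assumptions, the default settings would bring epsilon down to its lower bound well before the 1200-episode limit, so the later episodes would run almost entirely greedy.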
