Add lr_scheduler option for Onpolicy algorithm #318

ChenDRAG · 2021-03-21T10:38:10Z

Linear lr decay is important for both the PPO and REINFORCE benchmarks, as in #307. You can do something like the following to change the lr as you like during training. My solution is quite simple and straightforward.

import torch.nn as nn
from torch.optim import Adam
from torch.optim.lr_scheduler import LambdaLR


class Net(nn.Module):
    def __init__(self):
        super().__init__()
        self.fc = nn.Linear(1, 10)

    def forward(self, x):
        return self.fc(x)


model = Net()

optimizer = Adam(model.parameters(), lr=0.01)
max_update_num = 10
# Linear decay: the lr multiplier falls from 1 to 0 over max_update_num updates.
lambda1 = lambda epoch: 1 - epoch / max_update_num
# LambdaLR is imported directly, so it is not referenced through lr_scheduler.
optimizer.scheduler = LambdaLR(optimizer, lr_lambda=lambda1)

Then pass in the optimizer, and the policy will change the lr automatically on every update.

This way, no new APIs need to be added.
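To illustrate what "change lr every update" amounts to, here is a minimal sketch of an update loop that steps a scheduler after each optimizer step. The `update` function and its `lr_scheduler` argument are hypothetical stand-ins written for this example, not Tianshou's actual policy code, and the loss is a dummy value.

```python
import torch
import torch.nn as nn
from torch.optim import Adam
from torch.optim.lr_scheduler import LambdaLR

model = nn.Linear(1, 10)
optimizer = Adam(model.parameters(), lr=0.01)
max_update_num = 10
# Linear decay of the lr multiplier from 1 down to 0.
scheduler = LambdaLR(optimizer, lr_lambda=lambda epoch: 1 - epoch / max_update_num)


def update(optimizer, lr_scheduler=None):
    """Hypothetical stand-in for one policy update (assumption, not Tianshou code)."""
    loss = model(torch.ones(4, 1)).pow(2).mean()  # dummy loss for illustration
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    # Update the learning rate only if a scheduler was given.
    if lr_scheduler is not None:
        lr_scheduler.step()


lrs = []
for _ in range(max_update_num):
    lrs.append(optimizer.param_groups[0]["lr"])  # lr used for this update
    update(optimizer, scheduler)
final_lr = optimizer.param_groups[0]["lr"]
```

With this lambda the multiplier reaches zero after `max_update_num` updates and turns negative beyond that, so if training may run longer it is probably safer to clamp it, for example with `max(0.0, 1 - epoch / max_update_num)`.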

add lr_scheduler option in PGPolicy/A2CPolicy/PPOPolicy

ChenDRAG and others added 3 commits March 21, 2021 18:37

le_schedular

8717ed4

change how to use # update learning rate if given lr_scheduler

e53adf6

fix test

334456e

Trinkle23897 approved these changes Mar 22, 2021

Trinkle23897 merged commit 2c11b6e into thu-ml:master Mar 22, 2021

Trinkle23897 linked an issue Apr 21, 2021 that may be closed by this pull request

Plans of releasing mujoco benchmark of onpolicy algorithms(VPG, A2C, PPO) #307

Closed

BFAnas pushed a commit to BFAnas/tianshou that referenced this pull request May 5, 2024

Add lr_scheduler option for Onpolicy algorithm (thu-ml#318)

5ac4102

add lr_scheduler option in PGPolicy/A2CPolicy/PPOPolicy
