This repository was archived by the owner on Aug 3, 2021. It is now read-only.

@miguelgfierro
Contributor

@okuchaiev small change, feel free not to accept it. While running experiments I noticed that with Adam the algorithm didn't converge, whereas with momentum it always converged.

I'm consistently seeing worse performance with Adam than with momentum across different DL tasks.

@okuchaiev okuchaiev merged commit f1b44a5 into NVIDIA:master Feb 9, 2018
@okuchaiev
Member

Actually, in the paper all experiments were done using SGD with momentum. I did try Adam, but it was harder to make it work as well as momentum on this problem.
By the way, here is an interesting ICLR 2018 paper regarding Adam: https://iclr.cc/Conferences/2018/Schedule?showEvent=78
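
For readers comparing the two optimizers discussed in this thread, here is a minimal sketch of their textbook update rules in plain Python. It is not the repository's training code: the function names, the toy objective `f(x) = x^2`, and every hyperparameter value are assumptions chosen only for illustration, and the sketch does not reproduce the convergence behaviour reported above.

```python
import math


def sgd_momentum_step(param, grad, velocity, lr=0.1, momentum=0.9):
    """One SGD-with-momentum update (velocity accumulates raw gradients)."""
    velocity = momentum * velocity + grad
    param = param - lr * velocity
    return param, velocity


def adam_step(param, grad, m, v, t, lr=0.1, beta1=0.9, beta2=0.999, eps=1e-8):
    """One Adam update with bias correction; t is the 1-based step count."""
    m = beta1 * m + (1 - beta1) * grad          # running mean of gradients
    v = beta2 * v + (1 - beta2) * grad * grad   # running mean of squared gradients
    m_hat = m / (1 - beta1 ** t)                # bias-corrected first moment
    v_hat = v / (1 - beta2 ** t)                # bias-corrected second moment
    param = param - lr * m_hat / (math.sqrt(v_hat) + eps)
    return param, m, v


def grad_fn(x):
    """Gradient of the toy objective f(x) = x^2 (an illustrative assumption)."""
    return 2.0 * x


def run_momentum(x0=5.0, steps=500):
    """Minimise the toy objective with SGD + momentum, returning the final x."""
    x, velocity = x0, 0.0
    for _ in range(steps):
        x, velocity = sgd_momentum_step(x, grad_fn(x), velocity)
    return x


def run_adam(x0=5.0, steps=500):
    """Minimise the toy objective with Adam, returning the final x."""
    x, m, v = x0, 0.0, 0.0
    for t in range(1, steps + 1):
        x, m, v = adam_step(x, grad_fn(x), m, v, t)
    return x


if __name__ == "__main__":
    print("momentum:", run_momentum())
    print("adam:    ", run_adam())
```

One difference the sketch makes visible: the momentum step scales with the gradient magnitude, while Adam normalises each step by a running estimate of the squared gradient, so its first step has size close to `lr` regardless of how large the gradient is. Whether that helps or hurts depends on the problem.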

@miguelgfierro miguelgfierro deleted the patch-1 branch February 9, 2018 20:49