
Conversation

@mattmacy

This refactoring may well be too intrusive for your taste. I wanted to quickly test DataParallel support on a network I knew worked well before applying it to my own work. I hope it's useful to you.

  • refactor the DenseNet constructor so the model can be passed to nn.parallel.data_parallel (see the sketch after this list)
  • scale the batch size with the number of GPUs
  • make plot.py robust to larger batch sizes
  • move weight initialization to train.py and add optional Kaiming fan-in initialization
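For reference, a minimal sketch of what the DataParallel wrapping and batch-size scaling might look like. The placeholder model, the base batch size of 64, and the variable names are assumptions for illustration, not code from this PR:

```python
import torch
import torch.nn as nn

# Stand-in model; the real network would come from the repo's refactored
# DenseNet constructor (this placeholder is an assumption for illustration).
net = nn.Sequential(
    nn.Conv2d(3, 16, 3, padding=1),
    nn.ReLU(),
    nn.AdaptiveAvgPool2d(1),
    nn.Flatten(),
    nn.Linear(16, 10),
)

# Scale the effective batch size with the number of visible GPUs so that
# each device keeps roughly the original per-GPU batch size.
n_gpu = torch.cuda.device_count()
batch_size = 64 * max(1, n_gpu)

if n_gpu > 1:
    # nn.DataParallel splits each minibatch across the GPUs and gathers the
    # outputs on the default device; the functional form
    # nn.parallel.data_parallel(net, x) does the same per forward call.
    net = nn.DataParallel(net)
if torch.cuda.is_available():
    net = net.cuda()
```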

Matt Macy added 2 commits March 15, 2017 18:05
- scale batch size with ngpu
- call plot.py with the batch size so that it can scale its rolling N
  value accordingly (and not break when the batch size is greater than 64)
- add optional Kaiming initialization
- do weight initialization in train.py rather than in the network
  constructor (a sketch follows below)
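A rough sketch of weight initialization driven from train.py with an optional Kaiming fan-in mode. The function name, the kaiming flag, and the sqrt(2/n) fallback are assumptions, not the PR's actual code:

```python
import math
import torch.nn as nn

def init_weights(net, kaiming=False):
    """Initialize weights from train.py rather than in the constructor.

    With kaiming=True, conv weights get Kaiming (He) fan-in initialization;
    otherwise fall back to a sqrt(2/n) scheme keyed on fan-out, in the style
    of the original DenseNet code.
    """
    for m in net.modules():
        if isinstance(m, nn.Conv2d):
            if kaiming:
                nn.init.kaiming_normal_(m.weight, mode='fan_in',
                                        nonlinearity='relu')
            else:
                n = m.kernel_size[0] * m.kernel_size[1] * m.out_channels
                m.weight.data.normal_(0, math.sqrt(2.0 / n))
        elif isinstance(m, nn.BatchNorm2d):
            m.weight.data.fill_(1)
            m.bias.data.zero_()
        elif isinstance(m, nn.Linear):
            m.bias.data.zero_()
```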
@mattmacy closed this Mar 16, 2017