-
Do the Keras examples from the Horovod repo work for you? https://github.com/horovod/horovod/tree/master/examples/keras Those examples use
-
I tested it with https://github.com/horovod/horovod/blob/master/examples/tensorflow2/tensorflow2_keras_mnist.py and it worked. That script uses tf.optimizers.Adam(scaled_lr).
-
Let's move this to Discussions, as the problem is probably specific to your script.
-
@maxhgerlach |
-
I think I got it working. I was using the TensorBoard callback and adding it only to my chief worker, which was causing some rank issues. Now every worker gets the TensorBoard callback, and the Horovod training seems to run just fine!
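
For anyone hitting the same thing, here is a minimal sketch of what "every worker gets the TensorBoard callback" could look like. It is not the original script: the helper names `log_dir_for_rank` and `build_callbacks`, the `./logs` path, and the per-rank subdirectory layout are all assumptions made for illustration, and the thread does not say why the chief-only callback caused problems. The sketch assumes `hvd.init()` has already been called.

```python
import os


def log_dir_for_rank(base_dir: str, rank: int) -> str:
    """Hypothetical helper: give each worker its own TensorBoard directory
    so that workers do not write into the same event files."""
    return os.path.join(base_dir, f"rank_{rank}")


def build_callbacks(base_log_dir: str = "./logs"):
    """Hypothetical helper: build the same callback list on every worker,
    instead of adding TensorBoard only on the chief (rank 0)."""
    # Imported lazily so log_dir_for_rank can be used without TensorFlow/Horovod.
    import tensorflow as tf
    import horovod.tensorflow.keras as hvd

    # Assumes hvd.init() was called earlier in the training script.
    return [
        # Broadcast initial variables from rank 0 so all workers start in sync.
        hvd.callbacks.BroadcastGlobalVariablesCallback(0),
        # Every rank gets a TensorBoard callback, each with its own directory.
        tf.keras.callbacks.TensorBoard(
            log_dir=log_dir_for_rank(base_log_dir, hvd.rank())
        ),
    ]
```

The list returned by `build_callbacks()` would then be passed as `callbacks=` to `model.fit(...)` on every worker. Writing to one directory per rank is just one way to keep the logs from colliding; pointing TensorBoard at the `rank_0` directory gives the chief's view.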