Replies: 2 comments 1 reply
-
@henrique @orivej Is there any command to set the waiting time of communication. |
Beta Was this translation helpful? Give feedback.
0 replies
-
@powermano, I think what you are looking for here is the "stall inspector". You can configure it, for instance, via these arguments to
|
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
How to increase the waiting time of communication?
When saving the big fc tensor of face recognition, I need to collect cls_head_params from all gpus and machines, When using all_reduce to collect params, the waiting time is longer than 60s. How to deal with this problem.
Beta Was this translation helpful? Give feedback.
All reactions