
Communication Time in Horovod - When is data being transferred? #3521

Answered by maxhgerlach
Snoeprol asked this question in Q&A

Is this the only operation where data is being sent and received?

Yes, the NCCL_ALLREDUCE and NCCL_ALLGATHER phases are where the actual tensor payload is transferred over your network.

Apart from that, there is some low-bandwidth negotiation traffic between the workers.

You might also want to look into profiling with Nsight Systems to get a more detailed understanding; see #2723 for some hooks in Horovod.
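To put a number on how long the payload phases take, one option is to sum the NCCL_ALLREDUCE and NCCL_ALLGATHER spans from a recorded timeline. The following is a minimal sketch, not Horovod's own tooling. It assumes the events are Chrome-trace style dictionaries in which `ph` is `"B"` (begin) or `"E"` (end), `ts` is a timestamp in microseconds, and an end event closes the most recent open begin event on the same `pid`/`tid`. The function name `transfer_time_us` and the sample events are made up for illustration.

```python
from collections import defaultdict

# Activity names from the answer above: the phases that move tensor data.
TRANSFER_ACTIVITIES = {"NCCL_ALLREDUCE", "NCCL_ALLGATHER"}


def transfer_time_us(events, activities=TRANSFER_ACTIVITIES):
    """Sum the time (in microseconds) spent in the given activities.

    Assumption: begin ("B") and end ("E") events nest per (pid, tid),
    so each "E" closes the most recently opened "B" on that thread.
    """
    open_events = defaultdict(list)  # (pid, tid) -> stack of (name, start_ts)
    totals = defaultdict(int)
    for ev in events:
        if not isinstance(ev, dict):
            continue  # skip anything that is not an event record
        key = (ev.get("pid"), ev.get("tid"))
        phase = ev.get("ph")
        if phase == "B":
            open_events[key].append((ev.get("name"), ev["ts"]))
        elif phase == "E" and open_events[key]:
            name, start = open_events[key].pop()
            if name in activities:
                totals[name] += ev["ts"] - start
    return dict(totals)


# Hypothetical events for one tensor: negotiation first, then the allreduce
# with the NCCL transfer nested inside it.
sample = [
    {"ph": "B", "name": "NEGOTIATE_ALLREDUCE", "ts": 0, "pid": 1, "tid": 1},
    {"ph": "E", "ts": 40, "pid": 1, "tid": 1},
    {"ph": "B", "name": "ALLREDUCE", "ts": 40, "pid": 1, "tid": 1},
    {"ph": "B", "name": "NCCL_ALLREDUCE", "ts": 50, "pid": 1, "tid": 1},
    {"ph": "E", "ts": 250, "pid": 1, "tid": 1},
    {"ph": "E", "ts": 260, "pid": 1, "tid": 1},
]

print(transfer_time_us(sample))  # {'NCCL_ALLREDUCE': 200}
```

With a real trace, the events would come from something like `json.load(open("timeline.json"))`, where the file name is a placeholder; if the file is not valid JSON as written, it would need repairing before loading.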

Answer selected by Snoeprol