I am trying to understand these two concepts by reading code, but got a few questions. Can someone give a few hints?
The code documented here (dgl/dist_context.py at master · dmlc/dgl · GitHub) says that both num_servers and num_client are deprecated (already?). If so, which alternative configurations one should use for the behaviors as described?
Question on num_sever [Assuming it is not deprecated de-facto] . If there are multiple (say 10) server instances on one machine, and they all refer to the same graph partition, are they supposed to have each of their own server_id, as well as ip:port in config file? (confused partially because there is also ‘backup’ server, for which I couldn’t tell if they exist in an agnostic way or needs to be explicitly specified with an ip+port).
I am confused by the difference between ‘DGL_NUM_CLIENTS’ and “DGL_NUM_SAMPLER”, can you explain a bit? (another example is inside “connect_to_server()” method, client is also registering RPC service, which blurs the roles among them. dgl/rpc_client.py at master · dmlc/dgl · GitHub. Does that mean in fact ‘client’ has both role of client and server at the same time?)
Thanks a lot.