Distributed partitioning for very large graphs

erickim555 · July 1, 2021, 1:46am

Context: I’m working with heterogeneous graphs that are too large to fit on a single machine. Thus, DistDGL is an extremely attractive framework that I’m exploring!

I see that DGL has support for performing graph partitioning in a distributed manner, via ParMETIS, which is great. However, in the docs I see this:

DGL provides a script named convert_partition.py, located in the tools directory, to convert the data in the partition files into :class:`dgl.DGLGraph` objects and save them into files. Note: convert_partition.py runs on a single machine. In the future, we will extend it to convert graph data in parallel across multiple machines.

dmlc/dgl/blob/a90296aa9c807acc391e8ef0f4aaa2052de24923/docs/source/guide/distributed-preprocessing.rst#convert-parmetis-outputs-to-dglgraph

.. _guide-distributed-preprocessing:

7.1 Preprocessing for Distributed Training
------------------------------------------

:ref:`(中文版) <guide_cn-distributed-preprocessing>`

DGL requires preprocessing the graph data for distributed training. This includes two steps:
1) partition a graph into subgraphs, 2) assign new IDs to nodes/edges. For relatively small
graphs, DGL provides a partitioning API :func:`dgl.distributed.partition_graph` that performs
the two steps above. The API runs on one machine. Therefore, if a graph is large, users will
need a large machine to partition it when using this API. In addition to this API, we also
provide a solution for partitioning a large graph on a cluster of machines (see Section 7.1.1).

:func:`dgl.distributed.partition_graph` supports both random partitioning
and a `Metis <http://glaros.dtc.umn.edu/gkhome/views/metis>`__-based partitioning.
The benefit of Metis partitioning is that it can generate
partitions with minimal edge cuts to reduce network communication for distributed training
and inference. DGL uses the latest version of Metis with options optimized for real-world
graphs with power-law distribution. After partitioning, the API constructs the partitioned results

This file has been truncated.
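The two preprocessing steps named in the excerpt (partition the graph, then assign new IDs) can be illustrated on a toy graph. This is a minimal pure-Python sketch, not DGL's implementation: the function names are made up for illustration, and the choice to give each partition one contiguous ID range is an assumption about how the relabeling is typically done.

```python
import random


def random_partition(num_nodes, num_parts, seed=0):
    """Step 1 (random variant): assign every node to a partition at random."""
    rng = random.Random(seed)
    return [rng.randrange(num_parts) for _ in range(num_nodes)]


def relabel_contiguous(node_parts, num_parts):
    """Step 2: give nodes new IDs so each partition owns one contiguous range.

    Returns (old_to_new, ranges), where ranges[p] is the half-open
    interval (start, end) of new IDs owned by partition p.
    """
    # sorted() is stable, so nodes keep their original order within a partition.
    order = sorted(range(len(node_parts)), key=lambda n: node_parts[n])
    old_to_new = [0] * len(node_parts)
    for new_id, old_id in enumerate(order):
        old_to_new[old_id] = new_id

    counts = [0] * num_parts
    for p in node_parts:
        counts[p] += 1

    ranges, start = [], 0
    for c in counts:
        ranges.append((start, start + c))
        start += c
    return old_to_new, ranges


def count_edge_cuts(edges, node_parts):
    """Number of edges whose endpoints live in different partitions."""
    return sum(1 for u, v in edges if node_parts[u] != node_parts[v])
```

For example, with the assignment `[1, 0, 1, 0]` for four nodes, `relabel_contiguous` maps the two nodes of partition 0 to new IDs 0 and 1 and the two nodes of partition 1 to new IDs 2 and 3. `count_edge_cuts` is the quantity that a Metis-style partitioner tries to keep small and that a random assignment ignores.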

Does this imply that the graph needs to be small enough to fit on a single machine? In other words, is convert_partition.py a “memory bottleneck” when processing very large graphs?

Thanks!

Eric

zhengda1936 · July 5, 2021, 12:03pm

Hello Eric,

convert_partition.py loads one partition at a time when constructing the DGLGraph for each partition. Here we assume that a single partition fits into memory. This assumption aligns with distributed training, in which each machine loads one partition into memory.
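The one-partition-at-a-time pattern described in this reply can be sketched as follows. This is not the actual convert_partition.py: the file names, the plain-text edge format, and the dictionary standing in for a graph object are all hypothetical, chosen only to show why peak memory is bounded by the largest partition instead of the whole graph.

```python
import os
import pickle


def convert_partitions(in_dir, out_dir, num_parts):
    """Convert per-partition edge lists one at a time.

    Hypothetical layout: in_dir/part{i}_edges.txt holds one "src dst"
    pair per line; the output is out_dir/part{i}.pkl.
    Returns the number of edges found in each partition.
    """
    os.makedirs(out_dir, exist_ok=True)
    sizes = []
    for part_id in range(num_parts):
        src, dst = [], []
        in_path = os.path.join(in_dir, f"part{part_id}_edges.txt")
        with open(in_path) as f:
            for line in f:
                if not line.strip():
                    continue  # skip blank lines
                u, v = line.split()
                src.append(int(u))
                dst.append(int(v))

        # Stand-in for constructing a graph object from this partition's edges.
        graph = {"src": src, "dst": dst}
        with open(os.path.join(out_dir, f"part{part_id}.pkl"), "wb") as f:
            pickle.dump(graph, f)

        sizes.append(len(src))
        # Drop references before the next iteration so that only one
        # partition is resident in memory at any time.
        del src, dst, graph
    return sizes
```

Because each iteration reads, converts, saves, and releases a single partition, the machine running the loop needs roughly as much memory as a training machine that later loads that same partition.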

erickim555 · July 21, 2021, 11:34pm

Thanks for the explanation @zhengda1936 !

A follow-up question: does the machine that partitions the graph (e.g., via dgl.distributed.partition_graph()) have to be able to load the entire graph into memory in order to do the partitioning? In other words, is partition_graph.py a “memory bottleneck” for extremely large graphs?

dmlc/dgl/blob/master/examples/pytorch/graphsage/experimental/partition_graph.py#L53

        balance_ntypes = g.ndata['train_mask']
    else:
        balance_ntypes = None

    if args.undirected:
        sym_g = dgl.to_bidirected(g, readonly=True)
        for key in g.ndata:
            sym_g.ndata[key] = g.ndata[key]
        g = sym_g

    dgl.distributed.partition_graph(g, args.dataset, args.num_parts, args.output,
                                    part_method=args.part_method,
                                    balance_ntypes=balance_ntypes,
                                    balance_edges=args.balance_edges,
                                    num_trainers_per_machine=args.num_trainers_per_machine)

zhengda1936 · July 27, 2021, 8:47am

dgl.distributed.partition_graph only works on a single machine, but we provide a distributed graph partitioning solution using ParMETIS.
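A rough way to judge whether the single-machine path is feasible is to estimate how much memory the edge list alone would take. The sketch below is a back-of-the-envelope lower bound under stated assumptions, not a measurement of DGL: it assumes edges are held as two arrays of 8-byte IDs, and the `copies` and `headroom` parameters are made-up knobs for temporary copies and for everything else that needs RAM (features, partitioner working memory).

```python
def estimate_edge_memory_gib(num_edges, bytes_per_id=8, copies=1):
    """Lower bound, in GiB, for holding an edge list as two ID arrays."""
    return num_edges * 2 * bytes_per_id * copies / 2**30


def fits_on_one_machine(num_edges, ram_gib, headroom=0.5, **kwargs):
    """True if the estimate stays within a fraction (headroom) of total RAM."""
    return estimate_edge_memory_gib(num_edges, **kwargs) <= ram_gib * headroom
```

For example, about one billion edges (2**30) come to 16 GiB for the ID arrays alone under these assumptions. If the estimate is anywhere near the machine's RAM, the distributed ParMETIS route mentioned above is the safer choice.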
