Efficiently read ~200 graphs per instance

aleSuglia · August 26, 2021, 2:55pm

Hi there,

In my use case, for each example of my dataset, I have ~200 graphs associated with it. Therefore, in my dataset reader __getitem__, I call dgl.load_graphs to load the stored information for each example. However, each call takes about ~0.3s (with peaks of 1.23s) which is quite a lot if I want to use a big batch size. Any suggestions about how to handle this use case?

VoVAllen · August 27, 2021, 6:01am

Hi,

How large is your graph?

aleSuglia · August 27, 2021, 9:25am

Each graph might have around 18 nodes. Each node has 3 set of features of different sizes (tensors).

VoVAllen · August 29, 2021, 12:30pm

Thanks. It seems not ideal for DGL to take such long time to load those graphs. Could you file an issue at github, with a pseudo code of your scenario? Thanks

system · September 28, 2021, 12:31pm

This topic was automatically closed 30 days after the last reply. New replies are no longer allowed.