How does DGL deal with out-of-memory graphs in general

lwwlwwl · February 1, 2024, 12:01am

Hi,

I’m trying to understand how DGL deals with out-of-memory graphs. edges file and features file are usually the ones that may cause the problem. For small graphs like products, I see everything is read and stored in memory beforehand (like before the training starts). However, for larger graphs, I am wondering if there is any specific way that DGL uses to avoid OOM. Specifically, does DGL support features that don’t fit in memory? How about edges?
(I saw in other posts uva is mentioned but it’s like sharing DRAM and GPU memory?) Thanks in advance

BarclayII · February 1, 2024, 2:03am

You can look at the OnDiskDataset class introduced in DGL 2.0: Composing OnDiskDataset from raw data — DGL 2.0.0 documentation. It supports loading features from disk.

Currently we do not support out-of-core graphs. We have plans to support it in the future.

lwwlwwl · February 7, 2024, 5:51pm

Thank you for the pointer.

It shows “For homogeneous graph, we just need to save edges(namely node pairs) into CSV file”. Does that mean DGL only supports CSV files when reading edges?

system · March 8, 2024, 5:52pm

This topic was automatically closed 30 days after the last reply. New replies are no longer allowed.