Hi!
I have just started using DGL and the main motivation was that it simplifies working with graph data a lot.
However, I have encountered a following issue: whenever I try to use to test my model on Reddit dataset, it does not work.
If I use torch.nn.DataParallel
, it throws a following error: dgl._ffi.base.DGLError: Expect number of features to match number of nodes (len(u)). Got 58242 and 232965 instead.
This is probably due to the fact that Reddit dataset is not exactly compatible with PyTorch dataloader from what I can see in the code.
I also tried using torch.nn.parallel.DistributedDataParallel
(with one program - multiple GPUs approach), however, it freezes when I try to load model into torch.nn.parallel.DistributedDataParallel
.
However, because the graph is quite large, it does not fit into any of the machines that I have available.
Is there anything I can do about that? Thanks.