Dataloading efficiency bottleneck at to_block

cathera · September 6, 2022, 2:43am

I tried benchmarking NeighborSampler and surprisingly found that the bottleneck is at transforming frontier to block, which takes up to 80% of the sampling time. If I move frontier to gpu before performing to_block it gets a lot faster, but this consumes a lot of gpu memory. Is there any other workaround?

BarclayII · September 7, 2022, 9:42am

You could probably try to use UVA sampling by setting use_uva=True in the DataLoader?

system · October 7, 2022, 9:43am

This topic was automatically closed 30 days after the last reply. New replies are no longer allowed.