Hi,
I am trying neighbourhood sampling with GCN on OGBL-PPA Dataset for a link prediction task. I have implemented a 4-layer full batch GCN model on a 11 GB GPU. I was able to fit a model of 50K parameters onto the GPU. The model runs about 42 epochs in 12 hours of training.
On the other hand, when I have implemented neighbourhood sampling for the same model, I was able fit a significantly larger model of 100K parameters but the model only able to run 3 epochs in 12 hours of training time. I have followed the implementation as described here.
We can see a significant increase in training time for each epoch. Why is that the case? What is causing the overhead that is significantly increasing the training time?