I have a set of independent graphs that have features “degree” and “strat” (either 1 or 0), and I’m trying to predict feature * strat. I trained an MLP to do this based on the features of individual nodes. No problem.
I try to train a GCN to do the same thing:
But the loss doesn’t improve and on inspecting the logits, I notice they’re all very similar and nowhere close to the labels: