For a certain GNN model, we first initialize the node representation, (an initialization embedding layer) and then aggregate the information of neighbor nodes, and finally use the aggregated representation to calculate the loss for certain tasks, and then backpropagate .
Why don’t we use the representation of the embedding layer? Is there a misunderstanding?
What is used for other tasks is also the representation after aggregation, so what is the use of the embedding layer, only the parameters?
Thanks for your help!