How to combine all layers' node embeddings in stochastic training

Hi All,
I am trying to implement LightGCN, a recommendation model using graph, with DGL. So it is a link prediction task on large heterograph with EdgeDataLoader, which samples some blocks for batch training, but causes that the output shape of every layer is different.

How can I get the final node embeddings by combining all the layers' outputs? Concretely, as in the LightGCN paper, I want the weighted sum over layers:

e = α₀·e⁽⁰⁾ + α₁·e⁽¹⁾ + ... + α_K·e⁽ᴷ⁾

class StochasticTwoLayerRGCN(nn.Module):
    def __init__(self, in_feat, hidden_feat, out_feat, rel_names):
        super().__init__()
        self.conv1 = dglnn.HeteroGraphConv({
                rel : dglnn.GraphConv(in_feat, hidden_feat, norm='right')
                for rel in rel_names
            })
        self.conv2 = dglnn.HeteroGraphConv({
                rel : dglnn.GraphConv(hidden_feat, out_feat, norm='right')
                for rel in rel_names
            })

    def forward(self, blocks, x):
        """
        how to combine different layers' output ?
        """
        print(x['item'].shape)  #torch.Size([91599, 64])

        x = self.conv1(blocks[0], x)
        print(x['item'].shape)  #torch.Size([91054, 64])
        print(blocks[0])
        """
        Block(num_src_nodes={'item': 91599, 'user': 52643},
               num_dst_nodes={'item': 91077, 'user': 52380},
               num_edges={('item', 'iu', 'user'): 2977926, ('user', 'ui', 'item'): 2977966},
               metagraph=[('item', 'user', 'iu'), ('user', 'item', 'ui')])
        """
        x = self.conv2(blocks[1], x)
        print(x['item'].shape)  #torch.Size([10796, 64])
        print(blocks[1])
        """
        Block(num_src_nodes={'item': 91077, 'user': 52380},
               num_dst_nodes={'item': 10736, 'user': 10094},
               num_edges={('item', 'iu', 'user'): 975459, ('user', 'ui', 'item'): 650950},
               metagraph=[('item', 'user', 'iu'), ('user', 'item', 'ui')])
        """
        return x

Any suggestions? Thanks!

The MFGs (the blocks) guarantee that the output nodes always appear first among the input nodes. As a consequence, the first few rows of every layer's output always correspond to the target nodes you have sampled.

So you can do something like the following:

x1 = self.conv1(blocks[0], x)
x2 = self.conv2(blocks[1], x1)
# ...
num_outputs = blocks[-1].num_dst_nodes()
return x1[:num_outputs] + x2[:num_outputs] + ...
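To make the slicing concrete, here is a minimal sketch with plain tensors standing in for the per-layer outputs of one sampled batch (the row counts loosely follow the printout in the question; the uniform weights 1/(K+1) are the LightGCN default and are an assumption here, not part of the snippet above):

```python
import torch

# Stand-ins for the per-layer outputs of one batch. Because MFGs put
# the output nodes first among the input nodes, the first `num_outputs`
# rows of every tensor refer to the same target nodes.
x0 = torch.randn(91599, 64)   # input features
x1 = torch.randn(91077, 64)   # after conv1
x2 = torch.randn(10736, 64)   # after conv2
num_outputs = 10736           # blocks[-1].num_dst_nodes() in real code

# LightGCN-style combination with uniform layer weights 1/(K+1) = 1/3.
final = (x0[:num_outputs] + x1[:num_outputs] + x2[:num_outputs]) / 3
```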

Hi BarclayII, many thanks!
My graph is a heterograph, so x1 is a dict. Following your suggestion, I implemented it as follows:

x1 = self.conv1(blocks[0], x)
x2 = self.conv2(blocks[1], x1)
# ...
num_items = blocks[-1].dstdata[dgl.NID]['item'].shape[0]
num_users = blocks[-1].dstdata[dgl.NID]['user'].shape[0]

item_embs = x['item'][:num_items] + x1['item'][:num_items] + ...
user_embs = x['user'][:num_users] + x1['user'][:num_users] + ...
# ...
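For reuse, the per-type slicing can be factored into a small helper. This is just a sketch of the pattern above; the function name `combine_layers` and the uniform default weights are my own choices, not DGL API:

```python
import torch

def combine_layers(layer_outputs, num_dst, alphas=None):
    """Weighted sum of per-layer heterograph embeddings over target nodes.

    layer_outputs: list of dicts mapping ntype -> tensor, where entry 0
        holds the input features and entry k holds layer k's output.
    num_dst: dict mapping ntype -> number of target nodes, e.g.
        {t: blocks[-1].num_dst_nodes(t) for t in blocks[-1].ntypes}.
    alphas: optional per-layer weights; defaults to uniform 1/(K+1),
        as in LightGCN.
    """
    if alphas is None:
        alphas = [1.0 / len(layer_outputs)] * len(layer_outputs)
    combined = {}
    for ntype in layer_outputs[-1]:
        n = num_dst[ntype]
        # Output nodes come first, so the first n rows of every layer's
        # output for this node type are aligned with the target nodes.
        combined[ntype] = sum(a * h[ntype][:n]
                              for a, h in zip(alphas, layer_outputs))
    return combined

# Toy usage with random tensors in place of real conv outputs.
outs = [{'item': torch.randn(8, 4), 'user': torch.randn(6, 4)},
        {'item': torch.randn(5, 4), 'user': torch.randn(4, 4)}]
embs = combine_layers(outs, {'item': 5, 'user': 4})
```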

I have implemented the weighted sum of all layers' node embeddings, and the model converges, so it seems to work correctly.

Many thanks again,
lichi

