Some questions w.r.t Pm_Pd in LGNN

Hi, I have two questions w.r.t Pm_Pd as implemented in DGL-LGNN. I read the original paper, and I think the matrices Pm and Pd should take the forms shown in this link, respectively. So what form does Pm_Pd take in DGL, given that it is loaded from a pickle file in the source code?
Another question concerns LGNN's forward method:

def forward(self, g, lg, pm_pd):

pm_pd is passed directly to the forward method rather than computed during the forward pass, so pm_pd should be batched before training. We can view a batched graph as several disconnected graphs; when batching pm, for example, it seems that each graph's pm should become a block in one large matrix, which could be implemented with torch.block_diag. The batched Pm would then be a block matrix. Is that right?
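For example, a rough sketch of what I mean with two toy dense matrices (the values are arbitrary, just to show the shapes):

import torch

# Toy per-graph Pm matrices of shape (num_nodes_i, num_edges_i).
pm_g1 = torch.tensor([[1., 0.],
                      [0., 1.],
                      [1., 1.]])   # graph 1: 3 nodes, 2 edges
pm_g2 = torch.tensor([[1.],
                      [1.]])       # graph 2: 2 nodes, 1 edge

# Each per-graph Pm becomes one block on the diagonal of the batched matrix.
pm_batched = torch.block_diag(pm_g1, pm_g2)
print(pm_batched.shape)  # torch.Size([5, 3])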

pm_pd is passed directly to the forward method rather than computed during the forward pass, so pm_pd should be batched before training.

I think you are right and this happens here.

We can view a batched graph as several disconnected graphs; when batching pm, for example, it seems that each graph's pm should become a block in one large matrix, which could be implemented with torch.block_diag. The batched Pm would then be a block matrix. Is that right?

I think the code uses scipy.sparse.block_diag, which should behave like torch.block_diag. Why do you call it a block matrix? I think pm_pd is a block diagonal matrix?
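Roughly something along these lines (a minimal sketch with toy sparse matrices, not the actual dataset code):

import numpy as np
import scipy.sparse as sp

# Toy per-graph pm_pd matrices; the values are arbitrary, only the shapes matter here.
pm_pd_g1 = sp.csr_matrix(np.ones((3, 4)))   # graph 1: 3 nodes, 4 edge columns
pm_pd_g2 = sp.csr_matrix(np.ones((2, 2)))   # graph 2: 2 nodes, 2 edge columns

# Place each per-graph matrix as a diagonal block; everything off the blocks
# stays zero, so the batched result is block diagonal.
pm_pd_batched = sp.block_diag([pm_pd_g1, pm_pd_g2], format='csr')
print(pm_pd_batched.shape)  # (5, 6)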

I believe Pm_Pd in the tutorial example is constructed as you described in the other post. Please take the SBM dataset as a reference for how to compute Pm_Pd.

Yes, what I mean is a block diagonal matrix. Thank you very much.

Thank you very much, but pm_pd is a single matrix, and I don't know which operation should be applied to the Pm and Pd matrices to combine them.

We first concatenate Pm and Pd along the row dimension to get the pm_pd matrix.

Thank you for your response. I don't think the pm_pd matrix is generated by concatenating Pm and Pd along the row dimension. As the docs show,

import dgl

train_set = dgl.data.CoraBinary()
G1, pmpd1, label1 = train_set[1]

we can obtain the pm_pd matrix:

print(pmpd1.toarray().shape)  # (689, 2684)
print(G1.num_nodes())  # 689
print(G1.num_edges())  # 2684

According to the original paper, Pm and Pd should both be matrices of shape (num_nodes, num_edges).

Correct me if I’m wrong, since LGNN’s paper has undergone significant modification since then. I recall that LGNN works on undirected graphs, whilst our example from the dataset is already a directed graph. Therefore you see pm_pd’s column dimension align with the node graph’s edge count.
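To make the numbers concrete, here is a quick shape check. It assumes Pm and Pd each have one column per undirected edge and sit side by side along the column dimension, which is only my reading of the layout, so please double-check it against the dataset code:

num_nodes = 689
num_directed_edges = 2684                        # G1.num_edges() above; both directions stored
num_undirected_edges = num_directed_edges // 2   # 1342

# Assumed layout: Pm and Pd are each (num_nodes, num_undirected_edges),
# and concatenating them along the column (edge) dimension reproduces
# the (689, 2684) shape printed from CoraBinary.
pm_shape = (num_nodes, num_undirected_edges)     # (689, 1342)
pd_shape = (num_nodes, num_undirected_edges)     # (689, 1342)
pm_pd_shape = (num_nodes, pm_shape[1] + pd_shape[1])
print(pm_pd_shape)  # (689, 2684)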