Hi, I’m trying to treat a batched tensor as a batch of adjacency matrices (batch_of_adj: tensor of shape [8, 512, 512], i.e. 8 graphs of 512 nodes each) and build a batch of DGLGraphs from it. Assume that each adjacency matrix is non-negative and thresholded at 0.7 (so every entry is either > 0.7 or 0).
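For context, the adjacency batch is produced roughly like this (a simplified sketch; `scores` is just an illustrative stand-in for the pairwise scores my model actually computes):

```python
import torch

# Illustrative stand-in for the model's pairwise scores in [0, 1]
scores = torch.rand(8, 512, 512)
threshold = 0.7  # the hyperparameter discussed below
# Zero out entries at or below the threshold; surviving entries keep
# their original positive values, so each entry is either 0 or > 0.7
batch_of_adj = scores * (scores > threshold)
```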
Currently, I’m doing something like this to build the graphs, pass them through a model that updates the node features, and stack the node features back into one batched tensor for further processing:
```python
import torch
import dgl

# features has shape [8, 512, hidden_dim]
index = [adj.nonzero(as_tuple=False).t().contiguous() for adj in batch_of_adj]
values = [adj[idx[0], idx[1]] for idx, adj in zip(index, batch_of_adj)]
# num_nodes guards against isolated nodes: without it, DGL infers the node
# count from the largest edge index, which can be < 512
graphs = [dgl.graph((idx[0], idx[1]), num_nodes=adj.shape[0])
          for idx, adj in zip(index, batch_of_adj)]

for i, g in enumerate(graphs):
    g.ndata['h'] = features[i]
    g.edata['w'] = values[i]

g = dgl.batch(graphs)
g = GraphModel(g)  # GraphModel updates the node features of the batched graph
gs = dgl.unbatch(g)  # unbatch to get the updated features back per graph
graph_output = torch.stack([g.ndata['h'] for g in gs])
```
A weird thing I’ve noticed is that changing the value of the “threshold” hyperparameter I use to compute the weighted adjacency matrices (threshold=0.7 in this example) has no effect on the loss or the accuracy (even threshold=0.01 and threshold=0.99 yield identical losses), which makes me think that something is going wrong.
I’ve confirmed that different threshold values lead to graphs of different sizes (a higher threshold means smaller graphs, since fewer edges survive). My suspicion is that something in the three lines that build index, values, and graphs is preventing the threshold from affecting the loss, even though it clearly changes the graph sizes.
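Roughly how I checked, reusing the illustrative `scores` from above:

```python
# Edge count shrinks as the threshold grows, so the graphs really do differ
for thr in (0.01, 0.5, 0.99):
    adj = scores * (scores > thr)
    print(thr, adj.count_nonzero().item())
```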
I have two main questions:
- Is there a better way to construct the graphs from the batched adjacency matrices than my three lines for index/values/graphs? (A vectorized alternative I’ve been considering is sketched after these questions.)
- How would you explain the different threshold values resulting in identical loss and accuracy?
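For reference, here is the vectorized alternative I mentioned in the first question. It’s only a sketch under my assumptions (512 nodes per graph, GraphModel reading 'h' and 'w' as above), not something I’ve validated:

```python
import torch
import dgl

B, N = batch_of_adj.shape[0], batch_of_adj.shape[1]

# All edges of all graphs at once; offset node ids so each graph
# occupies its own block of one big block-diagonal graph
b, row, col = batch_of_adj.nonzero(as_tuple=True)
src, dst = b * N + row, b * N + col

big_g = dgl.graph((src, dst), num_nodes=B * N)
big_g.ndata['h'] = features.reshape(B * N, -1)
big_g.edata['w'] = batch_of_adj[b, row, col]

# Attach batch information in case GraphModel uses readout ops
big_g.set_batch_num_nodes(torch.full((B,), N, dtype=torch.long))
big_g.set_batch_num_edges(torch.bincount(b, minlength=B))

big_g = GraphModel(big_g)
graph_output = big_g.ndata['h'].reshape(B, N, -1)
```

I’m not sure whether this is actually more idiomatic than per-graph construction plus dgl.batch, hence the question.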
I’d really appreciate any insights!