Split graph for train and test based on edge weight

Panther719 · September 7, 2023, 3:30pm

Hi,
I have built a graph with edge weight named “score”. I want to split the graph for link prediction into train and test.
train_g should have all edges which have edge weights >=0.5
test_g should have all edges which have edge weights <0.5
Also, need to generate the negative samples based on the sizes of the above train_g and test_g.
Would be helpful if you could guide me on how to achieve this. Thanks

peizhou001 · September 21, 2023, 1:12am

Hi， for your question：

Dataset Splitting:The DGL API dgl.data.utils.split_dataset provides a way to split a dataset. However, it performs a random split. If you need a more specific or custom split, you can manually fetch all edges and filter them according to your requirements.
Negative Sampling:DGL also provides an API for negative sampling, dgl.sampling.global_uniform_negative_sampling. The num_samples argument in this function allows you to specify the number of negative samples to generate for each positive example.

system · October 21, 2023, 1:13am

This topic was automatically closed 30 days after the last reply. New replies are no longer allowed.