I’ve been experimenting with creating network embeddings for a heterogeneous graph using MetaPath2Vec. I was wondering if anyone has had any experience with tuning/selecting the hyperparameters (namely, the epochs, learning rate, negative sample size etc.)?
I tried comparing my embeddings generated through 25 epochs and 100 epochs and the latter seems to project the nodes into one big cluster (despite the loss continuously decreasing over every epoch), whereas 25 epochs creates sensibly separated clusters. I know this is very case-dependent, but is there a way to quantify when to stop training and selecting any other hyperparameters?