Softmax-function in Hands-on Tutorial

I was watching the WWW20-Hands-on-Tutorial and I saw the implementation for GCMC:

In the GCMC paper of van Berg et al. they introduced a bilinear decoder with a softmax function. Why is it missing in the notebook. Is it becaus of simplicity?


Hi @Owm112, would you mind checking DGL’s example here:

where the bi-linear decoder was defined in, the softmax function was not explicitly exposed because we use CrossEntropyLoss in, which applies a softmax on the input:

1 Like