Can you explain this modification to the sparse embedding?

The change is in the following file:
Sparse Emb

Why are the gradients not cleared in the sparse embedding? Could you explain the benefit? Thanks very much.

rgcn dist demo

Sorry, I just saw it.

But why is this zero_grad control written separately?

Are there cases where the gradients need to be accumulated?

We follow the style of the PyTorch optimizer. If you do not call zero_grad(), the gradients will be accumulated.
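A minimal sketch of that accumulation behavior, using plain PyTorch rather than the sparse embedding from this thread. It assumes `torch.nn.Embedding(..., sparse=True)` and `torch.optim.SparseAdam` as stand-ins; the table size, indices, and learning rate are made up for illustration.

```python
import torch

# Stand-in for a sparse embedding: a small table that produces sparse gradients.
# (Assumption: plain PyTorch here, not the DGL class discussed in the thread.)
emb = torch.nn.Embedding(4, 2, sparse=True)
opt = torch.optim.SparseAdam(list(emb.parameters()), lr=0.01)

idx = torch.tensor([1, 3])  # hypothetical node IDs looked up in one mini-batch


def backward_once():
    # Sum of the looked-up rows, so each touched row gets a gradient of ones.
    loss = emb(idx).sum()
    loss.backward()


backward_once()
g1 = emb.weight.grad.to_dense().clone()

# No zero_grad() in between: the second backward pass adds to the stored gradient.
backward_once()
g2 = emb.weight.grad.to_dense().clone()

# Apply the accumulated gradient, then clear it explicitly.
opt.step()
opt.zero_grad()

# Depending on the PyTorch version, zero_grad() either sets the gradient to None
# or fills it with zeros; both count as "cleared" here.
grad_after = emb.weight.grad
cleared = grad_after is None or float(grad_after.to_dense().abs().sum()) == 0.0
```

Skipping `zero_grad()` on purpose is the usual way to accumulate gradients over several small batches before a single `step()`, which is why the call is left to the user.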


Thanks very much.
