Can you explain this change to the sparse embedding?

lixusign · March 8, 2021, 8:46am

The change is in the following file:
Sparse Emb

Why is the gradient not cleared in the sparse embedding? Could you explain the benefit? Thanks very much.

lixusign · March 8, 2021, 9:24am

Sorry, I just saw it.

But why is the 'zero_grad' control written as a separate call?

Is it because in some cases the gradients need to be accumulated?

classicsong · March 9, 2021, 3:35am

We follow the style of the PyTorch optimizer. If you do not call zero_grad(), the gradients will be accumulated.
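
To illustrate the accumulation behavior, here is a minimal sketch using a plain PyTorch tensor and `torch.optim.SGD`. It is not the DGL sparse embedding code; `weight` and `backward_once` are hypothetical names made up for this example, and the toy tensor only stands in for an embedding table.

```python
import torch

# Hypothetical toy parameter standing in for an embedding table (not DGL code).
weight = torch.zeros(3, requires_grad=True)
optimizer = torch.optim.SGD([weight], lr=0.1)


def backward_once():
    # The gradient of this loss w.r.t. every element of `weight` is 2.0.
    loss = (2.0 * weight).sum()
    loss.backward()


backward_once()
grad_after_one = weight.grad.clone()    # gradient from the first pass only

backward_once()                         # no zero_grad() in between
grad_after_two = weight.grad.clone()    # first and second pass added together

optimizer.step()                        # applies the accumulated gradient
optimizer.zero_grad(set_to_none=False)  # explicit reset before the next round
grad_after_reset = weight.grad.clone()  # all zeros again
```

Keeping the reset as its own call lets the caller decide when to clear the gradients, for example only after several small batches have been accumulated into one update.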

lixusign · March 9, 2021, 3:44am

Thanks very much.
