Open
Description
In the demo 256 code, the weights of the different losses are 1, 1/1.6, 1/2.3, 1/2.8, 10/0.5. Where do these hyperparameters come from?
The paper says they are the "inverse of the number of elements in each layer". What do you mean by "number of elements", and how are the weights above calculated from it?
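To make the question concrete, this is how I would read "inverse of the number of elements" literally: one weight per layer, equal to 1 / (C * H * W) for that layer's feature map. The layer names, the shapes, and the helper `inverse_element_weights` below are hypothetical and picked only for illustration. This reading is my assumption, not something the paper or the demo code confirms.

```python
from math import prod

# Hypothetical (channels, height, width) shapes, chosen only for illustration.
# They are NOT taken from the demo code or the paper.
layer_shapes = {
    "layer_a": (64, 256, 512),
    "layer_b": (128, 128, 256),
}


def inverse_element_weights(shapes):
    """Return 1 / (C * H * W) for each named layer (assumed reading of the paper)."""
    return {name: 1.0 / prod(shape) for name, shape in shapes.items()}


weights = inverse_element_weights(layer_shapes)
for name, w in weights.items():
    print(f"{name}: {w:.3e}")
```

If this reading is wrong, could you point out which quantity is actually being counted?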
Looking forward to your reply. Thank you!