Submitted by Emotional-Fox-4285 t3_yoauod in deeplearning
HowdThatGoIn t1_ivi1zzn wrote
I can’t say for certain without the code but it looks like the loss is being applied to every hidden unit (as a scalar) rather than being distributed based off of each units contribution to the loss (as a vector). Check the shape of your loss as it moves through each layer?
Edit: also, are you applying the total loss or the mean loss? It should be the latter.
Emotional-Fox-4285 OP t1_ivjbz4s wrote
I send you the link to my notebook...
I am beginner ,therefore very lack of knowledge and couldn't find it out myself.
I will be grateful if you take a look of my notebook and feel free to suggest any change.
https://drive.google.com/file/d/1S5s5d6x0iwFOYk9SimiZt2U_6dLNierP/view?usp=sharing
Viewing a single comment thread. View all comments