[Suggestion] Add a note about the training of Bengio et al. MLP #4

OmriKaduri · 2022-09-27T17:43:19Z

Hi @karpathy, thanks for that great repo!

Maybe it would be better to note in your code that while you're training by minimizing the CE loss, Bengio actually maximized the log-likelihood. I know that it is equivalent in this case (one-hot vectors as ground-truth), but that's not the case in general, so maybe better to note. Thanks!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Suggestion] Add a note about the training of Bengio et al. MLP #4

[Suggestion] Add a note about the training of Bengio et al. MLP #4

OmriKaduri commented Sep 27, 2022

[Suggestion] Add a note about the training of Bengio et al. MLP #4

[Suggestion] Add a note about the training of Bengio et al. MLP #4

Comments

OmriKaduri commented Sep 27, 2022