ReasonablyBadass t1_ittryn7 wrote
Reply to comment by SatisfyingLatte in Where does the model accuracy increase due to increasing the model's parameters stop? Is AGI possible by just scaling models with the current transformer architecture? by elonmusk12345_
Overfitting isn't an issue anymore due to the discovery of double descent/grokking.
Viewing a single comment thread. View all comments