[R] Train CIFAR10 in under 10 seconds on an A100 (new world record!) Submitted by tysam_and_co t3_10op6va on January 30, 2023 at 1:41 AM in MachineLearning 33 comments 145
fnbr t1_j6j9f11 wrote on January 30, 2023 at 6:55 PM Have you looked at some of the architectures that get rid of BatchNorm (e.g. NFNets)? In my experience, BatchNorm tends to be quite slow, so I wonder if there's some speed to be gained there. Permalink 2
Viewing a single comment thread. View all comments