What. The. ***k. [less than 1B parameter model outperforms GPT 3.5 in science multiple choice questions] Submitted by Destiny_Knight t3_118svv7 on February 22, 2023 at 8:27 AM in singularity 194 comments 493
pure_coconut_water t1_j9mletn wrote on February 23, 2023 at 1:27 AM Generalization is the end goal, not memorization. Permalink 1
Viewing a single comment thread. View all comments