What. The. ***k. [less than 1B parameter model outperforms GPT 3.5 in science multiple choice questions] Submitted by Destiny_Knight t3_118svv7 on February 22, 2023 at 8:27 AM in singularity 194 comments 493
turnip_burrito t1_j9kgb2q wrote on February 22, 2023 at 5:13 PM Reply to comment by gelukuMLG in What. The. ***k. [less than 1B parameter model outperforms GPT 3.5 in science multiple choice questions] by Destiny_Knight We already knew parameters aren't everything, or else we'd just be using really large feedforward networks for everything. Architecture, data, and other tricks matter too. Permalink Parent 3
Viewing a single comment thread. View all comments