farmingvillein t1_itefjav wrote
Reply to comment by LetterRip in [R] Scaling Instruction-Finetuned Language Models - Flan-PaLM- Google 2022 - 75.2% on five-shot MMLU / Forecasters expected this SOTA would need until 2024! - Public checkpoints! by Singularian2501
> Note that 540B parameters is more than 2 TB for float 32
They only provide checkpoints up to the 11B model, however (unless I'm reading things wrong), so this is a moot point, at the moment.
Viewing a single comment thread. View all comments