[D] Training a 65b LLaMA model Submitted by Business-Lead2679 t3_12618zu on March 29, 2023 at 9:27 PM in MachineLearning 27 comments 79
Business-Lead2679 OP t1_je70nka wrote on March 29, 2023 at 9:31 PM Id like to train it on those settings: EPOCHS = 3 LEARNING_RATE = 2e-5 CUTOFF_LEN = 1024 Permalink 3
Viewing a single comment thread. View all comments