Submitted by ahiddenmessi2 t3_11dzfvf in MachineLearning
ahiddenmessi2 OP t1_jacin2n wrote
Reply to comment by KingsmanVince in [D] Training transformer on RTX2060 by ahiddenmessi2
My dataset size can vary because the data can be generated. I'll also consider using gradient accumulation, which should let me train with a larger effective batch size within the GPU's memory. Thanks!
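For anyone reading along, here is a rough sketch of what gradient accumulation does, in plain Python on a toy one-parameter model rather than a real transformer. The data, function names, and sizes are all made up for illustration. It only shows that averaging gradients over several small micro-batches gives the same gradient as one large batch, assuming equal-sized micro-batches and a mean-reduced loss.

```python
# Gradient accumulation on a toy model y = w * x with mean-squared-error loss.
# All names and data here are hypothetical, chosen only for illustration.

def grad_mse(w, batch):
    """Gradient of mean((w*x - y)^2) over the batch with respect to w."""
    return sum(2.0 * (w * x - y) * x for x, y in batch) / len(batch)

def accumulated_grad(w, data, micro_batch_size):
    """Build the large-batch gradient from equal-sized micro-batches."""
    num_micro = len(data) // micro_batch_size
    total = 0.0
    for i in range(num_micro):
        micro = data[i * micro_batch_size:(i + 1) * micro_batch_size]
        # Divide by the number of micro-batches so the running sum
        # matches the mean gradient over the full batch.
        total += grad_mse(w, micro) / num_micro
    return total

data = [(float(x), 3.0 * x) for x in range(1, 9)]  # 8 samples, true w = 3
w = 0.0

full = grad_mse(w, data)                               # one batch of 8
accum = accumulated_grad(w, data, micro_batch_size=2)  # 4 micro-batches of 2

# A single optimizer step is taken only after all micro-batches are processed.
learning_rate = 0.01
w_next = w - learning_rate * accum
```

In a real training loop the same idea usually means calling the backward pass on each micro-batch, scaling the loss by the number of accumulation steps, and stepping the optimizer once per group. Only one micro-batch of activations is held in memory at a time, which is what makes it useful on a card like the RTX 2060.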