Submitted by elf7979 t3_10de78o in deeplearning
suflaj t1_j4pdd6o wrote
Reply to comment by elf7979 in Is 100 mega byte text corpus big enought to train? by elf7979
You're closer but not yet quite there - the smaller Google News Dataset W2V is trained on is 10 GB. The full one used is around 300GB IIRC
Viewing a single comment thread. View all comments