mofawzy89 t1_j09kc7w wrote

I'm not sure how memory management differs between the two, but I ran into the same problem with a BiLSTM. For large models, GCP is the better choice: yes, use TPUs or NVIDIA A100s.
