Submitted by bo_peng t3_11f9k5g in MachineLearning
KerfuffleV2 t1_jccb5v1 wrote
Reply to comment by bo_peng in [P] ChatRWKV v2 (can run RWKV 14B with 3G VRAM), RWKV pip package, and finetuning to ctx16K by bo_peng
Sounds good! The 4bit stuff seems pretty exciting too.
By the way, not sure if you saw it but it looks like PyTorch 2.0 is close to being released: https://www.reddit.com/r/MachineLearning/comments/11s58n4/n_pytorch_20_our_next_generation_release_that_is/
They seem to be claiming you can just drop in torch.compile()
and see benefits with no code changes.
bo_peng OP t1_jccc46c wrote
I am using torch JIT so close ;)
Viewing a single comment thread. View all comments