estrafire t1_jc2umln wrote on March 13, 2023 at 5:15 PM Reply to comment by bo_peng in [R] RWKV (100% RNN) can genuinely model ctx4k+ documents in Pile, and RWKV model+inference+generation in 150 lines of Python by bo_peng Any particular reason for moving from CNN to RNN? Permalink Parent 1
estrafire t1_jc2umln wrote
Reply to comment by bo_peng in [R] RWKV (100% RNN) can genuinely model ctx4k+ documents in Pile, and RWKV model+inference+generation in 150 lines of Python by bo_peng
Any particular reason for moving from CNN to RNN?