[P] RWKV 14B Language Model & ChatRWKV: pure RNN (attention-free), scalable and parallelizable like Transformers
Submitted by bo_peng on January 17, 2023 at 4:54 PM in MachineLearning
mrconter1 wrote on January 18, 2023 at 8:06 PM, in reply to bo_peng:
How does the memory scale with the context window size?
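The question gets at the core trade-off the title advertises: a pure RNN carries a fixed-size recurrent state forward, so its inference memory is constant in context length, whereas a Transformer's KV cache grows linearly with it. A minimal back-of-the-envelope sketch (the dimensions below are illustrative round numbers, not RWKV's actual configuration):

```python
# Compare inference-memory growth: fixed RNN state vs. Transformer KV cache.
# All sizes are hypothetical, chosen only to illustrate the scaling behavior.

def rnn_state_floats(d_model: int, n_layers: int) -> int:
    """A recurrent model keeps one fixed-size state per layer,
    independent of how many tokens it has already consumed."""
    return n_layers * d_model

def kv_cache_floats(d_model: int, n_layers: int, context_len: int) -> int:
    """A Transformer caches a key and a value vector per layer
    for every past token, so memory grows linearly in context."""
    return n_layers * context_len * 2 * d_model

d, layers = 5120, 40  # illustrative dimensions only
for ctx in (1024, 4096, 16384):
    print(ctx, rnn_state_floats(d, layers), kv_cache_floats(d, layers, ctx))
```

The RNN column stays constant as `ctx` grows while the KV-cache column scales linearly, which is the scaling argument behind "RNN memory does not depend on the context window".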