TheRealSerdra t1_itm14nm wrote

I’ve done similar things, and while you can continue improving, you’ll hit a wall at some point. Where that wall is depends on a few different factors. That being said, this is nothing new. Iterative self-improvement has been a thing for ages and is at the heart of some of the most impressive advances in RL. This is just applying an existing concept to language models, not inventing a new one.


rePAN6517 t1_itmx2sz wrote

> I’ve done similar things

Did you publish?