Rakshear t1_isu78sf wrote on October 18, 2022 at 7:00 PM

Reply to comment by TemetN in Skill-Based Reinforcement Learning With Intrinsic Reward Matching by ACasualGuy

So this is more an analysis of how it works? And in knowing that they can focus on improvements?

TemetN t1_isubxzj wrote on October 18, 2022 at 7:30 PM

They're changing the order (and removing a layer of complexity) by using an earlier part of their model to calculate skill use later. It could certainly lead to further improvements in either efficiency or potentially transfer learning down the line.

Rakshear t1_isuckop wrote on October 18, 2022 at 7:34 PM

I think I get it, thank you.