Rakshear t1_istzkhh wrote on October 18, 2022 at 6:10 PM

Reply to comment by TemetN in Skill-Based Reinforcement Learning With Intrinsic Reward Matching by ACasualGuy

So they taught it to take the skills from one task to another? Self improvement stuff? Quick get quicker?

TemetN t1_isu2tr4 wrote on October 18, 2022 at 6:32 PM

Kind of and not really. Basically it's about where in the series of tasks they figure out what skill to use.

Rakshear t1_isu78sf wrote on October 18, 2022 at 7:00 PM

So this is more an analysis of how it works? And in knowing that they can focus on improvements?

TemetN t1_isubxzj wrote on October 18, 2022 at 7:30 PM

They're changing the order (and removing a layer of complexity) by using an earlier part of their model to calculate skill use later. It could certainly lead to further improvements in either efficiency or potentially transfer learning down the line.

Rakshear t1_isuckop wrote on October 18, 2022 at 7:34 PM

I think I get it, thank you.