Submitted by ACasualGuy t3_y6ryp0 in singularity
Rakshear t1_istzkhh wrote
Reply to comment by TemetN in Skill-Based Reinforcement Learning With Intrinsic Reward Matching by ACasualGuy
So they taught it to take the skills from one task to another? Self improvement stuff? Quick get quicker?
TemetN t1_isu2tr4 wrote
Kind of and not really. Basically it's about where in the series of tasks they figure out what skill to use.
Rakshear t1_isu78sf wrote
So this is more an analysis of how it works? And in knowing that they can focus on improvements?
TemetN t1_isubxzj wrote
They're changing the order (and removing a layer of complexity) by using an earlier part of their model to calculate skill use later. It could certainly lead to further improvements in either efficiency or potentially transfer learning down the line.
Rakshear t1_isuckop wrote
I think I get it, thank you.
Viewing a single comment thread. View all comments