Submitted by ACasualGuy t3_y6ryp0 in singularity
Rakshear t1_isu78sf wrote
Reply to comment by TemetN in Skill-Based Reinforcement Learning With Intrinsic Reward Matching by ACasualGuy
So this is more an analysis of how it works? And in knowing that they can focus on improvements?
TemetN t1_isubxzj wrote
They're changing the order (and removing a layer of complexity) by using an earlier part of their model to calculate skill use later. It could certainly lead to further improvements in either efficiency or potentially transfer learning down the line.
Rakshear t1_isuckop wrote
I think I get it, thank you.
Viewing a single comment thread. View all comments