Viewing a single comment thread. View all comments

fedetask OP t1_iuduy9k wrote

My concern is that since the training process is noisy (RL) the optimization could take more time to "isolate" those features, and maybe some smarter model architecture could bias the algorithm in giving more importance to them from the beginning

2

eigenham t1_iue47f3 wrote

If you know for sure certain inputs should have a greater role in the final decision, you can help the model not lose that information layer over layer by giving it skip connections to later layers.

3