[R] A simple explanation of Reinforcement Learning from Human Feedback (RLHF) Submitted by JClub t3_10fh79i on January 18, 2023 at 8:05 PM in MachineLearning 15 comments 71
[D] RLHF - What type of rewards to use? Submitted by JClub t3_10emf7a on January 17, 2023 at 8:23 PM in MachineLearning 10 comments 19