Jump to main content Jump to sidebar

Forums
Wiki

Log in
Sign up

Overview
Submissions
Comments

BB4evaTB12

Introduction to Reinforcement Learning with Human Feedback [D]

Submitted by BB4evaTB12 t3_10a7qmi on January 12, 2023 at 7:07 PM in MachineLearning

6 comments

14

36% of HellaSwag benchmark contains errors [D]

Submitted by BB4evaTB12 t3_zff5mh on December 7, 2022 at 9:51 PM in MachineLearning

6 comments

33

BB4evaTB12

Registered on October 29, 2021

t2_fnzr0uq9

Running Postmill