Likelihood of OpenAI moderation flagging a sentence containing negative adjectives about a demographic as 'Hateful'. Submitted by grungabunga t3_11bb3l3 on February 25, 2023 at 3:45 AM in singularity 127 comments 140
starstruckmon t1_ja1102i wrote on February 26, 2023 at 1:09 AM Reply to comment by Spire_Citron in Likelihood of OpenAI moderation flagging a sentence containing negative adjectives about a demographic as 'Hateful'. by grungabunga He's talking about the human preference data used for RLHF fine-tuning (which is what turns GPT-3 into ChatGPT). It's not really that massive.