Likelihood of OpenAI moderation flagging a sentence containing negative adjectives about a demographic as 'Hateful'. Submitted by grungabunga t3_11bb3l3 on February 25, 2023 at 3:45 AM in singularity 127 comments 140
MadDragonReborn t1_j9zo65p wrote on February 25, 2023 at 7:17 PM I would have to say that this list states the likelihood of a statement on the internet reflecting animosity toward a given group fairly accurately. Permalink −1− Spire_Citron t1_ja05kyo wrote on February 25, 2023 at 9:16 PM Yup. I think if anything this shows it probably wasn't individually programmed to respond to particular things and is just making its judgements based on the hate that it sees in its data. Permalink Parent −1−
Spire_Citron t1_ja05kyo wrote on February 25, 2023 at 9:16 PM Yup. I think if anything this shows it probably wasn't individually programmed to respond to particular things and is just making its judgements based on the hate that it sees in its data. Permalink Parent −1−
Viewing a single comment thread. View all comments