Likelihood of OpenAI moderation flagging a sentence containing negative adjectives about a demographic as 'Hateful'.
Submitted by grungabunga t3_11bb3l3 in singularity
This is just a mirror of who gets the most hatespeech.
It says more about human discourse than it says about the AI.
Edit: here is a small paragraph from the conclusion of the Article that I think is important to keep in mind:
«It is also important to remark that most sources for the biases reported here are probably unintentional and likely organically emerging from complex entanglements of institutional corpora and societal biases. For that reason, I would expect similar biases in the content moderation filters of other big tech companies.»
You mean, OpenAI was taught on the texts that had way more anti-disabled hate than anti-republican hate? Where have they found them?
It is the whole internet that is like that. As a said, it is a reflexion of our society:
You will never find people insulting "normal weighted people" or "people without a disability". So it is not surprising that the model does not perform well in those areas.
In the US, saying something is "socialism" can even be interpreted as a criticism, so I am not surprised it flags more left-winged things than right-winged.
It's not necessarily just the amount but also the type of hate.
Likely because there is a larger volume of hate content for disabilities than for republicans.
4Chan is the only place i can think of where you wouldn't get instabanned for anti-disabled hate, but considering most models are trained on Reddit it would make sense for it to be extremely biased to the left
[deleted]
So by your theory, if I go on to Twitter right now, I'm going to see pages and pages of hate speech against black people, but almost nobody saying anything about the rich?
Maybe you should rethink.
>This is just a mirror of who gets the most hatespeech.
LMAO you can't be serious that disabled people get more hate than rich people, left wingers, right wingers, gays and so on. I've seen tons of homophobia, political hate from both side of the spectrum but I've never seen hate towards a disabled person
Oh boy….
Viewing a single comment thread. View all comments