Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

This suggests to me that the censorship is being decided on by (biased) human moderators rather than being just some random outcome of the training data and the learning model in place: if it weren't, the model would absolutely "learn" the same thing about Mitch McConnell as it learned about Donald Trump.


I don't think it is. It's blocking on H H Holmes, Nero and Tiberius, and that seems pretty obscure for a manually curated list to me. I think it's blocking on individuals with certain properties ("I cannot write a poem that praises individuals who are known for committing atrocities").


I disagree; Donald Trump and Mitch McConnell are wildly different human beings, the model would for sure learn very different facts about the two of them.


Absolutely this.

Indeed I think it's fair to say that it'd take a lot of artificial calibration and data curation for a model trained on a range of media including statements about and by Mitch McConnell and Trump respectively not to conclude that the latter was the one much more associated with "hate" and "danger" and "violence" and whatever other parameters a LLM ends up associating with inappropriateness.

A biased liberal human moderator, on the other hand, is going to see the real world political relationships rather than the raw text and see Mitch as a very problematic figure in very much the same bracket as Trump. They're certainly not going to rate him as a less problematic figure than Hillary Clinton or Nancy Pelosi!

Same when I'm getting identically structured caveats about considering good points in the context of the bad things he did for Bill Clinton and Stalin because all the machine knows is that equivocating is favoured and both have lots of "bad things" written about them (it disallowed considering the good points of Hitler, presumably because even an LLM can deduce Godwin's law!). I'm not sure this is quite how a human moderator, irrespective of bias, would handle it




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: