
> To balance safety and utility, Codex was trained to identify and precisely refuse requests aimed at development of malicious software, while clearly distinguishing and supporting legitimate tasks.

I can't say I am a big fan of neutering these paradigm-shifting tools according to one culture's code of ethics / way of doing business / etc.

One man's revolutionary is another's enemy combatant and all that. What if we need top-notch malware to take down the robot dogs lobbing mortars at our madmaxian compound?!



>What if we need top-notch malware to take down the robot dogs lobbing mortars at our madmaxian compound?!

I wouldn't sweat it. According to its developers, Codex understands 'malicious software'; it has just been trained to say, "But I won't do that" when such requests are made to it. Judging from the recent past [1][2], getting LLMs to bypass such safeguards is pretty easy.

[1] https://hiddenlayer.com/innovation-hub/novel-universal-bypas...
[2] https://cyberpress.org/researchers-bypass-safeguards-in-17-p...


Agreed. I'm a big proponent of people being in control of the tools they use. I don't think the approach where a wise dictator enforces that I can't use my flathead screwdriver to drive a Phillips-head screw is a good one. I think it actively undermines people.


You gotta think about it in terms of cost vs. benefit. How much damage will a malicious AI do, vs. how much value will you get out of a non-neutered model?


If I had to guess, they'll only be neutered for the general public, not for the three-letter agencies.


TLAs have very few of their own coders; they contract everything out. I'm sure OAI will lend an unrestricted model to groups that pay for large private contracts they won't disclose.



