Consistent Jailbreaks in GPT-4, o1, and o3 - General Analysis

ooli2@lemm.ee · 2 months ago

Consistent Jailbreaks in GPT-4, o1, and o3 - General Analysis

𝕯𝖎𝖕𝖘𝖍𝖎𝖙⚧ [She/Her]@lemm.ee · edit-2 2 months ago

I think a big part of it is just that many want control, they want to limit what we’re capable of doing. They especially don’t want us doing things that go against them and their will as companies. Which is why they try to block us from doing those things they dislike so much, like generating porn, or discussing violent content.

I noticed that certain prompts people used for the purpose of AI poisoning are now marked as against the terms of service on ChatGPT so the whole “control” thing doesn’t seem so crazy.