Tweet

⚠️"demonstrating such jailbreaks is to show a fundamental security vulnerability of LLM’s to logic manipulation

[…] such “toy” Jailbreak examples will be used to perform actual criminal activities and cyberattacks, which will be extremely hard to detect and prevent." https://twitter.com/alexalbert__/status/1646624856430215168

(original)