Tweet
⚠️ "demonstrating such jailbreaks is to show a fundamental security vulnerability of LLMs to logic manipulation
[…] such “toy” Jailbreak examples will be used to perform actual criminal activities and cyberattacks, which will be extremely hard to detect and prevent." https://twitter.com/alexalbert__/status/1646624856430215168