Comment Re:Missing from the article (Score 2) 40
The risk equation is:
risk = offensive_capability * jailbreakability
According to Anthropic's own doom-saying, Mythos could literally end the whole wide world overnight with a series of catastrophic, hypercane-level cyberattacks. Currently, Fable 5 and Mythos are the only Mythos-class models publicly available. So even if other models are highly jailbreakable, their risk is scaled down by their lower offensive_capability. Again, this is according to *Anthropic.* Obviously, jailbreaking GPT-3.5 isn't going to end the world. According to Anthropic, a jailbroken Fable 5 or Mythos is the apocalypse. Since those are the ONLY apocalypse-class models available, jailbreakability matters far more for those models than for any other. That makes Anthropic's attempt to say "but GPT-5.5!" moot, since no one has claimed GPT-5.5 can usher in the cybersecurity apocalypse.
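To put rough numbers on the equation, here is a minimal sketch. The 0-to-1 scale and every score in it are placeholders I made up to show how the multiplication behaves. They are not measurements of any real model.

```python
def risk(offensive_capability: float, jailbreakability: float) -> float:
    """risk = offensive_capability * jailbreakability, both on an assumed 0-to-1 scale."""
    return offensive_capability * jailbreakability


# Hypothetical placeholder scores, chosen only for illustration.
# A weak model that is trivially easy to jailbreak:
weak_but_leaky = risk(offensive_capability=0.1, jailbreakability=0.9)
# A maximally capable model that is much harder to jailbreak:
strong_but_guarded = risk(offensive_capability=1.0, jailbreakability=0.3)

# The capable model still carries the higher risk under these placeholder numbers.
print(weak_but_leaky < strong_but_guarded)
```

With a product, a low offensive_capability caps the risk no matter how jailbreakable the model is, which is the whole point of the comparison.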
Anthropic made the claims, not me. I'm just following them to their logical conclusion.