Anthropic was ordered on national security grounds to immediately disable Fable 5 and Mythos 5 after the government was presented with a jailbreak technique that Anthropic classifies as widely prevalent among competitor models as well.
Broad American consensus: The public accepts AI benefits but demands government oversight and corporate liability, with only 15 % trusting AI companies themselves.
Anthropic will make hidden request throttling in Claude transparent going forward but retains content restrictions, partly due to conflicts with the US Department of Defense over national security.
The security filter in Claude 3.5 Sonnet blocks legitimate security requests, limiting its usability for CTOs performing security audits and vulnerability assessments.
Claude Fable 5 does not permit zero-data-retention contracts and retains all prompts and outputs for 30 days for security purposes, even where organizations have ZDR agreements with older Claude models.
Anthropic calls for an aviation-like regulatory authority or commissioned private auditors to examine AI models for critical risks before their release.
Anthropic increasingly differentiates AI access by user category: the public receives Fable 5 with active security routing, while governments, large enterprises and research labs can use the less restrictive Mythos 5.
Anthropic splits Claude Fable 5 into a public version (with safeguards) and a restrictive version (Claude Mythos 5 without security layers) for verified cybersecurity experts.