TechCrunch reports that the U.S. government ordered Anthropic to immediately disable Claude Fable 5 and Claude Mythos 5 worldwide, citing national security concerns. Anthropic says the order appears tied to a claimed narrow jailbreak of Fable 5, but argues the cited capability is already common in other public models. The move highlights a potential backlash against Anthropic’s safety-first messaging around especially powerful AI systems.
Anthropic released Claude Fable 5 and Claude Mythos 5 simultaneously; Fable 5 matches Mythos 5 in capability but adds strict safety classifiers, with new API fallback mechanisms for rejected requests. Both models offer 1M token context, 128K max output, January 2026 knowledge cutoff, priced at $10/$50 per million tokens — double Opus 4.x. Simon's knowledge-breadth test shows Fable 5 substantially outperforms Opus 4.8, listing dozens of his open-source projects with approximate dates from memory alone.
Anthropic has announced that its latest frontier model, Fable 5, enforces hard refusals on topics deemed too dangerous, specifically cybersecurity, biology, and chemistry. The move reflects the company's ongoing effort to balance capability with safety as models grow more powerful. For developers and researchers in these fields, the restrictions may limit practical usability in legitimate professional contexts.
Mistral AI announced it is a founding member of the NVIDIA Nemotron Coalition, a global initiative for open frontier foundation models. The partnership combines Mistral AI’s model architecture, training techniques, multimodal capabilities, and enterprise fine-tuning tools with NVIDIA compute, development tools, and synthetic data pipelines. The coalition’s first initiative is a DGX Cloud-trained base model that will support the upcoming NVIDIA Nemotron 4 family and be open-sourced for specialization.
Latent Space talks with Lukas Petersson and Axel Backlund of Andon Labs, the authors behind VendingBench. The episode focuses on evaluating Claude models across a range from Haiku to Mythos. It also discusses how they build frontier evals from scratch, with an emphasis on creating benchmarks that remain useful and meaningful over time.
Google DeepMind officially announced on April 27, 2026, a strategic partnership with the government of the Republic of Korea (South Korea). The core objective…
Google DeepMind has recently announced the strengthening of its Frontier Safety Framework (FSF) — a systematic mechanism designed to proactively identify…