Latest in AI

🔥 Trending today

anthropic7 export-controls5 model-access3 ai-infrastructure3 spacex3 amazon3 national-security2 open-source2 governance2 ai-policy2

Topic

Release New Tool Tutorial Business Paper Benchmark Opinion Regulation

For

General Developers Designers Product Founders Marketing Researchers Students

Publishing WASM Wheels to PyPI for Use with Pyodide
Simon Willison's Weblog5 hours agoTutorial
Pyodide 314.0 removes a long-standing distribution bottleneck by allowing WebAssembly-compiled Python wheels to be published directly to PyPI, so any package author can now distribute Pyodide-compatible packages without Pyodide team involvement. Previously, the team manually built and hosted over 300 packages. Simon Willison celebrated by publishing luau-wasm — a Lua-based scripting language compiled to WASM — using Codex with GPT-5.5 to automate the packaging workflow.
OpenAI Faces State Attorneys General Investigation★ 72
TechCrunch AI12 hours agoRegulation
OpenAI is facing an investigation from state attorneys general, according to TechCrunch. The article says it is not yet clear which states are involved. Reported areas of inquiry include OpenAI's advertising policies and how the company handles health-related data, suggesting regulators are examining both consumer-facing business practices and sensitive information governance.
AINews: Fable and Mythos Access Suspended Over Cybersecurity Risk★ 76
Latent SpaceyesterdayIncident
Anthropic’s Claude Fable 5 and Mythos 5 were abruptly suspended after a US export-control directive tied to a possible jailbreak and national cybersecurity risk. The roundup frames the event as a new “model sovereignty” warning for teams relying on closed frontier APIs. It also covers Kimi-K2.7-Code, MiniMax M3, DeepSWE replacing SWE-Bench Pro, agent-inference benchmarks, sandboxing, and Gemini-SQL2.
U.S. Government Orders Anthropic to Disable Claude Fable 5 and Mythos 5★ 78
TechCrunch AIyesterdayRegulation
TechCrunch reports that the U.S. government ordered Anthropic to immediately disable Claude Fable 5 and Claude Mythos 5 worldwide, citing national security concerns. Anthropic says the order appears tied to a claimed narrow jailbreak of Fable 5, but argues the cited capability is already common in other public models. The move highlights a potential backlash against Anthropic’s safety-first messaging around especially powerful AI systems.
US Directive Targets Access to Fable 5 and Mythos 5★ 76
Simon Willison's WeblogyesterdayRegulation
Simon Willison comments on Anthropic’s statement that a US government export-control directive requires suspending access to Fable 5 and Mythos 5 for all foreign nationals, including Anthropic employees. Anthropic says the directive cites national security concerns but offers only verbal evidence of a narrow Fable 5 jailbreak. Willison notes that, as of 9:01pm ET, he still had access to Fable through claude.ai and Claude Code.
OpenAI WebRTC Audio Session Adds GPT-Realtime-2 and Document Context
Simon Willison's WeblogyesterdayNew Tool
Simon Willison revisited his OpenAI WebRTC Audio Session tool, originally built in December 2024 to test OpenAI’s realtime audio API. The update lets users choose GPT-Realtime-2, a newer realtime voice model OpenAI described as having GPT-5-class reasoning. It also adds a document-context box, allowing users to paste text before starting a browser-based voice session and discuss that material conversationally.
SpaceX, Anthropic, and OpenAI Face a Hot IPO Summer
TechCrunch AI2 days agoBusiness
TechCrunch says the IPO market is active again, but the leading names are no longer the classic FAANG companies. The episode centers on MANGOS: Meta or Microsoft, Anthropic, Nvidia, Google, OpenAI, and SpaceX. With several of these companies approaching public markets in the same window, Equity’s hosts discuss what that means for valuations, investors, and expectations for public tech companies in 2026.
Fable 5 Falls Short of GPT 5.5 on the “Final Exam” for Agents
量子位 QbitAI2 days agoBenchmark
Based only on the provided title, the article appears to discuss an “agent final exam” evaluation comparing Fable 5 with GPT 5.5. The key claim is that Fable 5, despite expectations implied by the wording, did not outperform GPT 5.5. No benchmark design, scores, task types, methodology, or broader conclusions are available from the supplied content.
UN Report Warns AI Could Consume Drinking Water for 1.3 Billion People by 2030★ 72
INSIDE 硬塞 AI2 days agoEthics
INSIDE summarizes a United Nations University report arguing that AI’s environmental cost cannot be measured by carbon alone. The report projects AI-supporting data centers could use 945 TWh of electricity annually by 2030, while cooling water demand may exceed the annual drinking-water needs of 1.3 billion people. It also says inference dominates lifecycle energy use and that concentrated cloud infrastructure deepens global inequality.
Program Claude Code, Codex, Pi and Other Agent Harnesses with AI SDK
Vercel Changelog2 days agoRelease
Vercel’s changelog entry says AI SDK can now be used to program agent harnesses including Claude Code, Codex, Pi, and other similar tools. Based on the title alone, the update appears aimed at developers who want a common programming interface around coding agents and AI assistant runtimes. No implementation details, APIs, examples, pricing, availability limits, or supported harness list beyond the named products are provided in the source text.
GitHub Availability Report: May 2026
GitHub Blog2 days agoIncident
GitHub’s May 2026 availability report details nine incidents that degraded core services across github.com, GitHub Actions, pull requests, and GitHub Copilot. The report ties broader reliability pressure to rapidly growing traffic from AI-assisted and agentic development workflows. GitHub says it is shifting more traffic to Azure, isolating major services, improving database safeguards, and strengthening failover for affected Copilot model routes.
Datasette 1.0a33 Adds JSON API Extras for Queries and Rows
Simon Willison's Weblog3 days agoRelease
Simon Willison announced Datasette 1.0a33, an alpha release that extends the existing ?_extra= JSON API pattern beyond tables to cover queries and rows. The feature is now documented and presented as a significant step toward Datasette 1.0. Willison also used Claude Fable 5 in Claude Code and GPT-5.5 xhigh in Codex Desktop to build a custom extras API explorer demonstrating the new capability.
Meshy Launches First 3D AI Agent, Calling It a ChatGPT Moment for 3D Creation
量子位 QbitAI3 days agoNew Tool
Meshy has announced what the title describes as the world’s first 3D AI Agent. The report frames the launch as a potential “ChatGPT moment” for 3D creation, suggesting a shift toward more conversational or agentic workflows. Because no article body was provided, details such as capabilities, availability, pricing, benchmarks, and supported formats are not confirmed.
OpenAI Weighs Price Cuts as Anthropic Competition Intensifies
Hacker News (AI keywords)3 days agoBusiness
OpenAI is reportedly weighing price reductions as competitive pressure from Anthropic increases. Based only on the provided title, the report appears to concern business strategy rather than a new model or product release. For developers, founders, investors, and general AI users, the key implication is that pricing may become a more important battleground among leading AI providers.
Silia: A Tiny Transformer Architecture for Sub-10M Parameter Models
r/LocalLLaMA top day3 days agoPaper
A student from India shared their first paper on r/LocalLLaMA, proposing Silia, a Transformer architecture for extremely small models. The idea is to merge attention-style dynamic mixing with SwiGLU-like nonlinear transformation, aiming to save parameters in models under roughly 10M parameters. The author frames the work as an early, small-scale exploration, limited by old hardware and restricted access to larger compute.
DeepSeek v4 Coding Scores Clash With Broader Frontier Benchmarks
r/LocalLLaMA top day3 days agoCommentary
A Reddit post questions why DeepSeek v4 can rank near the top of coding leaderboards while CAISI reportedly places it about eight months behind the US frontier. The author argues that both views may be compatible because coding benchmarks measure a narrow, heavily optimized slice of capability. For local users, the bigger question is how quantized DeepSeek v4 variants perform in real agent workflows, tool calls, cybersecurity, and abstract reasoning.
[AINews] Open Models, Model Labs vs Agent Labs, and the Untrainable★ 72
Latent Space3 days agoCommentary
This AINews issue uses Sarah Guo’s essay as a lens for current AI industry debates: where open models matter, how agent labs differ from model labs, and what cannot be trained away. It also recaps discourse around Anthropic Fable/Mythos, Fable 5’s capabilities, Google’s DiffusionGemma, and maturing agent infrastructure. The central takeaway is that durable value may lie in integration, customer translation, maintenance, and intent rather than model scores alone.
First GPT-5.6 tests arrive, targeting Mythos
量子位 QbitAI4 days agoBenchmark
The title indicates that QbitAI is covering the first hands-on tests of GPT-5.6, framed around a comparison with Mythos. Because the article body is unavailable, the testing setup, metrics, task types, and actual performance gap cannot be verified. The item is best treated as an early benchmark or model-comparison report that needs the original article for proper evaluation.
Claude Fable 5 First-Day Hands-On Tests Draw Strong Reactions
量子位 QbitAI4 days agoBenchmark
QbitAI reports that Anthropic’s Claude Fable 5 quickly drew widespread hands-on testing after release. Examples include Minecraft UI generation, Photoshop-like creative tools, browser games, websites, Three.js scenes, and coding tasks. The article highlights impressive demos and benchmark claims, but also notes failures in large codebase refactoring and high usage costs.
Claude Mythos 5 Released: 50 Million Lines of Code in One Day★ 74
量子位 QbitAI4 days agoRelease
QbitAI says Anthropic introduced Claude Fable 5 for general users and Claude Mythos 5 for a small set of trusted users. The article highlights software engineering, long-context work, native vision, memory, and scientific research capabilities. It also focuses on a safety-routing design where Fable 5 downgrades high-risk requests to Claude Opus 4.8 instead of simply refusing.
Anthropic Claude Fable 5: Mythos-Class Power with Controversial Terms★ 84
Latent Space4 days agoRelease
Anthropic released Claude Fable 5 as its first broadly available Mythos-class model, alongside restricted Mythos 5 access. Benchmarks and ecosystem reports show strong gains in coding, long-horizon agentic tasks, research, and vision. The controversy centers on 30-day retention for Mythos-class traffic and silent interventions that may reduce effectiveness on frontier LLM development tasks, raising trust, reproducibility, and open AI concerns.
GPT-2: Too Dangerous To Release — A 2019 Retrospective
Hacker News (AI keywords)4 days agoCommentary
In 2019, OpenAI staged the release of GPT-2, citing fears it could enable large-scale disinformation and spam generation. The move sparked debate: was it responsible AI safety practice or a savvy PR stunt? Written in late 2022, this blog post revisits the episode now that GPT-2 looks quaint compared to GPT-3/4, asking whether the original fears were justified.
It's Not FAANG Anymore. It's MANGOS.
TechCrunch AI5 days agoCommentary
The tech industry's shorthand for power is getting an update. As SpaceX, Anthropic, and OpenAI eye massive public market debuts, a new acronym — MANGOS — is emerging to replace the decade-old FAANG. The shift signals that AI and deep tech companies are becoming the new dominant forces in capital markets, displacing the platform and consumer internet era's giants.
Apple’s AI pitch will live or die by its privacy promise
The Verge AI5 days agoCommentary
The Verge argues Apple’s WWDC 2026 AI strategy centers on privacy rather than raw capability. Apple says Siri AI and Apple Intelligence will run on-device when possible and use Private Cloud Compute only when needed. But reliance on Google Gemini, Google Cloud, Nvidia, Intel, and Google Titan hardware complicates Apple’s original privacy story, even if its default data collection remains more limited than rivals.
Cleaning up after AI rockstar developers
Hacker News (AI keywords)5 days agoOpinion
The post explores the phenomenon of "AI rockstar developers" who use AI tools to write code at breakneck speed. While appearing highly productive, they often introduce significant technical debt and architectural mess. The author highlights the growing burden on teams to clean up this AI-generated code, emphasizing the need for rigorous code review and architectural oversight.
JetBrains Mellum 2: a really good and performant model
r/LocalLLaMA top day5 days agoBenchmark
A r/LocalLLaMA user shared informal impressions of JetBrains Mellum 2, focusing on local coding-style tasks and tool calls. On an AMD Radeon RX 7900 XT with llama.cpp Vulkan and 131K context, the model reportedly generated around 111 tokens/s and stayed above 100 tokens/s near full context. The author stresses this is not a scientific benchmark, but a practical workflow-oriented test.
Omi Med STT v1: Open-Weight Medical ASR Fine-Tuned from Parakeet 0.6B★ 72
r/LocalLLaMA top day5 days agoRelease
Omi Health’s founder says he fine-tuned NVIDIA Parakeet TDT 0.6B v2 for clinical speech and released Omi Med STT v1 under CC-BY-4.0. The runtime supports Mac, Windows, and Linux, auto-selecting MLX, NeMo, or GGUF/parakeet.cpp backends. In the author’s held-out medical benchmark, it reports 2.37% medical-WER and 145× realtime on local A10 compute.
OpenAI files for IPO, following Anthropic★ 74
The Verge AI5 days agoBusiness
OpenAI announced Monday that it confidentially submitted a Form S-1 with the US Securities and Exchange Commission. The move follows Anthropic, which reportedly made the same filing step on June 1. The Verge frames this as part of an IPO race between the two AI rivals, but the report does not provide timing, valuation, or offering details.
Following Anthropic, OpenAI files confidentially for IPO★ 78
TechCrunch AI5 days agoBusiness
OpenAI said Monday in a blog post that it has confidentially filed for an initial public offering. The move comes a little over a week after Anthropic, its main rival, also filed to go public. TechCrunch notes that OpenAI was last valued at $852 billion post-money, making the filing a major marker in the AI sector’s race toward public markets.
Introducing FrontierCode★ 78
Hacker News (AI keywords)5 days agoBenchmark
Cognition launched FrontierCode, a coding benchmark focused on mergeability rather than only functional correctness. It evaluates correctness, tests, scope discipline, style, and repository-specific quality standards. Built with open-source maintainers and extensive quality control, it shows current frontier models still struggle: Claude Opus 4.8 scores 13.4% on the hardest Diamond subset, ahead of GPT-5.5 and Gemini 3.1 Pro.

Page 1Next →

Latest in AI

Publishing WASM Wheels to PyPI for Use with Pyodide

OpenAI Faces State Attorneys General Investigation★ 72

AINews: Fable and Mythos Access Suspended Over Cybersecurity Risk★ 76

U.S. Government Orders Anthropic to Disable Claude Fable 5 and Mythos 5★ 78

US Directive Targets Access to Fable 5 and Mythos 5★ 76

OpenAI WebRTC Audio Session Adds GPT-Realtime-2 and Document Context

SpaceX, Anthropic, and OpenAI Face a Hot IPO Summer

Fable 5 Falls Short of GPT 5.5 on the “Final Exam” for Agents

UN Report Warns AI Could Consume Drinking Water for 1.3 Billion People by 2030★ 72

Program Claude Code, Codex, Pi and Other Agent Harnesses with AI SDK

GitHub Availability Report: May 2026

Datasette 1.0a33 Adds JSON API Extras for Queries and Rows

Meshy Launches First 3D AI Agent, Calling It a ChatGPT Moment for 3D Creation

OpenAI Weighs Price Cuts as Anthropic Competition Intensifies

Silia: A Tiny Transformer Architecture for Sub-10M Parameter Models

DeepSeek v4 Coding Scores Clash With Broader Frontier Benchmarks

[AINews] Open Models, Model Labs vs Agent Labs, and the Untrainable★ 72

First GPT-5.6 tests arrive, targeting Mythos

Claude Fable 5 First-Day Hands-On Tests Draw Strong Reactions

Claude Mythos 5 Released: 50 Million Lines of Code in One Day★ 74

Anthropic Claude Fable 5: Mythos-Class Power with Controversial Terms★ 84

GPT-2: Too Dangerous To Release — A 2019 Retrospective

It's Not FAANG Anymore. It's MANGOS.

Apple’s AI pitch will live or die by its privacy promise

Cleaning up after AI rockstar developers

JetBrains Mellum 2: a really good and performant model

Omi Med STT v1: Open-Weight Medical ASR Fine-Tuned from Parakeet 0.6B★ 72

OpenAI files for IPO, following Anthropic★ 74

Following Anthropic, OpenAI files confidentially for IPO★ 78

Introducing FrontierCode★ 78