Ars Technica reports that Trump’s administration is considering government safety tests for advanced AI models before deployment. Critics argue the plan may be short-sighted and performative because DOGE cuts have weakened the US teams best positioned to conduct serious AI security reviews. The concern is that testing without staffing, transparency, and enforcement may not prevent dangerous deployments.
Anthropic is expanding Project Glasswing, its program for using Claude Mythos Preview to find vulnerabilities in critical software. The new cohort includes around 150 organizations across more than 15 countries, including infrastructure providers, vendors, nonprofits, and open-source maintainers. Anthropic frames the expansion as preparation for a world where powerful cyber-capable AI models become cheaper and more widely available, shifting focus from finding bugs to validating, disclosing, patching, and deploying fixes.
Florida sued OpenAI and CEO Sam Altman over multiple murders described as linked to ChatGPT. The state's attorney general accused Altman of an "utter disregard" for human lives. The provided excerpt does not identify the cases, explain the alleged causal links, specify the legal claims, or include OpenAI's response, so the allegations require further clarification.
A new study describes “Negation Neglect,” where LLMs fine-tuned on documents that explicitly mark claims as false still learn the claims as true. Experiments with fabricated statements found models often absorb entity-event associations more strongly than surrounding warnings or negations. The finding raises concerns for fine-tuning pipelines, misinformation handling, and AI safety datasets that include harmful or false content with disclaimers.
Anthropic is releasing Claude Opus 4.8 and highlighting the model’s “honesty” as a key improvement. The company says it trains its models to avoid unsupported claims, addressing a broader issue where AI systems sometimes jump to conclusions. Based on the provided excerpt, the update is positioned around reliability and uncertainty handling rather than a specific new tool or benchmark result.
TechCrunch frames enterprise AI as entering a new phase, where companies are no longer mainly asking whether AI is exciting. The harder question is whether it can be deployed safely at scale. Centered on a TechCrunch Disrupt 2026 discussion with a Databricks co-founder, the article points to safety and broad rollout readiness as key enterprise AI deal concerns.
Google's AI search feature, "AI Overviews," was recently found by users on the social platform X to have a rather absurd system vulnerability. When a user…
This controversy stems from strict U.S. legal restrictions on aviation accident investigation data. Under federal law, the National Transportation Safety Board…
According to a report by Ars Technica, U.S. President Donald Trump abruptly canceled an official event that had been scheduled for the signing of an executive…
According to a TechCrunch report, following a recent AI feature update to Google Search, a baffling system bug emerged: users can now cause the entire Google…
US President Donald Trump recently decided to delay signing a highly anticipated AI safety executive order. The core of the order was to establish a…
As generative AI technology advances at a breakneck pace, AI-generated text, images, audio, and video have reached a point where they are nearly…
As generative AI technology becomes more widespread, the internet is increasingly flooded with images and information that are difficult to distinguish as real…
In the latest issue of Import AI 455, Jack Clark guides readers through an exploration of a highly forward-looking and both exciting and concerning theme: AI…
As artificial intelligence (AI) technology undergoes explosive growth, cybersecurity has become a focal point of concern for governments and enterprises…
In this issue of Import AI 454, written by Jack Clark, the author begins by posing a thought-provoking question about finance and sociology: "At what point…
This issue of Import AI (Issue 453), written by Anthropic co-founder Jack Clark, centers on AI system safety, coding capabilities, and the future of humanity…
In this opinion piece published in Interconnects, prominent AI policy and technology critic Nathan Lambert delivers a sharp critique of the excessive panic…
Google DeepMind has recently published research findings on preventing harmful manipulation by AI. As large language models (LLMs) and AI Agents become…
This issue of Import AI (No. 449) dives deep into several core frontier topics in the current AI landscape, spanning technical breakthroughs and broad…
In this edition of Import AI (Issue 445), author Jack Clark guides readers through three core topics at the very frontier of AI development: the timeline for…
In this edition of Import AI 443, author Jack Clark guides readers through a far-reaching trend already underway: the internet is transforming from…
Vercel officially announced that AssistLoop, an AI collaboration platform focused on "Human-in-the-Loop (HITL)" mechanisms, has joined the Vercel Agents…
Hugging Face, the world's largest open-source AI community platform, has published an article titled "Open Responses," aimed at explaining to developers and…
In the latest issue of Import AI 440, author Jack Clark delves into three key structural trends facing AI development today: the Red Queen Effect, the…
As large language models (LLMs) are widely deployed across enterprises and various applications, ensuring the safety of their outputs and defending against…
Google DeepMind has officially released Gemma Scope 2, extending its powerful open-source model interpretability tools to the latest Gemma 3 model family. This…
Google DeepMind has announced a deepened collaboration with the UK AI Security Institute (UK AISI), with both parties committing to joint work on critical AI…
With the rapid advancement of generative AI technology, identifying the authenticity of images and defending against deepfakes has become an urgent priority…
With the rapid advancement of voice cloning technology, generating hyper-realistic synthetic voices has become remarkably easy. However, this has also…