Major AI rivals including leaders from Anthropic, OpenAI, Microsoft, Meta, and Google DeepMind signed an open letter urging US lawmakers to close a biosecurity gap. They want companies selling synthetic DNA and RNA to screen orders for sequences that could help create dangerous pathogens. The concern is that more capable AI tools and cheaper biology infrastructure could lower barriers to misuse.
Ars Technica reports that Trump’s administration is considering government safety tests for advanced AI models before deployment. Critics argue the plan may be short-sighted and performative because DOGE cuts have weakened the US teams best positioned to conduct serious AI security reviews. The concern is that testing without staffing, transparency, and enforcement may not prevent dangerous deployments.
Anthropic is expanding Project Glasswing, its program for using Claude Mythos Preview to find vulnerabilities in critical software. The new cohort includes around 150 organizations across more than 15 countries, including infrastructure providers, vendors, nonprofits, and open-source maintainers. Anthropic frames the expansion as preparation for a world where powerful cyber-capable AI models become cheaper and more widely available, shifting focus from finding bugs to validating, disclosing, patching, and deploying fixes.
A new study describes “Negation Neglect,” where LLMs fine-tuned on documents that explicitly mark claims as false still learn the claims as true. Experiments with fabricated statements found models often absorb entity-event associations more strongly than surrounding warnings or negations. The finding raises concerns for fine-tuning pipelines, misinformation handling, and AI safety datasets that include harmful or false content with disclaimers.
Anthropic is releasing Claude Opus 4.8 and highlighting the model’s “honesty” as a key improvement. The company says it trains its models to avoid unsupported claims, addressing a broader issue where AI systems sometimes jump to conclusions. Based on the provided excerpt, the update is positioned around reliability and uncertainty handling rather than a specific new tool or benchmark result.
Google's AI search feature, "AI Overviews," was recently found by users on the social platform X to have a rather absurd system vulnerability. When a user…
This controversy stems from strict U.S. legal restrictions on aviation accident investigation data. Under federal law, the National Transportation Safety Board…
According to a report by Ars Technica, U.S. President Donald Trump abruptly canceled an official event that had been scheduled for the signing of an executive…
According to a TechCrunch report, following a recent AI feature update to Google Search, a baffling system bug emerged: users can now cause the entire Google…
US President Donald Trump recently decided to delay signing a highly anticipated AI safety executive order. The core of the order was to establish a…
As generative AI becomes widespread, discussions and experiments around applying AI to psychological counseling and mental health support have never stopped —…
As generative AI technology advances at a breakneck pace, AI-generated text, images, audio, and video have reached a point where they are nearly…
As generative AI technology becomes more widespread, the internet is increasingly flooded with images and information that are difficult to distinguish as real…
In the latest issue of Import AI 455, Jack Clark guides readers through an exploration of a highly forward-looking and both exciting and concerning theme: AI…
As artificial intelligence (AI) technology undergoes explosive growth, cybersecurity has become a focal point of concern for governments and enterprises…
In this issue of Import AI 454, written by Jack Clark, the author begins by posing a thought-provoking question about finance and sociology: "At what point…
This issue of Import AI (Issue 453), written by Anthropic co-founder Jack Clark, centers on AI system safety, coding capabilities, and the future of humanity…
In this opinion piece published in Interconnects, prominent AI policy and technology critic Nathan Lambert delivers a sharp critique of the excessive panic…
Google DeepMind has recently published research findings on preventing harmful manipulation by AI. As large language models (LLMs) and AI Agents become…
This issue of Import AI (No. 449) dives deep into several core frontier topics in the current AI landscape, spanning technical breakthroughs and broad…
In this edition of Import AI (Issue 445), author Jack Clark guides readers through three core topics at the very frontier of AI development: the timeline for…
In this edition of Import AI 443, author Jack Clark guides readers through a far-reaching trend already underway: the internet is transforming from…
Hugging Face, the world's largest open-source AI community platform, has published an article titled "Open Responses," aimed at explaining to developers and…
In the latest issue of Import AI 440, author Jack Clark delves into three key structural trends facing AI development today: the Red Queen Effect, the…
As large language models (LLMs) are widely deployed across enterprises and various applications, ensuring the safety of their outputs and defending against…
Google DeepMind has officially released Gemma Scope 2, extending its powerful open-source model interpretability tools to the latest Gemma 3 model family. This…
Google DeepMind has announced a deepened collaboration with the UK AI Security Institute (UK AISI), with both parties committing to joint work on critical AI…
Google DeepMind has announced that it will deepen its partnership with the UK government — a collaboration aimed at jointly supporting national prosperity and…
With the rapid advancement of generative AI technology, identifying the authenticity of images and defending against deepfakes has become an urgent priority…
With the rapid advancement of voice cloning technology, generating hyper-realistic synthetic voices has become remarkably easy. However, this has also…