Exif Smuggling is a security PoC showing how attackers can embed hidden instructions in image EXIF metadata fields to perform indirect prompt injection against vision-capable AI models. When AI systems parse images alongside their metadata, embedded malicious text may be processed as legitimate instructions, bypassing standard input filters. Developers building AI apps with image upload features should strip or sanitize EXIF data before passing content to language models.
Simon Willison describes his latest attempt to safely run Python plugin-style code inside his own applications. The alpha package micropython-wasm uses MicroPython compiled to WebAssembly, executed through the maintained wasmtime Python library. His goals include clean PyPI installation, CPU and memory limits, controlled file and network access, host functions, and reliable documentation.
Simon Willison notes that OpenAI’s previously teased Lockdown Mode is now live for eligible personal and self-serve Business ChatGPT accounts. The feature does not stop prompt injections from appearing in content, but limits outbound network requests that could leak sensitive data. He sees it as a direct mitigation for the exfiltration leg of the “Lethal Trifecta,” while implying default ChatGPT settings are not robust against determined data theft attempts.
Published on UCL's Bentham's Gaze blog, this research analyzes GPS cryptographic signals over a 19-year span, likening the satellites to 'quiet numbers stations.' The authors explore the evolution of GPS encryption (such as military P(Y) code and civilian authentication), evaluating their cryptographic strength and potential vulnerabilities using modern computational analysis.
Anthropic explains how process sandboxes, VMs, filesystem boundaries, and egress controls limit what Claude agents can access. Claude.ai uses gVisor; local Claude Code uses Seatbelt on macOS and Bubblewrap on Linux; Cowork runs in a full VM. Simon Willison highlights the documentation quality, notes a previously missed file-exfiltration path, and plans to revisit Anthropic's open-source srt tool.
Ars Technica reports that a developer frustrated with vibe coders slipped an undisclosed prompt injection into jqwik-related code. The injected text allegedly instructed AI coding agents to delete application output. The incident highlights a new supply-chain risk: source code and project text can become adversarial instructions for agentic coding tools.
Daniel Stenberg says the curl security team is facing an unprecedented surge of credible, detailed AI-assisted vulnerability reports. Incoming reports are now 4-5 times higher than in 2024 and twice the 2025 rate, averaging more than one per day. The upside is that recent curl vulnerabilities have generally been LOW or MEDIUM severity, with the last HIGH CVE published in October 2023.
Ars Technica reports that Starlette, a Python package with about 325 million weekly downloads, has a critical vulnerability called BadHost. The flaw can let crafted Host headers confuse request.url.path, potentially bypassing middleware-based path authorization. AI infrastructure using FastAPI or Starlette, including vLLM, LiteLLM, MCP servers, LLM proxies, and agent frameworks, should upgrade Starlette and audit custom middleware.
As AI chatbots adopt increasingly sophisticated personas, hackers are shifting from basic prompt injections to social engineering attacks targeting these "personalities." Researchers warn that manipulating a chatbot's defined role (e.g., customer service or empathetic companion) makes it easier to bypass safety guardrails. This evolution poses a significant threat to agentic AI workflows that rely on consistent role-playing and external data integration.
In this Latent Space interview, the hosts hold an in-depth conversation with Ivan Burazin, co-founder and CEO of Daytona. Daytona originally started as an…
Well-known tech blogger Simon Willison has analyzed the announcements from Google I/O 2026. Since many major announcements are still in the "coming soon"…
The official Vercel Changelog announced that developers can now run Claude Managed Agents directly in Vercel Sandbox (sandbox environment). As AI Agents —…
This issue of Import AI 457, written by Jack Clark, delves into three forward-looking and stylistically distinct topics in the field of artificial…
According to a report by Ars Technica, corporate bug bounty programs are currently being bombarded with an "endless" stream of AI-generated junk reports (AI…
This report stems from Simon Willison's compilation of Terence Eden's follow-up coverage. The incident began when the UK's National Health Service (NHS), upon…
Simon Willison has released version 0.1a1 — the latest early alpha — of `datasette-agent`, an AI agent plugin for his well-known open-source data exploration…
In the current era of booming generative AI, one of the greatest challenges enterprises and developers face when adopting large language models (LLMs) is "data…
Hugging Face has officially announced that its popular open-source model weight storage format, Safetensors, has joined the PyTorch Foundation. This is an…
With the explosion of AI Agent technology, developers are shifting from building simple "chat interfaces" to constructing Agent systems capable of autonomously…
As the Notion platform has gradually evolved into a broader ecosystem, enabling users and developers to run custom scripts and automated workflows within…
In the current evolution of AI applications, AI agents have advanced from simple text generation to complex systems capable of autonomous planning, calling…
Vercel officially announced the launch of the "Vercel OSS Bug Bounty Program." The core purpose of this program is to improve the security of Vercel's many…
### The Age of Practical AI Agents Has Arrived In this edition of his column, Jack Clark shares his personal breakthrough in using AI Agents. Previously, many…
Vercel officially announced the launch of a striking security challenge — the "React2Shell" $1 Million Hacker Challenge. This initiative invites the world's…
### Background and Challenge: Automated Abuse Targeting AI Services Nous Research, a leading open-source AI research organization, has released many popular…
Google DeepMind has unveiled a new AI Agent called "CodeMender," designed to leverage advanced artificial intelligence to automatically remediate critical…
Google DeepMind has officially launched VaultGemma, currently the world's most capable large language model (LLM) trained entirely from scratch using…
Hugging Face has announced the launch of a new open-source project called "OpenEnv," aimed at collaboratively building an open and standardized execution…
As open-source AI models have grown explosively, model security has become an issue that can no longer be ignored. Traditional machine learning model formats…
As large language models (LLMs) have evolved, AI applications have moved beyond simple "question-and-answer conversations" toward "AI Agents" capable of…