Paseo provides one interface for tools such as Claude Code, Codex, Copilot, OpenCode, and Pi. It runs agents through a local daemon on the user's own machine and supports desktop, mobile, web, and CLI clients. Its appeal is multi-agent orchestration and cross-device control, though real adoption depends on workflow fit, security, and reliability.
Microsoft announced MAI-Thinking-1, a 35B reasoning model available to select early partners, and MAI-Code-1-Flash, a 5B coding model rolling out to GitHub Copilot individual users in VS Code. Simon Willison highlights their relatively small parameter counts and Microsoft's claim that MAI-Thinking-1 was preferred to Sonnet 4.6 in internal blind evaluations. He also questions what Microsoft's clean and appropriately licensed training data claims mean in practice.
Microsoft unveiled Scout at Build as a new “autopilot” agent for Microsoft 365. It can connect across Teams, Outlook, OneDrive, and SharePoint, use an Entra identity, and interact with external apps through MCP. The release is experimental for Frontier customers, with security controls required. Analysts warn Scout may amplify existing governance problems because it can act on data, not merely surface it.
Anthropic is expanding its Project Glasswing security vulnerability program and access to Mythos. The rollout covers 150 organizations across 15 countries, focusing on power, water, healthcare, and communications infrastructure. The company is targeting sectors where a cyberattack could affect as many as 100 million people, although implementation details and participating organizations were not disclosed in the provided text.
A Hacker News poster says they received a self-promotional AI/LLM services email shortly after posting in a job-seeking thread. The email appeared to exploit the context of their search, turning a moment of hope into another discouraging spam interaction. The discussion broadened into concerns about AI-generated cold outreach, recruiter spam, cybersecurity pitches, and the need for basic empathy in automation.
Anthropic is expanding Project Glasswing, its program for using Claude Mythos Preview to find vulnerabilities in critical software. The new cohort includes around 150 organizations across more than 15 countries, including infrastructure providers, vendors, nonprofits, and open-source maintainers. Anthropic frames the expansion as preparation for a world where powerful cyber-capable AI models become cheaper and more widely available, shifting focus from finding bugs to validating, disclosing, patching, and deploying fixes.
Simon Willison released Pasted File Editor, a browser prototype inspired by Claude's handling of large pasted text. Instead of filling the editor with a large paste, the tool turns the content into a file attachment. It also supports opening files directly, dragging files onto the interface, and displaying images as thumbnails. Codex desktop helped build the prototype.
Stanford CS336’s CLAUDE.md sets boundaries for AI coding assistants such as ChatGPT, Claude Code, GitHub Copilot, and Cursor. Agents may explain concepts, review student-written code, suggest debugging checks, and point to course materials. They should not write code, complete TODOs, edit repositories, run shell commands, or implement core assignment components for students.
This is Hacker News’ June 2026 “Who wants to be hired?” thread for individuals actively looking for work. Posters are asked to share location, remote preference, relocation willingness, technologies, resume or CV, and email. Visible comments include developers, full-stack engineers, data science consultants, systems engineers, and designers, with some mentioning LLM integration, RAG, AI agents, Gemini API, and Claude tool calling as part of their experience.
Simon Willison relates to David Wilson's reflection on launching more than 16 projects with AI tooling. A request for a quick Claude script can expand into an hour-long project without solving the original problem. Coding agents may produce tested, documented solutions rapidly, but people can maintain only so many projects. The critical skill may be discipline: deciding which ideas deserve continued attention.
The article introduces Agent Radio, a messaging feature in h5i 0.1.5 for coding agents such as Claude Code and Codex. Instead of relying on an external server, it stores JSONL messages in a Git ref and syncs them through normal push and pull flows. The post includes setup commands, live message watching, PR summary posting, and a short explanation of the i5h protocol.
Anthropic explains how process sandboxes, VMs, filesystem boundaries, and egress controls limit what Claude agents can access. Claude.ai uses gVisor; local Claude Code uses Seatbelt on macOS and Bubblewrap on Linux; Cowork runs in a full VM. Simon Willison highlights the documentation quality, notes a previously missed file-exfiltration path, and plans to revisit Anthropic's open-source srt tool.
Simon Willison demonstrates an experiment for running Python ASGI apps entirely in the browser using Pyodide and a Service Worker. The approach addresses a Datasette Lite limitation: HTML returned through intercepted navigation did not execute script tags, breaking features and plugins. Claude Opus 4.8, used through Claude Code for web, helped explore the implementation. Basic ASGI and Datasette 1.0a31 demos are available.
Simon Willison highlights Chad Whitacre’s decision to leave tech and Open Source, framed not as a forum threat but as concrete action. Whitacre describes wanting to become “AI Amish” or “Internet Amish,” moving toward an offline, analog life closer to 1980 than 1780. A previous post about using Claude Code with Opus 4.5 shows how agentic AI felt intoxicating and unsettling enough to push him away from technological accelerationism.
The source is a Hacker News AI-keyword item linking to a Mastodon post titled “Rsync 3.4.3 has hundreds of Claude commits.” No original body text is available, so the only reliable claim is that many commits in Rsync 3.4.3 are described as Claude-related. The exact meaning, review process, quality impact, and author’s stance cannot be confirmed from the title alone.
Quandri measured MCP tool schemas in its Claude Code setup and found significant context overhead across Linear, Notion, Slack, and Postgres. The post argues MCP can be slower, less reliable, and harder to debug than direct CLI/API usage. It recommends CLI-first workflows and on-demand Skills, while noting MCP still fits services without CLIs, non-developer users, bidirectional communication, and guarded production database access.
Roundtable argues that CAPTCHA image recognition is largely solved, but process-level behavior still separates humans from AI agents. Their CogCAPTCHA30 benchmark combines CAPTCHA with cognitive psychology tasks to test not only outputs, but how answers are produced. Results suggest frontier models like Claude, GPT, and Gemini are not necessarily more humanlike than smaller or cognition-trained models.
Anthropic released Claude Opus 4.8 as a rapid iteration focused on stronger integrity and reliability for high-risk tasks. The company also previewed Dynamic Workflows, a feature designed to coordinate multiple agents on large-scale jobs such as code migration. The article mentions Mythos entering a countdown toward unblocking, but does not provide detailed availability or product specifics.
Anthropic completed a $65 billion Series H round, bringing its valuation to $965 billion and reportedly surpassing OpenAI. The round included strategic investments from memory makers Micron, Samsung, and SK Hynix. The news highlights how frontier AI companies are increasingly tied to hardware and memory supply chains, as investors continue backing foundational model competition.
INSIDE reports that SYSTEX is pushing forward with SaaS and enterprise AI despite debate sparked by Claude Code and claims that “SaaS is dead.” The Taiwanese IT services leader reported strong Q1 2026 earnings, with net profit after tax of NT$718 million, up 164.5% year over year. It also introduced EAP, an Enterprise AI Platform built on Amazon Web Services cloud-native architecture to support enterprise AI adoption.
INSIDE reports that SYSTEX is positioning its Enterprise AI Platform as a cloud-native route for enterprise generative AI adoption. The article contrasts this with recent “SaaS is dead” discussions sparked by tools such as Claude Code. SYSTEX also reported strong Q1 2026 earnings, with after-tax profit of NT$718 million, up 164.5% year over year.
The post inspects @anthropic-ai/[email protected] and documents configuration fields not covered by the official docs. It highlights hook JSON responses, hidden skill and agent frontmatter, auto-mode rules, persistent memory, dream consolidation, Magic Docs, and permission syntax. The author frames these as practical but version-specific findings, with experimental fields especially likely to change.
The visible AINews item centers on Anthropic, claiming a $965B Series H alongside Opus 4.8 and Dynamic Workflows/ultracode releases. The available body text is extremely brief, offering only the editorial line “Total Anthropic victory!” It signals a major Anthropic narrative across capital, Claude models, and developer workflows, but provides no detailed specs, benchmarks, investor terms, or availability information.
Simon Willison highlights Anthropic’s latest Series H announcement, where the company says run-rate revenue crossed $47 billion earlier in May. He traces prior disclosures: about $9 billion at the end of 2025, $14 billion in February 2026, and over $30 billion in April. The post also addresses skepticism, arguing that these numbers appeared in fundraising announcements, where knowingly misleading investors would be securities fraud.
Anthropic shipped Claude Opus 4.8, and Simon Willison highlights the unusually restrained release language: a “modest but tangible improvement.” The model keeps most Opus 4.7 pricing and specs, while evaluations suggest it is more likely to flag uncertainty and less likely to ignore flaws in code it wrote. Developer-relevant changes include mid-conversation system messages and a lower prompt-cache minimum of 1,024 tokens.
Simon Willison released llm-anthropic 0.25.1 with support for the new Claude Opus 4.8 model, exposed as claude-opus-4.8. The release adds a -o fast 1 option for Anthropic fast mode, limited to organizations that have the feature enabled. It also changes default max_tokens behavior so each model now defaults to its maximum output instead of 8,192.
Simon Willison shared markdown-svg-renderer, a customized Markdown rendering tool with special handling for fenced SVG code blocks. It renders the SVG image and also provides a tab for switching back to the source code. Users can paste Markdown directly or load a CORS-enabled Markdown file or Gist by URL, with an example using LLM pelican logs for Opus 4.8.
Illinois lawmakers passed a landmark AI accountability bill requiring major frontier AI developers to publish safety frameworks, assess catastrophic risks, report incidents, and undergo third-party audits. OpenAI and Anthropic supported the measure, while industry groups warned that state-level rules could impose subjective compliance duties without national standards. The bill signals that states are continuing to fill the federal AI regulation gap despite Trump’s efforts to limit fragmented state oversight.
Anthropic has released a new Opus model, Opus 4.8, alongside a tool called Dynamic Workflows. The report says the tool is designed to coordinate swarms of subagents, pointing to a focus on multi-agent orchestration. The source does not provide benchmarks, pricing, API details, availability, or concrete use cases.
Anthropic is releasing Claude Opus 4.8 and highlighting the model’s “honesty” as a key improvement. The company says it trains its models to avoid unsupported claims, addressing a broader issue where AI systems sometimes jump to conclusions. Based on the provided excerpt, the update is positioned around reliability and uncertainty handling rather than a specific new tool or benchmark result.