Latest in AI

Showing:DevelopersClaudeClear ×

🔥 Trending today

anthropic4 open-source3 amazon3 ai-regulation2 government-policy2 export-controls2 geopolitics2 privacy2 python-packaging2 webassembly2

Topic

Release New Tool Tutorial Business Paper Benchmark Opinion Regulation

For

General Developers Designers Product Founders Marketing Researchers Students

Managing Multiple MCP Servers: How to Prevent Context Pollution and Token Waste
r/LocalLLaMA top day7 days agoCommentary
A popular Reddit thread on r/LocalLLaMA addresses the challenge of loading multiple Model Context Protocol (MCP) servers at startup, which floods the context window with tool definitions. Users are discussing potential solutions, including using MCP proxies/hubs to route requests through a single endpoint or implementing lazy-loading. This highlights a growing need for better orchestration tools as the local MCP ecosystem expands.
Anthropic, please ship an official Claude Desktop for Linux
Hacker News (AI keywords)7 days agoOpinion
The available source only provides the title, which asks Anthropic to ship an official Claude Desktop app for Linux. It appears to be a community feature request rather than a confirmed product announcement. Without the issue body or official response, there is no basis to infer Anthropic’s plans, timeline, or technical reasoning.
Anthropic/OpenAI may be spending more than $1000 for every $100 you pay them
Hacker News (AI keywords)7 days agoBusiness
The author uses a Claude Code coding experiment to estimate the API-equivalent cost of serious LLM coding. They argue simple chats are cheap, but complex reasoning and multi-file coding can burn large amounts of visible and hidden tokens. The piece is skeptical and estimate-driven, concluding that current $100/month plans may be heavily subsidized and economically fragile.
LLMs are eroding my software engineering career and I don't know what to do
Hacker News (AI keywords)7 days agoOpinion
The author argues that LLMs are eroding three pillars of his software engineering career: domain knowledge, debugging skill, and architecture judgment. Tools like ChatGPT, Claude, Claude Code, Codex, MCP, Sentry MCP, and DataDog MCP increasingly handle design, implementation, and difficult production bugs. The essay frames this as a labor-market concern, not just a tooling debate: if expertise becomes promptable, engineers may struggle to remain differentiated.
Show HN: Lathe - Use LLMs to learn a new domain, not skip past it
Hacker News (AI keywords)7 days agoNew Tool
Lathe is an open-source tool for generating hands-on technical tutorials with LLM skills. It combines a Go CLI, local reading UI, and commands for asking questions, extending tutorials, and verifying outputs. The project supports Claude Code, Cursor, and Codex workflows, with an emphasis on learning by typing and reasoning through the material yourself.
Her · हेर — a detective for your Claude Code sessions
Hugging Face Blog7 days agoNew Tool
The title presents Her · हेर as a detective for Claude Code sessions. Because the article body is unavailable, its actual features, setup, and implementation details cannot be verified. Conservatively, it appears relevant to developers who want better visibility into what happened during AI-assisted coding sessions.
I design with Claude more than Figma now
Hacker News (AI keywords)7 days agoOpinion
Jane Street designer Edwin Morris describes moving from skepticism about LLMs to using Claude as a core design tool. Instead of relying mainly on specs and Figma mockups, he now builds working prototypes directly in the real codebase. The post also explores the collaboration risks: prototypes must remain disposable proposals, not finished features that shut reviewers out of design input.
Mantine DataTable source repo compromised; owner account suspended★ 74
Hacker News (AI keywords)9 days agoIncident
A GitHub security notice says Mantine DataTable and other repositories received unauthorized commits through the github-actions bot. The npm packages were reported safe; the risk targets developers who recently cloned or pulled the source and open it in VS Code, Cursor, Claude Code, Gemini, or run npm test. A later update links the payload to the Miasma / Shai-Hulud worm family and says a stolen credential is the likely path.
Ask HN: What is your (AI) dev tech stack / workflow?
Hacker News (AI keywords)9 days agoCommentary
An Ask HN thread asks developers to share their current AI-assisted development setup for upcoming in-person workshops. The author wants guidance for beginners and working developers, with use cases ranging from static sites to FastAPI tools and Linux home automation. Replies cover Claude Code, Cursor, GitHub Copilot, VSCode, spec-driven development, TDD, multi-agent workflows, reviews, and quality control.
The token bill comes due: Inside the scramble to manage AI costs★ 78
TechCrunch AI9 days agoBusiness
TechCrunch reports that enterprise AI spending has shifted from rapid adoption to cost control. Even as per-token prices fall, broader AI rollout and agentic coding tools are multiplying consumption, pushing companies over budget. A new Tokenomics Foundation under the Linux Foundation aims to standardize AI token cost tracking, billing metrics, and efficiency language.
Did Claude Increase Bugs in rsync?
Hacker News (AI keywords)9 days agoBenchmark
The article analyzes rsync releases to test whether versions containing Claude commits had unusually high bug rates. It uses severity-weighted bugs per 10 commits, exact permutation testing, and Fisher's exact test. With only two Claude-exposed releases, the evidence is limited, but both releases appear within normal historical variation rather than clear negative outliers.
Programmers will document for Claude, but not for each other
Hacker News (AI keywords)9 days agoOpinion
The post responds to complaints that programmers now write detailed CLAUDE.md and PROJECT.md files for AI, but not for coworkers. The author describes using Claude to maintain handoff notes between sessions and generate final high-level project summaries. His advice is to review those documents carefully, then commit them to the repository because they may help future maintainers.
Anthropic Co-founder Ben Mann Visits Taiwan to Discuss AI Safety and Claude Strategy
INSIDE 硬塞 AI9 days agoBusiness
Anthropic co-founder and Anthropic Labs lead Ben Mann made his first visit to Taiwan, according to INSIDE. The report highlights his role in leading Claude Code and the Model Context Protocol, two key parts of Anthropic’s developer-focused product direction. The discussion centered on Claude strategy, AI safety boundaries, jobs, and Taiwan’s strategic role in the AI landscape.
Trend Micro Joins Anthropic Project Glasswing to Defend Taiwan’s AI Supply Chain★ 72
INSIDE 硬塞 AI9 days agoBusiness
Anthropic introduced Project Glasswing after Claude Mythos Preview showed the ability to rapidly find high-risk vulnerabilities and generate connected attack commands. Trend Micro’s TrendAI has joined the framework, becoming the first Taiwanese cybersecurity vendor to do so. The article frames the move around Taiwan’s strategic AI hardware role and a new defensive logic: using AI to counter malicious AI.
Fine-tuning an LLM to write docs like it's 1995
Hacker News (AI keywords)9 days agoTutorial
The author builds a corpus from old Microsoft manuals, cleans OCR text, generates instruction-style JSONL examples, and fine-tunes Llama 3.1 8B and Qwen 2.5 7B with QLoRA. Tests cover malloc(), a fictional Win32 API, and a deliberately anachronistic REST API prompt. Qwen fine-tunes transfer the period documentation style best, but the experiment also shows hallucination risks, tuning complexity, and why these models augment rather than replace technical writers.
Microsoft AI Chief Says Anthropic Is Too Expensive as Enterprise AI Costs Rise
INSIDE 硬塞 AI9 days agoBusiness
Microsoft AI chief Mustafa Suleyman reportedly criticized Anthropic’s models as unacceptably expensive, highlighting rising enterprise AI costs. The article frames this as part of a broader “AI tax” problem, with companies reassessing ROI as vendor pricing pressure grows. Microsoft’s MAI models are presented as a potential internal alternative to reduce reliance on costly external providers.
Ahead of its IPO, Anthropic’s Daniela Amodei shrugs off doubts about AI returns★ 76
TechCrunch AI10 days agoBusiness
TechCrunch reports that Anthropic has confidentially filed for an IPO while private investor demand remains strong. Co-founder Daniela Amodei said frontier AI companies need large amounts of capital because model training and inference are expensive. She also downplayed doubts about enterprise AI returns, arguing businesses are still early in learning how to use AI effectively, and explained why Anthropic prefers not to overbuild its own compute infrastructure.
Show HN: Formally verified polygon intersection, Opus 4.8 one-shot
Hacker News (AI keywords)10 days agoNew Tool
This GitHub project presents a formally verified multipolygon intersection algorithm checked in Lean 4. The author argues trust comes from the Lean checker and a small human-reviewed specification, not from trusting LLM output directly. It also documents how Claude Opus versions improved on Lean proof work, with Opus 4.8 reportedly completing larger proof strategies that earlier attempts could not.
Reality: The Final Eval — Lukas Petersson and Axel Backlund of Andon Labs
Latent Space10 days agoBenchmark
Latent Space talks with Lukas Petersson and Axel Backlund of Andon Labs, the authors behind VendingBench. The episode focuses on evaluating Claude models across a range from Haiku to Mythos. It also discusses how they build frontier evals from scratch, with an emphasis on creating benchmarks that remain useful and meaningful over time.
Show HN: Boxes.dev: ditch localhost; run Claude Code and Codex in the cloud
Hacker News (AI keywords)10 days agoNew Tool
Boxes.dev appeared on Hacker News as a Show HN post, positioning itself as a way to move Claude Code and Codex workflows from localhost to the cloud. Based only on the title, it seems aimed at cloud development or remote agent execution. The provided source does not include details on architecture, pricing, security, integrations, or limitations.
My Agent Skill for Test-Driven Development
Hacker News (AI keywords)10 days agoTutorial
Jason Swett argues that uncoached AI agents still tend to write poor tests: vague, overcomplicated, tautological, or performative. His personal TDD skill guides agents through a specify-encode-fulfill loop inspired by Kent Beck’s Canon TDD. He also uses separate test and software design review skills, sometimes with Claude, to catch weak test design and prompt cleanup before implementation.
Reve 2 and Ideogram 4: Layouts in Imagegen
Latent Space10 days agoRelease
Latent Space’s roundup frames image composition as a major barrier now being tackled by layout-aware image models. Reve 2.0 emphasizes precise generation and editing with layouts, while Ideogram 4.0 uses bounding boxes tied to region descriptions. The issue also covers MAI-Thinking-1, Gemma 4 12B, open audio models, agent execution layers, and model-routing cost debates.
I built a vulnerable app and spent $1,500 seeing if LLMs could hack it
Hacker News (AI keywords)10 days agoBenchmark
The author built a vulnerable React Native app with a Python backend and a Firebase access-control flaw. GPT 5.5 solved 7 of 10 runs, while Deepseek and Claude variants solved fewer attempts. Many other models failed due to refusals, API-focused tunnel vision, false positives, or inability to use the exposed Firebase path correctly.
How we contain Claude across products★ 74
Hacker News (AI keywords)10 days agoCommentary
Anthropic describes containment as the core security strategy for increasingly capable Claude agents. The post compares ephemeral containers for claude.ai, OS-level sandboxing and approvals for Claude Code, and VM isolation for Claude Cowork. It also details missed risks, including pre-trust project config execution, user-delivered prompt injection, exfiltration through approved domains, and reduced enterprise visibility inside VMs.
Lovable signs multi-year deal with Google Cloud to up usage 5x, source says
TechCrunch AI10 days agoBusiness
TechCrunch AI reports that Lovable and Google signed an expanded multi-year agreement. The deal reportedly includes a fivefold expansion of Lovable’s footprint on Google Cloud. It also includes expanded access to Anthropic Claude, though the article does not specify contract value, timing, exact Claude usage, or any immediate product changes for users.
How LLMs Actually Work
Hacker News (AI keywords)11 days agoTutorial
The article explains how modern LLMs convert text into token IDs, embeddings, and position-aware vectors before passing them through stacked transformer blocks. It covers attention, multi-head attention, KV cache, GQA, feed-forward networks, MoE, residual streams, normalization, and decoding. Its goal is educational: helping readers understand the common architecture behind many current model families and read model cards or papers more confidently.
No, Artificial Intelligence Is Not Conscious★ 72
Hacker News (AI keywords)11 days agoOpinion
Ted Chiang criticizes the anthropomorphic framing around Anthropic’s Claude and its constitution. He argues that LLMs are sentence-continuation systems producing fictional conversational roles, not entities with subjective experience. The essay warns that presenting chatbots as morally aware risks misleading users and shifting responsibility away from humans and companies.
Uber Caps Usage of AI Tools Like Claude Code to Manage Costs
Simon Willison's Weblog11 days agoBusiness
Uber has reportedly capped employee token spending at $1,500 per month for each agentic AI coding tool, including Cursor and Claude Code. Simon Willison frames this as a rational response to overspending, especially after earlier discussion that Uber exhausted its 2026 AI budget in four months. He estimates that two actively used tools would imply a $36,000 annual cap per engineer, about 11% of median US Uber software engineer compensation.
Microsoft Build: MAI-Thinking-1 and MAI Family Models★ 78
Latent Space11 days agoRelease
Microsoft used Build to present itself as both an AI platform and a first-party model lab, announcing seven MAI models across reasoning, code, image, transcription, and voice. The standout was MAI-Thinking-1, described as a 35B active MoE with 256K context and clean data lineage. The recap also ties the launches to GitHub Copilot, Windows agent runtime ambitions, Web IQ grounding APIs, Foundry distribution, and MAIA 200 hardware.
Claude Code Lead Boris Cherny Says His Code Is 100% AI-Written
INSIDE 硬塞 AI11 days agoCommentary
Claude Code lead Boris Cherny says his code is now 100% written by AI while he runs hundreds of agents in parallel. The article frames engineers less as manual coders and more as conductors who define problems, review outputs, and shape architecture. It highlights a broader shift in software development workflows driven by AI coding agents, without presenting detailed benchmarks or implementation data.

← PreviousPage 4Next →

Latest in AI

Managing Multiple MCP Servers: How to Prevent Context Pollution and Token Waste

Anthropic, please ship an official Claude Desktop for Linux

Anthropic/OpenAI may be spending more than $1000 for every $100 you pay them

LLMs are eroding my software engineering career and I don't know what to do

Show HN: Lathe - Use LLMs to learn a new domain, not skip past it

Her · हेर — a detective for your Claude Code sessions

I design with Claude more than Figma now

Mantine DataTable source repo compromised; owner account suspended★ 74

Ask HN: What is your (AI) dev tech stack / workflow?

The token bill comes due: Inside the scramble to manage AI costs★ 78

Did Claude Increase Bugs in rsync?

Programmers will document for Claude, but not for each other

Anthropic Co-founder Ben Mann Visits Taiwan to Discuss AI Safety and Claude Strategy

Trend Micro Joins Anthropic Project Glasswing to Defend Taiwan’s AI Supply Chain★ 72

Fine-tuning an LLM to write docs like it's 1995

Microsoft AI Chief Says Anthropic Is Too Expensive as Enterprise AI Costs Rise

Ahead of its IPO, Anthropic’s Daniela Amodei shrugs off doubts about AI returns★ 76

Show HN: Formally verified polygon intersection, Opus 4.8 one-shot

Reality: The Final Eval — Lukas Petersson and Axel Backlund of Andon Labs

Show HN: Boxes.dev: ditch localhost; run Claude Code and Codex in the cloud

My Agent Skill for Test-Driven Development

Reve 2 and Ideogram 4: Layouts in Imagegen

I built a vulnerable app and spent $1,500 seeing if LLMs could hack it

How we contain Claude across products★ 74

Lovable signs multi-year deal with Google Cloud to up usage 5x, source says

How LLMs Actually Work

No, Artificial Intelligence Is Not Conscious★ 72

Uber Caps Usage of AI Tools Like Claude Code to Manage Costs

Microsoft Build: MAI-Thinking-1 and MAI Family Models★ 78

Claude Code Lead Boris Cherny Says His Code Is 100% AI-Written