Latest in AI

Showing:DevelopersClear ×

🔥 Trending today

anthropic3 open-source3 ipo2 export-controls2 privacy2 Regulation2 ai-governance2 ai-investment1 public-markets1 startup-ecosystem1

Topic

Release New Tool Tutorial Business Paper Benchmark Opinion Regulation

For

General Developers Designers Product Founders Marketing Researchers Students

Running Python code in a sandbox with MicroPython and WASM
Simon Willison's Weblog9 days agoNew Tool
Simon Willison describes his latest attempt to safely run Python plugin-style code inside his own applications. The alpha package micropython-wasm uses MicroPython compiled to WebAssembly, executed through the maintained wasmtime Python library. His goals include clean PyPI installation, CPU and memory limits, controlled file and network access, host functions, and reliable documentation.
Ask HN: Why is the HN crowd so anti-AI?
Hacker News (AI keywords)9 days agoOpinion
This Hacker News Ask HN post asks why the HN community seems so anti-AI. Since no body text is provided, the specific argument, examples, and comment direction cannot be verified. Based on the title alone, it is best classified as a community opinion discussion about AI skepticism, likely relevant to developers and general tech readers tracking sentiment around AI adoption.
OpenAI Help: Lockdown Mode★ 74
Simon Willison's Weblog9 days agoCommentary
Simon Willison notes that OpenAI’s previously teased Lockdown Mode is now live for eligible personal and self-serve Business ChatGPT accounts. The feature does not stop prompt injections from appearing in content, but limits outbound network requests that could leak sensitive data. He sees it as a direct mitigation for the exfiltration leg of the “Lethal Trifecta,” while implying default ChatGPT settings are not robust against determined data theft attempts.
Nordstjernen 1.0.0 Released: First Stable Version of the Web Project
Hacker News (AI keywords)9 days agoRelease
The open-source project Nordstjernen has officially released version 1.0.0 on GitHub. Housed under the 'nordstjernen-web' organization, this milestone release signifies a transition to a stable API and production readiness. Due to minimal release notes in the source, developers are encouraged to inspect the repository for tech stack and AI integration details.
Thousand Token Wood: shipping a multi-agent economy on a 3B model
Hugging Face Blog9 days agoTutorial
Based on the title, this Hugging Face Blog post presents Thousand Token Wood, a project shipping a multi-agent economy on a 3B model. The likely focus is practical system design under small-model constraints, rather than a new frontier-scale model release. Without the original text, details such as the exact model, architecture, benchmarks, code availability, and results cannot be confirmed.
Hermes Agent – Open-source AI agent with persistent memory
Hacker News (AI keywords)9 days agoNew Tool
Hermes Agent is an open-source autonomous agent by Nous Research, designed to run on your own server or machine with persistent local memory. It offers messaging gateways, scheduled automations, browser control, parallel sub-agents, reusable skills, and multiple LLM provider options. The project also targets MLOps and research workflows, including tool-calling trajectory generation, RL experiments, and exportable fine-tuning data.
Warren's Abstract Machine: A Tutorial Reconstruction
Hacker News (AI keywords)9 days agoTutorial
This repository preserves Hassan Ait-Kaci’s out-of-print tutorial on the Warren Abstract Machine, a key execution model for Prolog and logic programming systems. It is not a new AI model or product launch, but a useful historical and educational resource. The material is most relevant to developers and researchers interested in symbolic AI, compilers, unification, backtracking, and logic language runtimes.
Hacker News, Sans AI
Hacker News (AI keywords)9 days agoCommentary
The source text was not provided, so only the title and metadata can be used. The piece likely discusses filtering AI-related stories from Hacker News or the broader fatigue around AI-heavy tech news feeds. It appears to be commentary rather than a model release, paper, benchmark, or technical tutorial.
Auth issue impacted GitHub APIs and some Slack and Teams subscriptions
Hacker News (AI keywords)9 days agoIncident
GitHub resolved an incident on June 5, 2026 involving incorrect authorization failures for some authenticated requests. During 14:49-16:45 UTC, a small number of endpoints saw a 1-2% increase in 4xx responses, while most requests completed normally. The issue was tied to a recently enabled feature flag, which GitHub disabled; affected Slack and Teams subscriptions were later restored.
Google will pay SpaceX $920M per month for compute★ 78
TechCrunch AI9 days agoBusiness
SpaceX announced a major compute rental deal with Google one week before its expected Nasdaq debut. From October 2026 through June 2029, Google will pay $920 million per month for access to about 110,000 NVIDIA GPUs, plus CPUs, memory, and related components. The agreement resembles SpaceX’s recent Anthropic deal and includes a 90-day cancellation option after December 31, 2026.
Transformers are inherently succinct★ 74
Hacker News (AI keywords)9 days agoPaper
This paper studies transformer expressivity through succinctness: how compactly a formalism describes a language. It proves fixed-precision transformers can be exponentially more succinct than LTL and RNNs, and doubly exponentially more succinct than finite automata. The same succinctness makes verification hard, with basic problems such as emptiness and equivalence shown to be EXPSPACE-complete.
How to Stop Shipping Low-Quality RL Environments (with Examples)
Latent Space9 days agoTutorial
The post argues that low-quality RL environments are not harmless infrastructure bugs; they can make models worse by feeding them broken learning signals. Based on years of inspecting trajectories, the author highlights recurring environment and harness failures that teams need to fix. The practical lesson is to debug the training environment, grader, and interaction traces before blaming the model or scaling training.
Harness engineering: Leveraging Codex in an agent-first world★ 76
Hacker News (AI keywords)9 days agoCommentary
OpenAI describes an internal experiment where Codex generated an entire product codebase from an empty repository. The post argues that engineers shift from writing code to designing environments, constraints, documentation, and feedback loops. Key practices include repo-local knowledge, mechanical architecture enforcement, agent-readable UI and observability, lightweight PR flow, and continuous cleanup.
Tiny hackable CUDA language model implementation
Hacker News (AI keywords)9 days agoNew Tool
This GitHub project implements a compact generative pretrained transformer as an autoregressive byte-level sequence model. Its README describes causal self-attention, RoPE, feed-forward layers, AdamW, cross-entropy training, and BLAS/OpenBLAS-backed matrix operations, with CUDA toolkit listed in setup steps. It is most useful as an educational and experimental codebase, not as a production-grade replacement for large commercial LLMs.
Mantine DataTable source repo compromised; owner account suspended★ 74
Hacker News (AI keywords)10 days agoIncident
A GitHub security notice says Mantine DataTable and other repositories received unauthorized commits through the github-actions bot. The npm packages were reported safe; the risk targets developers who recently cloned or pulled the source and open it in VS Code, Cursor, Claude Code, Gemini, or run npm test. A later update links the payload to the Miasma / Shai-Hulud worm family and says a stolen credential is the likely path.
This is your laptop… on AI
The Verge AI10 days agoHardware
The episode frames developer conference season around Big Tech’s conviction that AI will reshape how people use technology. Nvidia CEO Jensen Huang is highlighted for describing a completely new way to use laptops. Based on the provided excerpt, this is more of an industry commentary on AI PCs than a concrete product-spec report.
Launch HN: General Instinct (YC P26) - Frontier models on edge devices
Hacker News (AI keywords)10 days agoNew Tool
General Instinct is a YC P26 company introduced through a Launch HN post. Its headline positioning is bringing frontier models to edge devices, suggesting local or embedded AI deployment rather than purely cloud-based inference. Since no article body is available, details such as supported models, hardware, benchmarks, pricing, and developer tooling cannot be verified from the provided source.
Gemma 4 QAT models: Optimizing model compression for mobile and laptop efficiency★ 72
Hacker News (AI keywords)10 days agoRelease
Google released new Gemma 4 checkpoints optimized with Quantization-Aware Training to preserve quality after compression. The release includes Q4_0 checkpoints and a mobile-focused quantization format that can reduce Gemma 4 E2B memory use to about 1GB, or below 1GB for a text-only configuration. The models are available through Hugging Face and supported across llama.cpp, Ollama, LM Studio, LiteRT-LM, Transformers.js, SGLang, vLLM, MLX, and Unsloth.
U.S. Military Turned GPS into a Global "Numbers Station"
Hacker News (AI keywords)10 days agoCommentary
According to investigative outlet 404 Media, evidence suggests the U.S. military has repurposed the Global Positioning System (GPS) into a modern "numbers station." By embedding encrypted data within standard GPS broadcasts, the military can securely transmit covert messages to agents or assets worldwide. This technique leverages existing satellite infrastructure to achieve global coverage with near-perfect receiver anonymity.
pg_durable: Microsoft open sources in-database durable execution
Hacker News (AI keywords)10 days agoRelease
Microsoft has open sourced pg_durable on GitHub, described in the title as an in-database durable execution project. From the name, it likely relates to PostgreSQL and persistence of execution state inside the database. Since no article body or README content was provided, details such as architecture, maturity, licensing, and production readiness cannot be confirmed.
Ask HN: What is your (AI) dev tech stack / workflow?
Hacker News (AI keywords)10 days agoCommentary
An Ask HN thread asks developers to share their current AI-assisted development setup for upcoming in-person workshops. The author wants guidance for beginners and working developers, with use cases ranging from static sites to FastAPI tools and Linux home automation. Replies cover Claude Code, Cursor, GitHub Copilot, VSCode, spec-driven development, TDD, multi-agent workflows, reviews, and quality control.
The token bill comes due: Inside the scramble to manage AI costs★ 78
TechCrunch AI10 days agoBusiness
TechCrunch reports that enterprise AI spending has shifted from rapid adoption to cost control. Even as per-token prices fall, broader AI rollout and agentic coding tools are multiplying consumption, pushing companies over budget. A new Tokenomics Foundation under the Linux Foundation aims to standardize AI token cost tracking, billing metrics, and efficiency language.
Your AI bill is out of control. Cloudflare can fix it now.
Cloudflare Blog10 days agoRelease
Cloudflare AI Gateway now supports real-time spend limits for AI usage across multiple providers. The feature is meant to prevent runaway token bills before costs spiral out of control. By integrating with Cloudflare Access, companies can apply identity-driven budgets and policies, making AI cost governance more closely tied to users, teams, and access rules.
The Quiet Numbers Station: Decoding Nineteen Years of GPS Cryptography
Hacker News (AI keywords)10 days agoPaper
Published on UCL's Bentham's Gaze blog, this research analyzes GPS cryptographic signals over a 19-year span, likening the satellites to 'quiet numbers stations.' The authors explore the evolution of GPS encryption (such as military P(Y) code and civilian authentication), evaluating their cryptographic strength and potential vulnerabilities using modern computational analysis.
Did Claude Increase Bugs in rsync?
Hacker News (AI keywords)10 days agoBenchmark
The article analyzes rsync releases to test whether versions containing Claude commits had unusually high bug rates. It uses severity-weighted bugs per 10 commits, exact permutation testing, and Fisher's exact test. With only two Claude-exposed releases, the evidence is limited, but both releases appear within normal historical variation rather than clear negative outliers.
Programmers will document for Claude, but not for each other
Hacker News (AI keywords)10 days agoOpinion
The post responds to complaints that programmers now write detailed CLAUDE.md and PROJECT.md files for AI, but not for coworkers. The author describes using Claude to maintain handoff notes between sessions and generate final high-level project summaries. His advice is to review those documents carefully, then commit them to the repository because they may help future maintainers.
Unlocking dependable responses with Gemini Enterprise Agent Platform’s Agentic RAG★ 72
Google Research Blog10 days agoRelease
Google Research and Google Cloud introduced an agentic RAG framework hosted on Gemini Enterprise Agent Platform. It uses multiple agents to plan, rewrite, route, retrieve, verify sufficient context, iterate, and synthesize answers. Google reports up to 34% factuality accuracy gains over standard RAG, plus 90.1% accuracy in a cross-corpus FramesQA setting with similar latency to single-corpus retrieval.
Quoting Andreas Kling
Simon Willison's Weblog10 days agoEthics
Simon Willison quotes Andreas Kling explaining Ladybird’s decision to stop accepting public pull requests. Kling argues that large patches once implied substantial effort, which could serve as a proxy for good faith, but generative AI has weakened that assumption. His central point is not whether code was typed by hand, but who takes responsibility for code once it enters a browser intended for real users.
Anthropic Co-founder Ben Mann Visits Taiwan to Discuss AI Safety and Claude Strategy
INSIDE 硬塞 AI10 days agoBusiness
Anthropic co-founder and Anthropic Labs lead Ben Mann made his first visit to Taiwan, according to INSIDE. The report highlights his role in leading Claude Code and the Model Context Protocol, two key parts of Anthropic’s developer-focused product direction. The discussion centered on Claude strategy, AI safety boundaries, jobs, and Taiwan’s strategic role in the AI landscape.
Arithmetic Without Numbers: How LLMs Do Math
Hacker News (AI keywords)10 days agoCommentary
The article asks whether LLM arithmetic is memorization, heuristics, real computation, or experimental assistance. It summarizes Rune experiments that decode operations and operands from frozen Llama activations, then route them to Python under a no-parser rule. The strongest supported claim is narrow: activation-derived tool arguments worked in scoped audits, while residual-state JIT replacement, long-number generation, and cross-model transfer remain brittle.

← PreviousPage 18Next →

Latest in AI

Running Python code in a sandbox with MicroPython and WASM

Ask HN: Why is the HN crowd so anti-AI?

OpenAI Help: Lockdown Mode★ 74

Nordstjernen 1.0.0 Released: First Stable Version of the Web Project

Thousand Token Wood: shipping a multi-agent economy on a 3B model

Hermes Agent – Open-source AI agent with persistent memory

Warren's Abstract Machine: A Tutorial Reconstruction

Hacker News, Sans AI

Auth issue impacted GitHub APIs and some Slack and Teams subscriptions

Google will pay SpaceX $920M per month for compute★ 78

Transformers are inherently succinct★ 74

How to Stop Shipping Low-Quality RL Environments (with Examples)

Harness engineering: Leveraging Codex in an agent-first world★ 76

Tiny hackable CUDA language model implementation

Mantine DataTable source repo compromised; owner account suspended★ 74

This is your laptop… on AI

Launch HN: General Instinct (YC P26) - Frontier models on edge devices

Gemma 4 QAT models: Optimizing model compression for mobile and laptop efficiency★ 72

U.S. Military Turned GPS into a Global "Numbers Station"

pg_durable: Microsoft open sources in-database durable execution

Ask HN: What is your (AI) dev tech stack / workflow?

The token bill comes due: Inside the scramble to manage AI costs★ 78

Your AI bill is out of control. Cloudflare can fix it now.

The Quiet Numbers Station: Decoding Nineteen Years of GPS Cryptography

Did Claude Increase Bugs in rsync?

Programmers will document for Claude, but not for each other

Unlocking dependable responses with Gemini Enterprise Agent Platform’s Agentic RAG★ 72

Quoting Andreas Kling

Anthropic Co-founder Ben Mann Visits Taiwan to Discuss AI Safety and Claude Strategy

Arithmetic Without Numbers: How LLMs Do Math