Latest in AI

🔥 Trending today

anthropic3 amazon3 ai-regulation2 government-policy2 export-controls2 open-source2 geopolitics2 privacy2 python-packaging2 webassembly2

Topic

Release New Tool Tutorial Business Paper Benchmark Opinion Regulation

For

General Developers Designers Product Founders Marketing Researchers Students

Microsoft AI Chief Mustafa Suleyman Walks Back Comments on AI Replacing White-Collar Jobs
The Verge AI5 days agoCommentary
Microsoft AI CEO Mustafa Suleyman walked back his previous comments about AI automating white-collar jobs like lawyers and accountants. Speaking on the Decoder podcast, he clarified that AI is meant to help these professionals complete specific tasks, such as drafting emails, rather than replacing their entire roles. This shift highlights the ongoing industry effort to balance AI capability marketing with public concerns over job displacement.
Warning: OpenCode Go/Zen Has No Account or Data Deletion Feature
r/LocalLLaMA top day5 days agoIncident
A Reddit user warns that OpenCode Go/Zen provides no mechanism for users to delete their account or personal data. Several GitHub issues have been filed but mostly ignored; one official response only said deletion would 'probably' be added eventually. For privacy-conscious developers, this is a significant red flag before signing up to the platform.
Build a Basic AI Agent from Scratch: Long Task Planning
Hacker News (AI keywords)5 days agoTutorial
This source appears to be a tutorial about constructing a basic AI agent from scratch. Based only on the title, its focus is likely long-task planning: how an agent breaks a larger objective into steps and works through them over time. No article body was provided, so specific implementation choices, model providers, tools, code examples, or evaluation results cannot be confirmed.
Single-slot half-height PCIe V100 with NVLink appears in China
r/LocalLLaMA top day5 days agoHardware
A r/LocalLLaMA post says a Bilibili creator has shown a single-slot, half-height PCIe V100 with NVLink on a custom PCB. The card is described as 16 cm long, passively cooled by default, capped at 75W, with another version supporting up to 300W. The 16GB model is expected around or below ¥1500, with a 32GB version reportedly planned, but it is not yet available for purchase.
Apple’s AI promises are finally, almost, sort of, here★ 72
The Verge AI5 days agoCommentary
Apple kicked off its annual developer conference with bold AI promises centered around a revamped "Siri AI" and Apple Intelligence. While CEO Tim Cook touted these as boundary-pushing innovations, the announcements largely represent Apple playing catch-up in the generative AI race. The slow, phased rollout suggests Apple is still struggling to match the rapid pace of competitors like Microsoft and Google.
Rick & Morty
r/LocalLLaMA top day5 days agoCommentary
This r/LocalLLaMA top-day post is a short image meme titled “Rick & Morty.” The only accompanying text says, “nobody expected HF there,” suggesting surprise at HF appearing in the image’s context. There are no technical claims, model details, releases, or benchmarks, so its value is mainly as a small signal of community culture around Hugging Face / HF and local LLM discussions.
Google Introduces Gemma 4 12B: A Unified, Encoder-Free Multimodal Model★ 85
Google DeepMind Blog5 days agoRelease
Google DeepMind has unveiled Gemma 4 12B, a next-generation open-weights model featuring a unified, encoder-free multimodal architecture. By eliminating the traditional separate vision encoder (such as ViT), it processes diverse modalities directly within a single Transformer network. This design simplifies training, reduces inference latency, and enhances cross-modal alignment, marking a significant milestone for open-source AI.
PR-CAD: Progressive Refinement for Text-to-CAD Generation with LLMs
Hacker News (AI keywords)5 days agoPaper
This arXiv paper introduces PR-CAD, a framework for controllable and faithful text-to-CAD generation with large language models. It treats CAD creation and editing as one progressive refinement process rather than separate tasks. The authors curate an interaction dataset and report state-of-the-art controllability and faithfulness on public benchmarks.
Google DeepMind Launches Initiative to Power the Future of Robotics in Europe★ 70
Google DeepMind Blog5 days agoBusiness
Google DeepMind has unveiled a strategic initiative to power the future of robotics in Europe. The program focuses on advancing Embodied AI and physical AI through deep collaborations with European academic institutions and industry partners. By combining DeepMind's AI expertise with Europe's strong engineering foundation, the initiative aims to accelerate breakthroughs in robotic generalization and safety.
PSA: Throttle GPU Power Limits for Major Energy Savings with Minimal Inference Performance Loss
r/LocalLLaMA top day5 days agoHardware
A Reddit user reminds the local LLM community that throttling GPU power limits offers outsized energy savings with minimal performance cost. On dual Radeon VII cards, cutting power from 250W to 100W per card resulted in less than 10% drop in inference speed. LLM inference is memory-bound rather than compute-bound, making it uniquely tolerant of reduced GPU clock speeds compared to training or rendering tasks.
Sandstone Raises $30M Series A to Bring AI to In-House Legal Teams
TechCrunch AI5 days agoBusiness
Legal tech startup Sandstone has raised $30 million in a Series A funding round. The round was led by Lightspeed Partners, with participation from Sequoia Capital. Sandstone plans to use the funds to develop and deploy AI-driven solutions tailored specifically for corporate in-house legal departments.
Apple’s Best AI Idea Looks a Lot Like Vibe Coding★ 75
The Verge AI5 days agoCommentary
While Apple's standard AI features like chatbots and image generation play catch-up, its integration of AI with Shortcuts stands out. By allowing users to generate complex multi-app workflows and automate Safari tabs using simple natural language, Apple is bringing "vibe coding" to the masses. This approach shifts the focus from generic AI assistants to highly personalized, OS-level task automation.
Apple Announced a New On-Device Inference Engine for Apple Silicon
r/LocalLLaMA top day5 days agoRelease
Apple announced CoreAI at WWDC, which the post frames as a possible future replacement for CoreML and an alternative to MLX, llama.cpp, and torch for optimized on-device inference. Models still need conversion through Python scripts, and current supported models appear mostly from mid-2025. No performance data is available yet; the author expects it may trail MLX on GPU, but Apple’s 20B on-device foundation model claim suggests larger app-bundled models could become possible.
Is Grep All You Need? How Agent Harnesses Reshape Agentic Search
Hacker News (AI keywords)5 days agoPaper
Echoing the famous Transformer paper, this work asks whether grep alone is sufficient for agentic search scenarios. The study focuses on 'agent harnesses'—the scaffolding wrapping an LLM, including prompting strategy, tool access, and memory—as the primary driver of search quality. Findings suggest harness design may matter more than the underlying model, challenging the community's focus on model scaling.
Rust-native CPU-only LFM2.5-8B-A1B inference library "bebelm" published as cargo crate
r/LocalLLaMA top day5 days agoNew Tool
Community developer maximecb has published bebelm, a Rust-native, GPU-free inference implementation of Liquid AI's LFM2.5-8B-A1B model, available on crates.io. Decode speed reaches ~37 tokens/s on a Ryzen 7950x with ~7GB memory footprint; prefill is unoptimized and currently similar in speed to decode. The library supports tool-use callbacks, weight sharing across multiple Agent instances with independent KV caches, and Agent cloning to skip repeated prefill on shared prompts.
Apple Says Its AI is Still Private Even When Running on Google's Servers★ 75
Ars Technica AI5 days agoEthics
Apple clarified that running some of its AI models on Google's cloud infrastructure does not compromise user privacy. Through its Private Cloud Compute (PCC) architecture, Apple ensures that all data is processed in secure enclaves with end-to-end encryption. Consequently, Google has zero access to user data, addressing privacy concerns over Apple's cloud partnerships.
Lovable Hits $500M Annualized Revenue with 1 Million New Projects Weekly★ 82
TechCrunch AI5 days agoBusiness
AI software development platform Lovable has surpassed $500 million in annualized run-rate revenue (ARR). The company reports that users are now launching over 1 million new projects per week on the platform. This rapid growth highlights a major shift, with users increasingly leveraging AI to build full-scale businesses and replace legacy internal software.
Apple’s AI pitch will live or die by its privacy promise
The Verge AI5 days agoCommentary
The Verge argues Apple’s WWDC 2026 AI strategy centers on privacy rather than raw capability. Apple says Siri AI and Apple Intelligence will run on-device when possible and use Private Cloud Compute only when needed. But reliance on Google Gemini, Google Cloud, Nvidia, Intel, and Google Titan hardware complicates Apple’s original privacy story, even if its default data collection remains more limited than rivals.
How an e-scooter founder raised $5 million to build space data centers
TechCrunch AI5 days agoBusiness
Euwyn Poon, co-founder of e-scooter company Spin, has raised $5 million for his new startup, Orbital. The company aims to launch 10,000 space-based data centers into orbit. By moving compute infrastructure into space, Orbital seeks to bypass Earth's power and cooling constraints while providing edge computing capabilities directly in orbit.
Show HN: Gravity – Interactive Solar-System Simulator from Newton to Einstein
Hacker News (AI keywords)5 days agoNew Tool
Gravity is an interactive, web-based solar system simulator that lets users explore celestial mechanics in their browser. It uniquely bridges classical Newtonian physics and Einstein's general relativity, allowing users to visualize and compare orbital behaviors under different gravitational models. It serves as an engaging educational tool for physics enthusiasts and students alike.
CSS: The Unavoidable Bad Parts
Hacker News (AI keywords)5 days agoTutorial
The post frames CSS as learnable in a useful subset, but full of surprising defaults and edge cases. It covers semantic HTML, wrappers, layout, browser defaults, resets, classless CSS, selectors, box sizing, margins, flexbox, responsiveness, pixels, font sizing, line height, and word breaking. The advice is pragmatic: keep markup semantic, reset inconsistent defaults, understand layout constraints, and test readability across configurations.
Jetson Orin NX Build for Hermes Agent + Benchmarking
r/LocalLLaMA top day5 days agoHardware
The post describes turning an unused Jetson Orin NX into a compact local LLM server for Hermes Agent testing. The goals were low noise, over 10 tok/s generation, 300 tok/s prompt processing, at least 65K context, and a custom case. After testing Gemma 4, Qwen 3.6, and many quant variants, the author reports Gemma 4 26B A4B UD Q2_K_XL reaching 66K context and 10.21 tok/s near 60K context.
How an Agent Built a 3D Paris Gallery by Chaining Two Hugging Face Spaces★ 72
Hugging Face Blog5 days agoTutorial
This Hugging Face blog post demonstrates how AI agents can use Spaces as modular tools. By chaining an image generation Space with a 3D rendering Space, an agent automatically generated art assets and placed them inside a virtual 3D gallery. This highlights the power of Hugging Face's ecosystem, where any Space can serve as an API for agentic workflows.
Making Graphics Like it's 1993: Building the Catlantean 3D Retro Engine
Hacker News (AI keywords)5 days agoTutorial
This post kicks off a series on building Catlantean 3D, a retro engine replicating 1993 graphics technology. The author bypasses modern GPUs to implement pure CPU software rendering, fixed-point math, and 256-color palettes. It offers a fascinating look into early 3D algorithms like raycasting and affine texture mapping, serving as an educational resource for low-level graphics.
Learning to lead in a hybrid human-AI enterprise
MIT Tech Review AI5 days agoBusiness
MIT Technology Review says AI agent adoption could surge by as much as 300% over the next two years. Unlike traditional automation that depends on manual input, agents can autonomously coordinate complex tasks across tools and environments. The article frames this as a leadership challenge: organizations must rethink workflows, oversight, roles, and governance for hybrid human-AI enterprises.
Amazon employees ask Seattle to put the brakes on new data centers
The Verge AI5 days agoRegulation
Seattle’s City Council is set to vote on a one-year moratorium on new large-scale data centers after five projects were proposed in the city. Amazon employees, other tech workers, engineers, and residents testified in support, citing electricity demand, water use, noise, housing, transparency, and AI safety concerns. Supporters want stricter rules around renewable energy, public resource reporting, developer disclosure, and worker-led oversight.
GentleOS: A Classic Operating System with a Lovely Retro GUI
Hacker News (AI keywords)5 days agoRelease
GentleOS (gentleos32) is an open-source hobby operating system project on GitHub featuring a charming retro GUI. Developed by luke8086, it offers a nostalgic look at classic OS design and GUI implementation. It serves as an engaging resource for retro computing enthusiasts and low-level system developers.
TinySearch v0.2.0: Lightweight Open Web-Search Tool for Local LLMs Now Defaults to SearXNG
r/LocalLLaMA top day5 days agoRelease
TinySearch is a lightweight open-source MCP/FastAPI tool that crawls, chunks, and reranks web results into an 8k-token context blob for small local LLMs. Version 0.2.0 replaces DuckDuckGo with SearXNG as the default backend after DDG began rate-limiting and CAPTCHAing automated requests. Users can point it at a self-hosted SearXNG instance; it integrates with Cline, Roo, and OpenCode agent setups.
Five things you need to know about AI
MIT Tech Review AI5 days agoCommentary
The article is based on a talk titled “Five things you need to know about AI,” delivered at SXSW London. The author frames it as a guide to the biggest AI themes right now, drawing partly from MIT Technology Review’s first AI10 list. From the provided excerpt, it reads as a trend-oriented editorial overview rather than a product release, paper, or technical tutorial.
Cleaning up after AI rockstar developers
Hacker News (AI keywords)5 days agoOpinion
The post explores the phenomenon of "AI rockstar developers" who use AI tools to write code at breakneck speed. While appearing highly productive, they often introduce significant technical debt and architectural mess. The author highlights the growing burden on teams to clean up this AI-generated code, emphasizing the need for rigorous code review and architectural oversight.

← PreviousPage 12Next →

Latest in AI

Microsoft AI Chief Mustafa Suleyman Walks Back Comments on AI Replacing White-Collar Jobs

Warning: OpenCode Go/Zen Has No Account or Data Deletion Feature

Build a Basic AI Agent from Scratch: Long Task Planning

Single-slot half-height PCIe V100 with NVLink appears in China

Apple’s AI promises are finally, almost, sort of, here★ 72

Rick & Morty

Google Introduces Gemma 4 12B: A Unified, Encoder-Free Multimodal Model★ 85

PR-CAD: Progressive Refinement for Text-to-CAD Generation with LLMs

Google DeepMind Launches Initiative to Power the Future of Robotics in Europe★ 70

PSA: Throttle GPU Power Limits for Major Energy Savings with Minimal Inference Performance Loss

Sandstone Raises $30M Series A to Bring AI to In-House Legal Teams

Apple’s Best AI Idea Looks a Lot Like Vibe Coding★ 75

Apple Announced a New On-Device Inference Engine for Apple Silicon

Is Grep All You Need? How Agent Harnesses Reshape Agentic Search

Rust-native CPU-only LFM2.5-8B-A1B inference library "bebelm" published as cargo crate

Apple Says Its AI is Still Private Even When Running on Google's Servers★ 75

Lovable Hits $500M Annualized Revenue with 1 Million New Projects Weekly★ 82

Apple’s AI pitch will live or die by its privacy promise

How an e-scooter founder raised $5 million to build space data centers

Show HN: Gravity – Interactive Solar-System Simulator from Newton to Einstein

CSS: The Unavoidable Bad Parts

Jetson Orin NX Build for Hermes Agent + Benchmarking

How an Agent Built a 3D Paris Gallery by Chaining Two Hugging Face Spaces★ 72

Making Graphics Like it's 1993: Building the Catlantean 3D Retro Engine

Learning to lead in a hybrid human-AI enterprise

Amazon employees ask Seattle to put the brakes on new data centers

GentleOS: A Classic Operating System with a Lovely Retro GUI

TinySearch v0.2.0: Lightweight Open Web-Search Tool for Local LLMs Now Defaults to SearXNG

Five things you need to know about AI

Cleaning up after AI rockstar developers