Latest in AI

Showing:DevelopersClear ×

🔥 Trending today

open-source3 anthropic3 amazon3 ai-regulation2 government-policy2 export-controls2 geopolitics2 privacy2 python-packaging2 webassembly2

Topic

Release New Tool Tutorial Business Paper Benchmark Opinion Regulation

For

General Developers Designers Product Founders Marketing Researchers Students

Rust-native CPU-only LFM2.5-8B-A1B inference library "bebelm" published as cargo crate
r/LocalLLaMA top day5 days agoNew Tool
Community developer maximecb has published bebelm, a Rust-native, GPU-free inference implementation of Liquid AI's LFM2.5-8B-A1B model, available on crates.io. Decode speed reaches ~37 tokens/s on a Ryzen 7950x with ~7GB memory footprint; prefill is unoptimized and currently similar in speed to decode. The library supports tool-use callbacks, weight sharing across multiple Agent instances with independent KV caches, and Agent cloning to skip repeated prefill on shared prompts.
Apple Says Its AI is Still Private Even When Running on Google's Servers★ 75
Ars Technica AI5 days agoEthics
Apple clarified that running some of its AI models on Google's cloud infrastructure does not compromise user privacy. Through its Private Cloud Compute (PCC) architecture, Apple ensures that all data is processed in secure enclaves with end-to-end encryption. Consequently, Google has zero access to user data, addressing privacy concerns over Apple's cloud partnerships.
Lovable Hits $500M Annualized Revenue with 1 Million New Projects Weekly★ 82
TechCrunch AI5 days agoBusiness
AI software development platform Lovable has surpassed $500 million in annualized run-rate revenue (ARR). The company reports that users are now launching over 1 million new projects per week on the platform. This rapid growth highlights a major shift, with users increasingly leveraging AI to build full-scale businesses and replace legacy internal software.
Apple’s AI pitch will live or die by its privacy promise
The Verge AI5 days agoCommentary
The Verge argues Apple’s WWDC 2026 AI strategy centers on privacy rather than raw capability. Apple says Siri AI and Apple Intelligence will run on-device when possible and use Private Cloud Compute only when needed. But reliance on Google Gemini, Google Cloud, Nvidia, Intel, and Google Titan hardware complicates Apple’s original privacy story, even if its default data collection remains more limited than rivals.
Show HN: Gravity – Interactive Solar-System Simulator from Newton to Einstein
Hacker News (AI keywords)5 days agoNew Tool
Gravity is an interactive, web-based solar system simulator that lets users explore celestial mechanics in their browser. It uniquely bridges classical Newtonian physics and Einstein's general relativity, allowing users to visualize and compare orbital behaviors under different gravitational models. It serves as an engaging educational tool for physics enthusiasts and students alike.
CSS: The Unavoidable Bad Parts
Hacker News (AI keywords)5 days agoTutorial
The post frames CSS as learnable in a useful subset, but full of surprising defaults and edge cases. It covers semantic HTML, wrappers, layout, browser defaults, resets, classless CSS, selectors, box sizing, margins, flexbox, responsiveness, pixels, font sizing, line height, and word breaking. The advice is pragmatic: keep markup semantic, reset inconsistent defaults, understand layout constraints, and test readability across configurations.
Jetson Orin NX Build for Hermes Agent + Benchmarking
r/LocalLLaMA top day5 days agoHardware
The post describes turning an unused Jetson Orin NX into a compact local LLM server for Hermes Agent testing. The goals were low noise, over 10 tok/s generation, 300 tok/s prompt processing, at least 65K context, and a custom case. After testing Gemma 4, Qwen 3.6, and many quant variants, the author reports Gemma 4 26B A4B UD Q2_K_XL reaching 66K context and 10.21 tok/s near 60K context.
How an Agent Built a 3D Paris Gallery by Chaining Two Hugging Face Spaces★ 72
Hugging Face Blog5 days agoTutorial
This Hugging Face blog post demonstrates how AI agents can use Spaces as modular tools. By chaining an image generation Space with a 3D rendering Space, an agent automatically generated art assets and placed them inside a virtual 3D gallery. This highlights the power of Hugging Face's ecosystem, where any Space can serve as an API for agentic workflows.
Making Graphics Like it's 1993: Building the Catlantean 3D Retro Engine
Hacker News (AI keywords)5 days agoTutorial
This post kicks off a series on building Catlantean 3D, a retro engine replicating 1993 graphics technology. The author bypasses modern GPUs to implement pure CPU software rendering, fixed-point math, and 256-color palettes. It offers a fascinating look into early 3D algorithms like raycasting and affine texture mapping, serving as an educational resource for low-level graphics.
Amazon employees ask Seattle to put the brakes on new data centers
The Verge AI5 days agoRegulation
Seattle’s City Council is set to vote on a one-year moratorium on new large-scale data centers after five projects were proposed in the city. Amazon employees, other tech workers, engineers, and residents testified in support, citing electricity demand, water use, noise, housing, transparency, and AI safety concerns. Supporters want stricter rules around renewable energy, public resource reporting, developer disclosure, and worker-led oversight.
GentleOS: A Classic Operating System with a Lovely Retro GUI
Hacker News (AI keywords)5 days agoRelease
GentleOS (gentleos32) is an open-source hobby operating system project on GitHub featuring a charming retro GUI. Developed by luke8086, it offers a nostalgic look at classic OS design and GUI implementation. It serves as an engaging resource for retro computing enthusiasts and low-level system developers.
TinySearch v0.2.0: Lightweight Open Web-Search Tool for Local LLMs Now Defaults to SearXNG
r/LocalLLaMA top day5 days agoRelease
TinySearch is a lightweight open-source MCP/FastAPI tool that crawls, chunks, and reranks web results into an 8k-token context blob for small local LLMs. Version 0.2.0 replaces DuckDuckGo with SearXNG as the default backend after DDG began rate-limiting and CAPTCHAing automated requests. Users can point it at a self-hosted SearXNG instance; it integrates with Cline, Roo, and OpenCode agent setups.
Five things you need to know about AI
MIT Tech Review AI5 days agoCommentary
The article is based on a talk titled “Five things you need to know about AI,” delivered at SXSW London. The author frames it as a guide to the biggest AI themes right now, drawing partly from MIT Technology Review’s first AI10 list. From the provided excerpt, it reads as a trend-oriented editorial overview rather than a product release, paper, or technical tutorial.
Cleaning up after AI rockstar developers
Hacker News (AI keywords)5 days agoOpinion
The post explores the phenomenon of "AI rockstar developers" who use AI tools to write code at breakneck speed. While appearing highly productive, they often introduce significant technical debt and architectural mess. The author highlights the growing burden on teams to clean up this AI-generated code, emphasizing the need for rigorous code review and architectural oversight.
NeuroBait: I fine-tuned a model to spark dopamine for ADHD brain
Hugging Face Blog5 days agoNew Tool
NeuroBait is a Hugging Face community project built to help with ADHD task-initiation freeze rather than diagnosis or to-do planning. It fine-tunes google/gemma-3-12b-it with LoRA to produce short, warm, context-aware nudges. The project uses Unsloth and Modal for training, then deploys on a Hugging Face Space with Gradio, transformers, peft, and a runtime LoRA adapter.
Amap Releases ABot-Earth 0.5: Shifting from 2D Distillation to 3D Native for Consistent Scene Generation★ 70
量子位 QbitAI5 days agoRelease
Amap has released ABot-Earth 0.5, its latest spatial intelligence model. Moving beyond traditional 2D distillation methods (like Score Distillation Sampling), the model adopts a 3D native driving architecture. This breakthrough addresses multi-view inconsistency and distortion, enabling highly consistent 3D scene generation for autonomous driving simulation, smart cities, and digital twin mapping.
DeepSeek hires IDC planners, hinting at MW-to-GW data center ambitions
量子位 QbitAI5 days agoHardware
QbitAI reports that DeepSeek has listed an IDC design and planning engineer role covering data center campuses, power, cooling, networking, and capacity planning. The job description mentions participation in MW-to-GW-scale infrastructure and technologies such as dense GPU clusters, liquid cooling, smart operations, and digital twins. The article interprets this as a sign that DeepSeek may be moving beyond rented compute toward self-built AI infrastructure.
Is a New Player Joining China’s Top-Tier General AI Models?
量子位 QbitAI5 days agoCommentary
Based only on the title, the article likely examines China’s domestic general-purpose AI model landscape and asks whether a new company or model is entering the top tier. It appears to be an industry observation rather than a technical paper or tutorial. Without the full text, the specific model, company, benchmark evidence, and business context cannot be verified.
Xiaohongshu Is Growing a GitHub for AI Skills
量子位 QbitAI5 days agoNew Tool
QbitAI reports that Xiaohongshu is testing RED Skill, letting creators attach AI Skills directly under posts. Users can open a Skill page and copy it into assistants such as Codex, Claude Code, or OpenClaw. Nearly 1,000 original Skills have appeared during testing, spanning PPTs, interviews, papers, fitness, travel, and lifestyle use cases, with broader creator rollout expected in July.
A 4B Edge-Deployable Cognitive Model Built in China
量子位 QbitAI5 days agoRelease
QbitAI’s headline says a domestic Chinese team has built a 4B-parameter “cognitive model” suitable for edge deployment. The framing links it to a model direction previously associated with Andrej Karpathy. Since the article body was not provided, details such as the model name, architecture, benchmark results, hardware requirements, open-source status, and licensing remain unverified.
Tencent Wants Enterprises to Access AI in Only One Way
量子位 QbitAI5 days agoBusiness
The original article text is unavailable, so this can only be inferred from the headline. It likely discusses Tencent’s attempt to make enterprise AI adoption revolve around a single platform, entry point, or workflow. The key implication is business-strategic rather than technical: enterprise AI competition may be shifting from standalone models to integrated, managed platforms.
Voice AI for Greece
ElevenLabs Blog5 days agoBusiness
ElevenLabs published a blog post titled “Voice AI for Greece” on June 9, 2026. Without the article body, the confirmed scope is limited to ElevenLabs, Voice AI, and a Greece-related context. It may be relevant to readers tracking multilingual voice generation, localization, and regional AI adoption, but no specific feature, partnership, or model claim can be verified from the title alone.
Microsoft's open source tools were hacked to steal passwords of AI developers★ 78
Hacker News (AI keywords)5 days agoIncident
Microsoft temporarily removed several open source GitHub projects while investigating suspected malicious content. The affected repos were linked to Azure and developer workflows involving AI coding tools such as Claude Code, Gemini CLI, and VS Code. Security researchers said the malware could steal passwords and sensitive credentials when compromised tools were opened, though Microsoft has not disclosed how many users were affected.
Claude Fable 5 Now Available on Vercel AI Gateway
Vercel Changelog5 days agoRelease
Vercel has added Claude Fable 5 to its AI Gateway, enabling developers to call Anthropic's newest flagship model through a unified API proxy without managing separate credentials. AI Gateway handles cross-provider routing, monitoring, and cost tracking — upgrading to Fable 5 requires only a model ID change. This routine availability update lowers the barrier for Vercel-hosted AI apps to adopt the latest model capabilities in production.
FrontierCode: Benchmarking for Code Quality over Slop
Latent Space5 days agoBenchmark
Latent Space briefly announced FrontierCode with the line “We made a thing!” From the title, FrontierCode appears to be a benchmark for frontier coding systems that prioritizes code quality rather than sheer code generation volume. The provided excerpt does not include methodology, model results, datasets, or tooling details, so conclusions should remain cautious.
Defending Against Frontier Cyber Models: Cloudflare's Project Glasswing Architecture★ 70
Cloudflare Blog5 days agoCommentary
Cloudflare introduces its defense architecture under Project Glasswing, arguing that robust architectural defense around vulnerabilities is more critical than patching speed. By acting as its own "customer zero," Cloudflare demonstrates how to mitigate autonomous frontier cyber models through edge-based isolation, zero-trust principles, and proactive traffic filtering.
L'Affaire Siloxane
Hacker News (AI keywords)5 days agoCommentary
Pinboard founder and prominent tech critic Maciej Cegłowski published a piece titled in the style of historical French scandals, suggesting a serious controversy worth scrutiny. The word 'Siloxane' — a silicon-oxygen chemical compound and basis of silicone — likely serves as a metaphor or pseudonym for a tech or AI entity. Original article content was unavailable; details must be confirmed by reading the source directly.
Anyone seen benchmarks comparing Gemma 4 4-bit QAT vs. 8-bit standard quants?
r/LocalLLaMA top day5 days agoBenchmark
A r/LocalLLaMA user is looking for benchmarks comparing Gemma 4 4-bit QAT models, via Unsloth, against standard 8-bit non-QAT quantized models. They understand QAT is expected to preserve much of the BF16 baseline accuracy, but want hard numbers against traditional 8-bit PTQ. The post highlights scattered feedback but no clear head-to-head evaluation yet.
ggml-webgpu improves prefill speeds for k-quants in llama.cpp PR
r/LocalLLaMA top day6 days agoBenchmark
llama.cpp PR #24225 improves ggml-webgpu matrix multiplication performance for k-quants and refactors matmul paths for Q4/Q5/Q8 and k-quants. In pp512 tests on an M2 Pro, reported speedups range from about 1.33x to 3.78x across Q2_K, Q3_K, Q4_K, Q5_K, and Q6_K. The largest gains appear on Q3_K models, including Qwen and Gemma examples.
Why Apple’s slow-and-steady AI bet is starting to look pretty smart
TechCrunch AI6 days agoCommentary
The piece revisits criticism that Apple has fallen behind in the AI race, especially around Siri and Apple Intelligence. It argues that Apple’s slower approach could look smarter as the industry moves beyond flashy demos toward reliable, integrated user experiences. The key idea is that Apple’s ecosystem, device control, privacy positioning, and developer reach may matter more than racing to ship standalone AI chatbots.

← PreviousPage 9Next →

Latest in AI

Rust-native CPU-only LFM2.5-8B-A1B inference library "bebelm" published as cargo crate

Apple Says Its AI is Still Private Even When Running on Google's Servers★ 75

Lovable Hits $500M Annualized Revenue with 1 Million New Projects Weekly★ 82

Apple’s AI pitch will live or die by its privacy promise

Show HN: Gravity – Interactive Solar-System Simulator from Newton to Einstein

CSS: The Unavoidable Bad Parts

Jetson Orin NX Build for Hermes Agent + Benchmarking

How an Agent Built a 3D Paris Gallery by Chaining Two Hugging Face Spaces★ 72

Making Graphics Like it's 1993: Building the Catlantean 3D Retro Engine

Amazon employees ask Seattle to put the brakes on new data centers

GentleOS: A Classic Operating System with a Lovely Retro GUI

TinySearch v0.2.0: Lightweight Open Web-Search Tool for Local LLMs Now Defaults to SearXNG

Five things you need to know about AI

Cleaning up after AI rockstar developers

NeuroBait: I fine-tuned a model to spark dopamine for ADHD brain

Amap Releases ABot-Earth 0.5: Shifting from 2D Distillation to 3D Native for Consistent Scene Generation★ 70

DeepSeek hires IDC planners, hinting at MW-to-GW data center ambitions

Is a New Player Joining China’s Top-Tier General AI Models?

Xiaohongshu Is Growing a GitHub for AI Skills

A 4B Edge-Deployable Cognitive Model Built in China

Tencent Wants Enterprises to Access AI in Only One Way

Voice AI for Greece

Microsoft's open source tools were hacked to steal passwords of AI developers★ 78

Claude Fable 5 Now Available on Vercel AI Gateway

FrontierCode: Benchmarking for Code Quality over Slop

Defending Against Frontier Cyber Models: Cloudflare's Project Glasswing Architecture★ 70

L'Affaire Siloxane

Anyone seen benchmarks comparing Gemma 4 4-bit QAT vs. 8-bit standard quants?

ggml-webgpu improves prefill speeds for k-quants in llama.cpp PR

Why Apple’s slow-and-steady AI bet is starting to look pretty smart