Latest in AI

Showing:ResearchersClear ×

🔥 Trending today

anthropic3 amazon3 ai-regulation2 government-policy2 export-controls2 open-source2 geopolitics2 python-packaging2 webassembly2 pyodide2

Topic

Release New Tool Tutorial Business Paper Benchmark Opinion Regulation

For

General Developers Designers Product Founders Marketing Researchers Students

SDSU Wired Its Dorms with 1,300 AI Cameras Without Telling Students
Hacker News (AI keywords)7 days agoEthics
San Diego State University reportedly deployed around 1,300 AI-enabled cameras across campus, including roughly 330 tied to student dorm areas. The controversy centers on whether students were adequately informed and whether residential common areas should be treated as ordinary surveillance zones. With no full article text provided, the strongest reading is that this is an AI governance and privacy incident, not a model or product launch story.
Gemma 4 31B FP8 Matches Claude Sonnet 4.6 Medium in Custom Benchmark★ 75
r/LocalLLaMA top day7 days agoBenchmark
A Reddit user shared benchmark results showing Google's Gemma 4 31B (FP8) performing on par with Claude Sonnet 4.6 Medium. The custom evaluation harness tested complex tasks including Neo4j Cypher queries, entity extraction, agentic tool calling, Python coding, and multi-vector retrieval synthesis. This highlights how quantized mid-sized open-source models are closing the gap with leading proprietary frontier models.
Jensen Huang Signs Korea Deals with SK Hynix, NAVER and Doosan
INSIDE 硬塞 AI7 days agoHardware
Nvidia announced partnerships with SK Hynix, NAVER and Doosan Group to bring its technology into AI data center projects in Korea. The collaboration also covers next-generation memory development, tying Nvidia more closely to Korea’s semiconductor and digital infrastructure ecosystem. The article does not specify investment size, deployment timeline or data center scale.
Best Local TTS Solution
r/LocalLLaMA top day7 days agoCommentary
A r/LocalLLaMA user says they have tested many local TTS tools, but none match ElevenLabs for expressiveness, voices, and cloning. They list moss-nano and Kokoro as the best edge-device candidates so far, with edgeTTS as a free/cloud option. The post asks for community experience connecting agents such as Hermes, openclaw, or opencode to Telegram voice notes or real-time voice conversations.
Algorithmic Monocultures in Hiring★ 78
Hacker News (AI keywords)7 days agoPaper
This study analyzes 3.4 million real applicants and 4 million applications across 156 U.S. employers. It finds position-level racial adverse impact that aggregate analysis can obscure, especially affecting Black and Asian applicants. The authors also show that reliance on a single vendor can create homogeneous outcomes and systemic rejections, calling for stronger audits, surveillance, and researcher access.
DeepSeek V4 Pro beats GPT-5.5 Pro on precision
Hacker News (AI keywords)7 days agoBenchmark
RuntimeWire compared DeepSeek V4 Pro and GPT-5.5 Pro across four fresh text tasks, with DeepSeek winning 38.0 to 33.0. The article highlights DeepSeek’s stronger handling of regex edge cases, workplace-update constraints, and exact JSON schema compliance. GPT-5.5 Pro remained capable, but lost points for avoidable deviations, extra process details, and minor structural mismatches.
A Matter Wi-Fi Light Bulb in Rust on the Raspberry Pi Pico 2 W
Hacker News (AI keywords)7 days agoHardware
This GitHub repository collects Rust Embassy examples for Raspberry Pi Pico 2 and Pico 2 W. Its Matter Wi-Fi light example uses rs-matter, BLE commissioning, and Wi-Fi connectivity so the board can appear as a standard smart bulb in Home Assistant, Apple Home, or Google Home. The project is mainly relevant to embedded Rust and smart-home developers, not AI model users.
The Open Source Community is backing OpenEnv for Agentic RL
Hugging Face Blog7 days agoCommentary
The title indicates that OpenEnv is being positioned around agentic reinforcement learning. The confirmed signal is community support from the open-source ecosystem, not specific technical claims. Without the full article, details such as contributors, features, integrations, benchmarks, or adoption status should be treated as unknown.
datasette-agent-edit 0.1a0
Simon Willison's Weblog7 days agoRelease
Simon Willison released datasette-agent-edit 0.1a0 as a base plugin for Datasette Agent. It is intended to support future plugins that edit existing text, including collaborative Markdown, large SQL queries, and SVG files. The design follows Claude’s text editor tool pattern, exposing view, str_replace, and insert primitives so other plugins can reuse a stricter editing workflow.
"Fully Hallucinated Operating System" Simulates an Entire OS via LLM Prompts
r/LocalLLaMA top day7 days agoCommentary
A popular Reddit post highlights a video demonstrating a "Fully Hallucinated Operating System" run entirely inside an LLM. By prompting the model to act as a terminal, it simulates file systems, network requests, and command execution purely through text generation. While impractical for production, this experiment showcases the impressive state-tracking and "world model" capabilities of modern LLMs.
How much do amd64 microarchitecture levels help in Go?
Hacker News (AI keywords)7 days agoBenchmark
Daniel Lemire tests Go’s GOAMD64 levels using Roaring Bitmaps on a modern Intel Xeon. v2 brings strong gains where popcnt matters, while v3 adds further speedups in dense bitmap and set-operation workloads through AVX2. v4, despite implying AVX-512 support, shows no meaningful improvement in these benchmarks, likely due to current Go compiler limitations.
Qwen 3.6 27B DeepSWE Benchmark Results Highlight Gap Between Local and Closed-Source Models
r/LocalLLaMA top day7 days agoBenchmark
A community benchmark of Qwen 3.6 27B on DeepSWE yielded a score of 1.79% (18/20th place), slightly outperforming Haiku 4.5. Run on a single RTX 6000 Blackwell GPU via vLLM with reasoning enabled, the test averaged 32 minutes and 44k output tokens per task. The author notes that while Qwen 3.6 27B represents a 'poor man's local SOTA,' the massive gap compared to frontier closed models suggests local LLMs are struggling to keep pace in complex coding.
Amazing Digital Dentures (a failed project)
Hugging Face Blog7 days agoCommentary
The post appears to discuss a project called “Amazing Digital Dentures,” explicitly framed as a failed project. Because the article body was not provided, the specific technical stack, models, tools, datasets, and reasons for failure cannot be verified. Based on the title and URL path, it may be a hackathon-style project retrospective focused on prototyping challenges and lessons learned.
Exploring 2-bit QAT: Can Ultra-Compressed Large Models Outperform 4-bit Models Half Their Size?
r/LocalLLaMA top day7 days agoCommentary
A popular Reddit thread on r/LocalLLaMA discusses the potential of 2-bit Quantization Aware Training (QAT) for large MoE models (120B to 400B). While current QAT efforts focus on 4-bit, users speculate whether a 2-bit QAT model could fit into consumer hardware (64GB/128GB RAM) and outperform a 4-bit model of half its size. This approach is proposed as a practical alternative to training ternary (1.58-bit) LLMs from scratch.
If LLMs Have Human-Like Attributes, Then So Does Age of Empires II
Hacker News (AI keywords)7 days agoPaper
The paper argues that claims about LLMs having human-like attributes, such as morality or language understanding, can be methodologically fragile. By building and training a simple neural network on Age of Empires II, the author suggests such attributes may not be empirically unique to LLMs. The key recommendation is to define explicit measurement criteria and use a null assumption of LLM non-uniqueness before drawing anthropomorphic conclusions.
Gemma-4-26B-A4B QAT Variant Performs Poorly in llama.cpp Compared to Non-QAT Version
r/LocalLLaMA top day7 days agoBenchmark
A LocalLLaMA user highlighted that the newly released QAT (Quantization-Aware Training) variant of Google's Gemma-4-26B-A4B model underperforms compared to its non-QAT predecessor. Testing via llama.cpp on a chessboard SVG generation task showed significant rendering errors in the QAT version. The non-QAT GGUF version, however, produced highly accurate results under identical settings.
Office-open-xml-viewer: Office XML document viewer rendering to HTML Canvas
Hacker News (AI keywords)7 days agoNew Tool
office-open-xml-viewer is an open-source browser viewer for Office Open XML documents, rendering DOCX, XLSX, and PPTX files to HTML Canvas. Its parsers are written in Rust and compiled to WebAssembly, while rendering uses the Canvas 2D API. The README also says the full codebase was implemented by Claude through iterative prompting, making it notable as an AI-assisted software development case.
Control 3D Avatars with Natural Language Using "Program as Weights" (programasweights)
r/LocalLLaMA top day7 days agoNew Tool
Developer Yuntian Deng introduced "programasweights," a framework that compiles plain-English descriptions into tiny, local action programs (loops, parallel tracks) to control 3D avatars. Instead of pre-defined buttons, users can command complex sequences like "wave while walking, then jump." The runtime code is open-source and runs entirely offline in the browser or via Python.
Iran Severely Damaged US Air Ops Center in Qatar Soon After War Began
Hacker News (AI keywords)7 days agoIncident
Air & Space Forces Magazine reports that multiple Iranian missiles hit the Combined Air Operations Center at Al Udeid Air Base in Qatar early in the U.S.-Iran war. The facility was reportedly not in use, no injuries were reported, and the air campaign continued from Shaw Air Force Base in South Carolina. The incident raises questions about rebuilding, hardening, dispersing, and networking forward command nodes under missile and drone threats.
sqlite: A CGo-free port of SQLite/SQLite3
Hacker News (AI keywords)7 days agoRelease
This project provides a CGo-free SQLite/SQLite3 implementation for Go, useful when developers want pure-Go builds and simpler cross-platform deployment. It keeps the familiar SQLite embedded database model while integrating with Go’s database/sql workflow. Recent releases upgraded SQLite, improved text/time scanning performance, added backup progress helpers, and expanded virtual table and sqlite-vec related support.
Anthropic, please ship an official Claude Desktop for Linux
Hacker News (AI keywords)7 days agoOpinion
The available source only provides the title, which asks Anthropic to ship an official Claude Desktop app for Linux. It appears to be a community feature request rather than a confirmed product announcement. Without the issue body or official response, there is no basis to infer Anthropic’s plans, timeline, or technical reasoning.
llama.cpp Gemma4 MTP Support Merged
r/LocalLLaMA top day7 days agoRelease
llama.cpp PR #23398 was merged on June 7, 2026, adding MTP support for Gemma4 models. The author reports over 2x average speedup on dense models, no observed speedup on MoE, and replicated AIME-26 results around 87%. Support currently covers 31B and 26B-4B variants, while E4B and E2B are not supported yet; multi-GPU may need extra draft-device configuration.
Qwen 3.6 27B KV Cache Quantization Benchmarks: KVarN, Turbo, and TCQ Evaluated
r/LocalLLaMA top day7 days agoBenchmark
Reddit user Anbeeld shared comprehensive KV cache quantization benchmarks for Qwen 3.6 27B across 75 configuration pairs. Using BeeLlama.cpp (a custom llama.cpp fork), the test evaluates q8, q6, q5, and q4 quantization levels. It specifically highlights advanced implementations like KVarN, TurboQuant, and TCQ to optimize long-context inference efficiency.
Show HN: Lathe - Use LLMs to learn a new domain, not skip past it
Hacker News (AI keywords)7 days agoNew Tool
Lathe is an open-source tool for generating hands-on technical tutorials with LLM skills. It combines a Go CLI, local reading UI, and commands for asking questions, extending tutorials, and verifying outputs. The project supports Claude Code, Cursor, and Codex workflows, with an emphasis on learning by typing and reasoning through the material yourself.
School shooting survivor sues AI gun detection firm after system failed
Ars Technica AI7 days agoIncident
A teen injured in a January 2025 Nashville high school shooting has sued Omnilert and reseller System Integrations. The lawsuit alleges the company knew or should have known its AI gun detection system could fail under real-world camera, lighting, angle, distance, and visibility limits. The case raises questions about marketing claims, public safety procurement, and accountability when AI security tools fail in emergencies.
Her · हेर — a detective for your Claude Code sessions
Hugging Face Blog7 days agoNew Tool
The title presents Her · हेर as a detective for Claude Code sessions. Because the article body is unavailable, its actual features, setup, and implementation details cannot be verified. Conservatively, it appears relevant to developers who want better visibility into what happened during AI-assisted coding sessions.
Clustering 3x Jetson Nano Orin Supers for Distributed AI
r/LocalLLaMA top day7 days agoTutorial
A developer has shared a practical guide on clustering three NVIDIA Jetson Nano Orin Super boards, leveraging their Ampere CUDA cores and unified memory. This project is part of 'smolcluster,' an initiative to make distributed AI training and inference accessible using everyday hardware like Macs, Raspberry Pis, and Jetsons. The series aims to explore whether heterogeneous clusters (mixing different hardware architectures) can effectively run local LLMs.
NVIDIA, KRAFTON, NC and T1 Celebrate RTX Spark at Korea’s PC Bangs
NVIDIA Blog7 days agoHardware
After unveiling RTX Spark at GTC Taipei during COMPUTEX, NVIDIA brought the platform to South Korea’s gaming community. Jensen Huang visited T1 Base Camp and PC bangs in Seoul to show how RTX Spark targets local AI, creation and high-performance gaming on slim Windows laptops and compact desktops. Demos included League of Legends, VALORANT, PUBG, Subnautica 2, CINDER CITY, AION 2 and an unreleased NVIDIA ACE-powered PUBG Ally character.
How Liminalism Became the Defining Aesthetic of Our Time
Hacker News (AI keywords)8 days agoCommentary
The article argues that Liminalism has become a major visual language for alienation, nostalgia, and late-capitalist unease. It traces the aesthetic from abandoned malls and The Backrooms to COVID-era empty-city imagery and older art-historical precedents such as Surrealism and Edward Hopper. It also notes that many liminal-space communities prohibit AI-generated images, favoring unsettling real-world found photography.
Tokenomics: Quantifying Where Tokens Are Used in Agentic Software Engineering
Hacker News (AI keywords)8 days agoPaper
This arXiv paper studies token consumption in LLM-based multi-agent software engineering. Using 30 ChatDev tasks with a GPT-5 reasoning model, the authors map internal phases to SDLC stages such as design, coding, review, testing, and documentation. Preliminary results suggest code review dominates token usage, averaging 59.4%, while input tokens form the largest share, pointing to inefficiencies in agent collaboration.

← PreviousPage 11Next →

Latest in AI

SDSU Wired Its Dorms with 1,300 AI Cameras Without Telling Students

Gemma 4 31B FP8 Matches Claude Sonnet 4.6 Medium in Custom Benchmark★ 75

Jensen Huang Signs Korea Deals with SK Hynix, NAVER and Doosan

Best Local TTS Solution

Algorithmic Monocultures in Hiring★ 78

DeepSeek V4 Pro beats GPT-5.5 Pro on precision

A Matter Wi-Fi Light Bulb in Rust on the Raspberry Pi Pico 2 W

The Open Source Community is backing OpenEnv for Agentic RL

datasette-agent-edit 0.1a0

"Fully Hallucinated Operating System" Simulates an Entire OS via LLM Prompts

How much do amd64 microarchitecture levels help in Go?

Qwen 3.6 27B DeepSWE Benchmark Results Highlight Gap Between Local and Closed-Source Models

Amazing Digital Dentures (a failed project)

Exploring 2-bit QAT: Can Ultra-Compressed Large Models Outperform 4-bit Models Half Their Size?

If LLMs Have Human-Like Attributes, Then So Does Age of Empires II

Gemma-4-26B-A4B QAT Variant Performs Poorly in llama.cpp Compared to Non-QAT Version

Office-open-xml-viewer: Office XML document viewer rendering to HTML Canvas

Control 3D Avatars with Natural Language Using "Program as Weights" (programasweights)

Iran Severely Damaged US Air Ops Center in Qatar Soon After War Began

sqlite: A CGo-free port of SQLite/SQLite3

Anthropic, please ship an official Claude Desktop for Linux

llama.cpp Gemma4 MTP Support Merged

Qwen 3.6 27B KV Cache Quantization Benchmarks: KVarN, Turbo, and TCQ Evaluated

Show HN: Lathe - Use LLMs to learn a new domain, not skip past it

School shooting survivor sues AI gun detection firm after system failed

Her · हेर — a detective for your Claude Code sessions

Clustering 3x Jetson Nano Orin Supers for Distributed AI

NVIDIA, KRAFTON, NC and T1 Celebrate RTX Spark at Korea’s PC Bangs

How Liminalism Became the Defining Aesthetic of Our Time

Tokenomics: Quantifying Where Tokens Are Used in Agentic Software Engineering