Latest in AI

Showing:DevelopersClear ×

🔥 Trending today

open-source3 ai-policy2 anthropic2 privacy2 ipo1 ai-investment1 public-markets1 startup-ecosystem1 venture-capital1 national-security1

Topic

Release New Tool Tutorial Business Paper Benchmark Opinion Regulation

For

General Developers Designers Product Founders Marketing Researchers Students

llama.cpp Gemma4 MTP Support Merged
r/LocalLLaMA top day8 days agoRelease
llama.cpp PR #23398 was merged on June 7, 2026, adding MTP support for Gemma4 models. The author reports over 2x average speedup on dense models, no observed speedup on MoE, and replicated AIME-26 results around 87%. Support currently covers 31B and 26B-4B variants, while E4B and E2B are not supported yet; multi-GPU may need extra draft-device configuration.
LLMs are eroding my software engineering career and I don't know what to do
Hacker News (AI keywords)8 days agoOpinion
The author argues that LLMs are eroding three pillars of his software engineering career: domain knowledge, debugging skill, and architecture judgment. Tools like ChatGPT, Claude, Claude Code, Codex, MCP, Sentry MCP, and DataDog MCP increasingly handle design, implementation, and difficult production bugs. The essay frames this as a labor-market concern, not just a tooling debate: if expertise becomes promptable, engineers may struggle to remain differentiated.
Qwen 3.6 27B KV Cache Quantization Benchmarks: KVarN, Turbo, and TCQ Evaluated
r/LocalLLaMA top day8 days agoBenchmark
Reddit user Anbeeld shared comprehensive KV cache quantization benchmarks for Qwen 3.6 27B across 75 configuration pairs. Using BeeLlama.cpp (a custom llama.cpp fork), the test evaluates q8, q6, q5, and q4 quantization levels. It specifically highlights advanced implementations like KVarN, TurboQuant, and TCQ to optimize long-context inference efficiency.
Sponsor OpenAI Codex Voucher Usage for the OpenAI Challenge
Hugging Face Blog8 days agoTutorial
This Hugging Face Blog entry appears to relate to sponsor vouchers for the Build Small Hackathon, specifically OpenAI Codex voucher usage. Because the original body text is unavailable, details such as eligibility, value, deadlines, and supported tools cannot be confirmed. It is best treated as a likely participant guide rather than a major product announcement.
Show HN: Lathe - Use LLMs to learn a new domain, not skip past it
Hacker News (AI keywords)8 days agoNew Tool
Lathe is an open-source tool for generating hands-on technical tutorials with LLM skills. It combines a Go CLI, local reading UI, and commands for asking questions, extending tutorials, and verifying outputs. The project supports Claude Code, Cursor, and Codex workflows, with an emphasis on learning by typing and reasoning through the material yourself.
Dockerized Nemotron 3.5 ASR: Better Multilingual Support & Streaming (4.5x CPU Speed)
r/LocalLLaMA top day8 days agoNew Tool
A developer on Reddit shared a Dockerized implementation of Nemotron 3.5 ASR, migrating from Parakeet. The system supports over 40 languages and features a native streaming architecture that avoids full-file buffering. Using the onnxruntime-genai backend, it achieves 4.5x real-time speed on CPU, with CUDA support planned but untested.
Her · हेर — a detective for your Claude Code sessions
Hugging Face Blog8 days agoNew Tool
The title presents Her · हेर as a detective for Claude Code sessions. Because the article body is unavailable, its actual features, setup, and implementation details cannot be verified. Conservatively, it appears relevant to developers who want better visibility into what happened during AI-assisted coding sessions.
Clustering 3x Jetson Nano Orin Supers for Distributed AI
r/LocalLLaMA top day8 days agoTutorial
A developer has shared a practical guide on clustering three NVIDIA Jetson Nano Orin Super boards, leveraging their Ampere CUDA cores and unified memory. This project is part of 'smolcluster,' an initiative to make distributed AI training and inference accessible using everyday hardware like Macs, Raspberry Pis, and Jetsons. The series aims to explore whether heterogeneous clusters (mixing different hardware architectures) can effectively run local LLMs.
NVIDIA, KRAFTON, NC and T1 Celebrate RTX Spark at Korea’s PC Bangs
NVIDIA Blog8 days agoHardware
After unveiling RTX Spark at GTC Taipei during COMPUTEX, NVIDIA brought the platform to South Korea’s gaming community. Jensen Huang visited T1 Base Camp and PC bangs in Seoul to show how RTX Spark targets local AI, creation and high-performance gaming on slim Windows laptops and compact desktops. Demos included League of Legends, VALORANT, PUBG, Subnautica 2, CINDER CITY, AION 2 and an unreleased NVIDIA ACE-powered PUBG Ally character.
I design with Claude more than Figma now
Hacker News (AI keywords)8 days agoOpinion
Jane Street designer Edwin Morris describes moving from skepticism about LLMs to using Claude as a core design tool. Instead of relying mainly on specs and Figma mockups, he now builds working prototypes directly in the real codebase. The post also explores the collaboration risks: prototypes must remain disposable proposals, not finished features that shut reviewers out of design input.
Major P2P Issues in Israel and Possibly Other Middle East Countries
Hacker News (AI keywords)8 days agoIncident
A GitHub issue in ValveSoftware/GameNetworkingSockets reports major P2P issues affecting Israel and possibly other Middle East countries. No issue body was provided, so details such as root cause, versions, reproduction steps, and maintainer response are unknown. Developers using P2P networking should treat this as a regional connectivity incident worth monitoring, especially for games or real-time applications with Middle East users.
Show HN: Oproxy - inspect and modify network traffic from the browser
Hacker News (AI keywords)8 days agoNew Tool
Oproxy is a local HTTP, HTTPS, and SOCKS5 proxy with a browser-based management UI. It captures requests and responses, supports replay and Compose workflows, and can export HAR, cURL, Fetch, and Python snippets. Advanced features include HTTPS MITM, mock responses, throttling, breakpoints, DNS overrides, Lua scripts, and an OpenAI-compatible assistant for preparing confirmed proxy changes.
Tokenomics: Quantifying Where Tokens Are Used in Agentic Software Engineering
Hacker News (AI keywords)8 days agoPaper
This arXiv paper studies token consumption in LLM-based multi-agent software engineering. Using 30 ChatDev tasks with a GPT-5 reasoning model, the authors map internal phases to SDLC stages such as design, coding, review, testing, and documentation. Preliminary results suggest code review dominates token usage, averaging 59.4%, while input tokens form the largest share, pointing to inefficiencies in agent collaboration.
Passing DBs Through Continuations
Hacker News (AI keywords)8 days agoTutorial
The post explains how continuation-passing style can express database operators without materializing intermediate results. Using Prela and Julia examples, it shows list transformations, relational composition, product, scan, and probe being expanded through inlining. The result is modular query code that can compile into tight columnar loops, though the author notes assumptions around JIT cost and dense primary keys.
OpenAI unveils Lockdown Mode to protect sensitive data from prompt injection attacks★ 72
TechCrunch AI8 days agoRelease
OpenAI unveiled Lockdown Mode, a feature aimed at reducing the chance that sensitive data is shared during prompt injection attacks. The article notes that ChatGPT may still remain vulnerable even when the mode is enabled. That makes the feature a mitigation layer rather than a complete security guarantee, especially for teams handling private or business-critical information.
Sem: A Git-Based Primitive for Code Understanding, Not LSPs
Hacker News (AI keywords)8 days agoNew Tool
Sem is a CLI from Ataraxy Labs that layers semantic code understanding on top of Git. Instead of line-based diffs, it reports changed functions, classes, methods, and types. It offers diff, blame, impact, log, entities, and context commands, with JSON output and AI-oriented context generation, though its accuracy claims still need independent validation.
Five labs, five minds: building a multi-model finance drama on small models
Hugging Face Blog8 days agoCommentary
Based only on the title, the post likely describes a multi-model experiment where five model-like roles collaborate or clash in a finance-themed scenario. The emphasis appears to be on using small models rather than one large model, possibly to create a staged analytical or narrative experience. Without the article text, specific models, tools, architecture, and results cannot be verified.
Meta confirms thousands of Instagram accounts hacked via AI chatbot abuse★ 76
Hacker News (AI keywords)8 days agoIncident
Meta confirmed a vulnerability in Instagram’s AI-assisted account recovery system that let attackers redirect password reset links to attacker-controlled emails. At least 20,225 users were notified, with compromised accounts potentially exposing profile data, posts, direct messages, and activity. Meta says it has disabled the affected chatbot flow, removed the vulnerable code path, and asked impacted users to reset passwords through verified channels.
Job Searcher
Hugging Face Blog9 days agoNew Tool
Hugging Face Blog published a post titled “Job Searcher,” but no article body was provided here. Based on the title and URL context, it may be a Build Small Hackathon project related to job search or career assistance. Details such as model choice, implementation, features, evaluation, or availability cannot be confirmed from the supplied source text.
Police in England and Wales told to halt AI use in court statements
Hacker News (AI keywords)9 days agoRegulation
Based only on the headline, police in England and Wales have been told to halt AI use in court statements. The article text is unavailable, so the issuing authority, scope, rationale, and any specific incident cannot be confirmed. The topic points to broader concerns around accuracy, auditability, accountability, and procedural fairness when AI is used in legal or policing documents.
US House lawmakers release draft bill to prohibit state AI rules★ 72
Hacker News (AI keywords)9 days agoRegulation
Reuters’ headline indicates that US House lawmakers have released a draft bill focused on AI regulation. The key proposal appears to be prohibiting individual states from creating their own AI rules. Without the full article or bill text, details such as scope, sponsors, exemptions, enforcement, and legislative prospects cannot be confirmed.
Nvidia is proposing a beast of a CPU system for Windows PCs
Hacker News (AI keywords)9 days agoHardware
Based only on the title, Nvidia appears to be proposing a high-end CPU system for Windows PCs. That could signal deeper ambitions beyond GPUs and AI accelerators into the core PC platform. However, no article text is available, so the architecture, specs, partners, timing, and product positioning remain unconfirmed.
Meta Keeps Delaying the Release of Its New AI Model to Developers
Hacker News (AI keywords)9 days agoRelease
The WSJ reports that Meta has repeatedly delayed the developer release of a new AI model after previously signaling it would arrive “soon.” Public summaries say the delay has stretched for nearly two months, with no scheduled API launch date at the time of reporting. The story matters less as a benchmark claim and more as a signal about Meta’s AI execution, developer ecosystem strategy, and monetization timeline.
Here comes new Siri again
The Verge AI9 days agoCommentary
The Verge frames Apple as behind in AI, but argues that lagging may not be entirely bad. At WWDC, Apple appears ready to introduce the new Siri again after earlier Apple Intelligence promises slipped. The key question is whether Apple can turn AI into a reliable, system-level assistant experience rather than another generic chatbot feature set.
Persona Atlas: Mapping How Famous Minds Think
Hugging Face Blog9 days agoNew Tool
The title suggests Persona Atlas is a project focused on representing or exploring the thinking styles of famous figures. The source text is unavailable, so its format, methods, data, model use, and results cannot be verified. It may be relevant to persona modeling, AI role-play, conversational agents, or thought-style visualization, but the practical impact remains unclear without the full post.
LLM Research Papers: The 2026 List (January to May)
Ahead of AI (Raschka)9 days agoPaper
Sebastian Raschka compiles a curated reference list of LLM papers he bookmarked from January through May 2026. The list is not comprehensive, but organized around topics useful for future articles, lectures, code examples, and research work. Public sections emphasize reasoning, RL, efficient inference, long context, agent systems, tool use, coding agents, diffusion language models, and serving infrastructure.
Introduction – Rust for Python Programmers
Hacker News (AI keywords)9 days agoTutorial
This Hacker News item points to an introductory page for “Rust for Python Programmers” on Microsoft GitHub Pages. Based only on the title, it appears to be a learning resource designed to help Python developers approach Rust. No source content was provided, so details about chapters, examples, or coverage cannot be confirmed.
The Smart TV in Your Living Room Is a Node in the AI Scraping Economy★ 74
Hacker News (AI keywords)9 days agoEthics
Include Security examines how Bright Data’s SDK supplies residential proxy capacity through partner apps on phones and connected TVs. The post argues smart TVs are especially attractive because they are always powered, often on fast Wi-Fi, and rarely monitored. It details public configuration endpoints, peer tunnel behavior, telemetry, VPN visibility bypasses, bandwidth limits, and practical DNS or network-blocking defenses.
micropython-wasm 0.1a2
Simon Willison's Weblog9 days agoRelease
Simon Willison released micropython-wasm 0.1a2, with the main change being a new CLI. The CLI was added from issue #7 and was inspired while drafting a related post about MicroPython in a sandbox. Its purpose is to make the post's “Try it yourself” section easier to demonstrate and follow, especially for readers experimenting with Python, WebAssembly, and sandboxing.
Running Python code in a sandbox with MicroPython and WASM
Simon Willison's Weblog9 days agoNew Tool
Simon Willison describes his latest attempt to safely run Python plugin-style code inside his own applications. The alpha package micropython-wasm uses MicroPython compiled to WebAssembly, executed through the maintained wasmtime Python library. His goals include clean PyPI installation, CPU and memory limits, controlled file and network access, host functions, and reliable documentation.

← PreviousPage 17Next →

Latest in AI

llama.cpp Gemma4 MTP Support Merged

LLMs are eroding my software engineering career and I don't know what to do

Qwen 3.6 27B KV Cache Quantization Benchmarks: KVarN, Turbo, and TCQ Evaluated

Sponsor OpenAI Codex Voucher Usage for the OpenAI Challenge

Show HN: Lathe - Use LLMs to learn a new domain, not skip past it

Dockerized Nemotron 3.5 ASR: Better Multilingual Support & Streaming (4.5x CPU Speed)

Her · हेर — a detective for your Claude Code sessions

Clustering 3x Jetson Nano Orin Supers for Distributed AI

NVIDIA, KRAFTON, NC and T1 Celebrate RTX Spark at Korea’s PC Bangs

I design with Claude more than Figma now

Major P2P Issues in Israel and Possibly Other Middle East Countries

Show HN: Oproxy - inspect and modify network traffic from the browser

Tokenomics: Quantifying Where Tokens Are Used in Agentic Software Engineering

Passing DBs Through Continuations

OpenAI unveils Lockdown Mode to protect sensitive data from prompt injection attacks★ 72

Sem: A Git-Based Primitive for Code Understanding, Not LSPs

Five labs, five minds: building a multi-model finance drama on small models

Meta confirms thousands of Instagram accounts hacked via AI chatbot abuse★ 76

Job Searcher

Police in England and Wales told to halt AI use in court statements

US House lawmakers release draft bill to prohibit state AI rules★ 72

Nvidia is proposing a beast of a CPU system for Windows PCs

Meta Keeps Delaying the Release of Its New AI Model to Developers

Here comes new Siri again

Persona Atlas: Mapping How Famous Minds Think

LLM Research Papers: The 2026 List (January to May)

Introduction – Rust for Python Programmers

The Smart TV in Your Living Room Is a Node in the AI Scraping Economy★ 74

micropython-wasm 0.1a2

Running Python code in a sandbox with MicroPython and WASM