Latest in AI

Showing:DevelopersClear ×

🔥 Trending today

anthropic3 amazon3 ai-regulation2 government-policy2 export-controls2 open-source2 geopolitics2 python-packaging2 webassembly2 pyodide2

Topic

Release New Tool Tutorial Business Paper Benchmark Opinion Regulation

For

General Developers Designers Product Founders Marketing Researchers Students

Apple is fixing bill splitting with its new Siri in Camera feature
TechCrunch AI6 days agoRelease
Apple is working on a Siri in Camera feature aimed at simplifying bill splitting after meals. Users can point an iPhone at a restaurant bill, select what they ordered, and split the tab using Apple Cash. The provided source does not specify launch timing, regional availability, language support, or how the feature handles taxes, tips, or complex shared orders.
Ask HN: What are tools you have made for yourself since the advent of AI?
Hacker News (AI keywords)6 days agoCommentary
This Ask HN post invites the community to share tools they have built for themselves in the AI era. No original discussion content or replies were provided, so only the topic can be assessed. The likely value is inspirational rather than definitive: it may surface personal automation ideas, workflow hacks, and AI-assisted software experiments, but no specific tools or models can be confirmed from the title alone.
Siri AI
Hacker News (AI keywords)6 days agoRelease
Apple’s Apple Intelligence page presents Siri AI as a more capable assistant with natural conversations, personal context, cross-app actions, and a dedicated app. It also highlights Visual Intelligence across iPhone, iPad, Mac, and Apple Vision Pro, plus AI photo and image tools. Since the HN item provides only the title, this should be treated as a product preview rather than a technical deep dive.
When every other post is an AI benchmark, best-model question, or slop app
r/LocalLLaMA top day6 days agoCommentary
This r/LocalLLaMA post is a meme-like complaint about the subreddit’s recent content quality. The author points to repeated AI-generated benchmark reports, recurring “best model” questions, and hastily built apps or engines presented as groundbreaking. It is not a technical release or evidence-based analysis, but it reflects frustration with noise, hype, and low-effort AI-generated discussion in local model communities.
Apple’s long-awaited AI Siri overhaul is finally here★ 72
TechCrunch AI6 days agoRelease
TechCrunch reports that Apple’s long-awaited AI overhaul of Siri has arrived. The idea behind the new “Siri AI” is to shift Siri beyond a voice-controlled assistant into an AI companion that can do more. The provided article text does not specify concrete features, supported devices, rollout timing, or technical details.
Full Reverse Engineering of the TI-84 Plus Operating System
Hacker News (AI keywords)6 days agoHardware
This Hacker News item links to an article titled “Full Reverse Engineering of the TI-84 Plus Operating System.” Based on the provided material, the reliable takeaway is that it concerns reverse engineering the OS of Texas Instruments’ TI-84 Plus graphing calculator. The original text was not provided, so specific claims about methods, findings, code, memory layout, or security implications cannot be verified here.
Apple announces Siri AI and its next generation of Apple Intelligence★ 72
The Verge AI6 days agoRelease
Apple revealed a new round of AI features at WWDC, centered on a smarter and more personalized Siri. The announcement comes two years after Apple first outlined Apple Intelligence and a more capable Siri that The Verge says never fully materialized. Apple describes Siri AI as an entirely new version of Siri, with stronger conversational ability and broader capabilities.
LocalLLaMA post urges users not to join SpaceX, OpenAI, Anthropic IPOs
r/LocalLLaMA top day6 days agoOpinion
A popular r/LocalLLaMA post urges local LLM supporters not to invest in IPOs tied to SpaceX, OpenAI, or Anthropic. The author argues that frontier labs drive up demand and prices for GPUs, RAM, SSDs, HDDs, and NAS hardware, making local inference harder. The post also questions AI company valuations, but its claims are mostly opinion and speculation without cited evidence.
WWDC 2026: Everything announced on Siri, iOS 27, Apple Intelligence and more
TechCrunch AI6 days agoRelease
Apple’s WWDC 2026 kicked off at Apple Park with expected announcements around Siri, iOS 27, Apple Intelligence, and developer demos. The event is notable as Tim Cook’s last WWDC as CEO before John Ternus takes over on September 1. Early updates include Liquid Glass opt-in adjustments, iOS 27 support back to iPhone 11, and claimed speed gains for Photos, AirDrop, and multitasking.
Show HN: Gitdot – a better GitHub, open-source, anti-AI, written in Rust
Hacker News (AI keywords)6 days agoNew Tool
Gitdot appeared on Hacker News as a Show HN project claiming to be “a better GitHub.” The title says it is open-source, written in Rust, and explicitly anti-AI. No article body was provided, so details about features, licensing, deployment, maturity, and how it differs from GitHub cannot be confirmed from the source.
An Implementation of NanoQuant: A Flexible Binary Quantization Method
r/LocalLLaMA top day6 days agoNew Tool
A r/LocalLLaMA post presents an unofficial PyTorch implementation of NanoQuant, a 2026 post-training quantization method for dense transformers. The method factorizes weights into scaling vectors and binary matrices, then quantizes and fine-tunes blocks sequentially to reduce hardware requirements. Early Qwen3-0.6B and Qwen3-4B experiments are promising for base models, but instruct quality remains weak and highly dependent on calibration data.
I bundled a fully local LLM inside my Unity game
r/LocalLLaMA top day6 days agoRelease
A developer shared a Unity game, Simulation Simulator, that bundles a local LLM with no internet, cloud service, or API key required. The game is a campfire chat simulator about DMT, simulation theory, and a monitor-headed friend, with five endings driven by natural AI interaction. The author sees this as a path toward richer NPCs, while noting local TTS and translation are still too slow for smooth gameplay.
GitHub for Beginners: Answers to some common questions
GitHub Blog6 days agoTutorial
GitHub Blog published a beginner-focused article that points readers to answers for common GitHub-related questions. The provided excerpt is brief and does not specify which questions or workflows are covered. Its value is mainly educational, serving as an entry point for people who are new to GitHub rather than as a product launch or technical deep dive.
Xiaomi Claims 1,000+ TPS on a 1T Model Using a Standard 8-GPU Server★ 72
r/LocalLLaMA top day6 days agoBenchmark
Xiaomi announced MiMo-V2.5-Pro-UltraSpeed with TileRT, claiming over 1,000 tokens/s decode speed on a 1-trillion-parameter MoE model. The company says it runs on a single standard 8-GPU commodity node, not wafer-scale or SRAM-heavy specialized hardware. The claimed stack combines FP4 MoE expert quantization, DFlash speculative decoding, and TileRT low-latency inference kernels, but independent validation is still needed.
AI Is Slowing Down
Hacker News (AI keywords)6 days agoCommentary
The article argues generative AI must keep accelerating to justify massive data center, cloud, and GPU commitments. Zitron says OpenAI, Anthropic, hyperscalers, and NVIDIA depend on AI services reaching extraordinary revenue levels by 2029-2030. He points to token-based billing, weak ROI visibility, enterprise spending caps, and customer pushback as signs that demand may be cooling before the infrastructure bet can pay off.
Luce Spark: a 35B MoE on a 16 GB GPU, without the offload tax★ 72
r/LocalLLaMA top day6 days agoNew Tool
Luce Spark is an open-source MoE offload system for running 33B-35B A3B models on 16GB-class GPUs. It keeps frequently routed experts on GPU, stores the long tail in system RAM, and swaps cold experts through a bounded async cache. The author reports 13.3 GiB for Qwen3.6 35B-A3B and about 100 tok/s with Spark optimizations, but notes real 16GB GPU testing is still missing.
OpenEnv coordination expands to HF, PyTorch, Unsloth, Modal, and more
r/LocalLLaMA top day6 days agoNew Tool
OpenEnv is a tool for creating agentic execution environments such as terminals, browsers, or other systems an agent can interact with. The project will now be coordinated by a committee including Meta-PyTorch, Reflection, Unsloth, Modal, Prime Intellect, Nvidia, Mercor, Fleet AI, and Hugging Face. The post also lists many AI organizations supporting or adopting OpenEnv, positioning it as infrastructure for open-source agent training.
Lies we tell ourselves about email addresses
Hacker News (AI keywords)6 days agoOpinion
Developers routinely underestimate the complexity of email address validation, relying on oversimplified regex rules that fail on edge cases allowed by RFC standards. Common myths include case-insensitivity, deliverability implied by syntax validity, and uniqueness assumptions that break with subaddressing or dot-normalization. The article serves as a reminder that email, despite feeling familiar, remains a surprisingly deep protocol with many traps for the unwary.
[3090] Gemma4 QAT + MTP quick TPS numbers
r/LocalLLaMA top day6 days agoBenchmark
A r/LocalLLaMA user shared quick throughput numbers for Gemma4 QAT with MTP speculative decoding on an RTX 3090 24GB setup. They report roughly 1.2-1.8x TPS improvement, with Gemma 4 31B moving from about 40 tok/s to 70-80 tok/s. The author frames this as a rough benchmark, using 11 task categories and noting stochastic variation from temp 1.0.
Show HN: Performative-UI – a React component library of design tropes
Hacker News (AI keywords)6 days agoNew Tool
Performative-UI appeared on Hacker News as a Show HN project. Based on the title, it presents itself as a React component library for design tropes rather than a conventional neutral UI kit. Without the original article text, details such as component coverage, licensing, accessibility, installation, TypeScript support, and production readiness cannot be verified.
Microsoft’s AI chief says superintelligence is near, but won’t take your job
The Verge AI6 days agoBusiness
The Verge interviews Microsoft AI CEO Mustafa Suleyman about the company’s approach to advanced AI, superintelligence, AGI, OpenAI, and automation. His message is that more powerful AI systems are arriving soon, but Microsoft wants them to remain human-controlled and human-serving. The piece is less a product announcement than a window into Microsoft’s strategic framing of AI progress and job disruption.
mtmd adds video input support in llama.cpp★ 72
r/LocalLLaMA top day6 days agoRelease
ggml-org/llama.cpp merged PR #24269, adding video input support to mtmd through mtmd-cli and /chat/completions, which also enables the web UI path. The implementation invokes a locally installed ffmpeg subprocess instead of bundling codec support, and currently extracts visual frames only, with no audio support yet. It was tested with Qwen3-VL-2B in CLI and Gemma 4 E4B in web UI, making local multimodal video experiments more accessible.
"Chat is dead": OpenAI preps overhaul of ChatGPT★ 76
Ars Technica AI6 days agoBusiness
OpenAI is reportedly preparing the biggest ChatGPT overhaul since launch, shifting it beyond a chat interface toward a “super app” built around agents, coding tools, and third-party services. The move is tied to higher-margin revenue, enterprise customers, and a potential IPO. ChatGPT may become a gateway that steers its massive user base toward products like Codex, image generation, and partner apps.
Gemma 4 Chat Template now has preserve thinking
r/LocalLLaMA top day6 days agoRelease
A r/LocalLLaMA post notes that Gemma 4’s chat template now has “preserve thinking.” The linked discussion points to google/gemma-4-31B-it on Hugging Face, suggesting a template-level change rather than a new model release or benchmark. The original post does not provide detailed usage notes, defaults, compatibility information, or measured effects.
Launch HN: Intuned (YC S22) - Reliable browser automations as code
Hacker News (AI keywords)6 days agoNew Tool
Intuned, a YC S22 company, appeared on Hacker News as a Launch HN post. Based on the title, it focuses on helping teams build and run reliable browser automations as code. The original article content was not provided, so details such as architecture, AI features, supported languages, pricing, and concrete use cases cannot be verified.
The crash that vanished: control and emergence in a five-model economy
Hugging Face Blog6 days agoCommentary
With no source text provided, this can only be inferred from the title. The post appears to examine a five-model economy where a potential crash disappears under some form of control or changed system dynamics. Its likely relevance is in multi-agent or multi-model systems, where collective behavior can diverge from individual model behavior.
Zig by Example
Hacker News (AI keywords)6 days agoTutorial
Zig by Example is a GitHub tutorial project inspired by Go by Example. It introduces Zig through annotated examples covering syntax, types, control flow, errors, pointers, comptime, generics, allocation, testing, file I/O, JSON, the build system, and C interop. This is not an AI model or product launch, but it is useful learning material for developers exploring Zig 0.14.
What was your local daily driver for coding last week?
r/LocalLLaMA top day6 days agoCommentary
This r/LocalLLaMA post is a brief community poll asking users what their local coding daily driver was last week. The post asks commenters to share their favorite model and quant, but the provided text does not include poll options, results, or specific model names. Its value is mainly as a community signal for tracking local LLM coding preferences.
Turning Cloudflare’s threat indicators into real-time WAF rules
Cloudflare Blog6 days agoRelease
Cloudflare customers can now apply Cloudforce One threat intelligence inside the WAF to block high-risk traffic. New cf.intel fields let security teams automate protections based on specific threat actors and targeted industries. The update turns threat indicators into real-time enforcement signals, reducing the gap between intelligence and active blocking.
llama.cpp PR #24277 avoids KV cell copies in kv-cache
r/LocalLLaMA top day6 days agoRelease
ggml-org/llama.cpp merged PR #24277 by ggerganov, titled “kv-cache: avoid kv cells copies.” The Reddit post says the change improves MTP performance for Gemma-4 and was merged the previous day. It is available starting with the b9551 release, making it relevant for local inference users tracking llama.cpp performance updates.

← PreviousPage 11Next →

Latest in AI

Apple is fixing bill splitting with its new Siri in Camera feature

Ask HN: What are tools you have made for yourself since the advent of AI?

Siri AI

When every other post is an AI benchmark, best-model question, or slop app

Apple’s long-awaited AI Siri overhaul is finally here★ 72

Full Reverse Engineering of the TI-84 Plus Operating System

Apple announces Siri AI and its next generation of Apple Intelligence★ 72

LocalLLaMA post urges users not to join SpaceX, OpenAI, Anthropic IPOs

WWDC 2026: Everything announced on Siri, iOS 27, Apple Intelligence and more

Show HN: Gitdot – a better GitHub, open-source, anti-AI, written in Rust

An Implementation of NanoQuant: A Flexible Binary Quantization Method

I bundled a fully local LLM inside my Unity game

GitHub for Beginners: Answers to some common questions

Xiaomi Claims 1,000+ TPS on a 1T Model Using a Standard 8-GPU Server★ 72

AI Is Slowing Down

Luce Spark: a 35B MoE on a 16 GB GPU, without the offload tax★ 72

OpenEnv coordination expands to HF, PyTorch, Unsloth, Modal, and more

Lies we tell ourselves about email addresses

[3090] Gemma4 QAT + MTP quick TPS numbers

Show HN: Performative-UI – a React component library of design tropes

Microsoft’s AI chief says superintelligence is near, but won’t take your job

mtmd adds video input support in llama.cpp★ 72

"Chat is dead": OpenAI preps overhaul of ChatGPT★ 76

Gemma 4 Chat Template now has preserve thinking

Launch HN: Intuned (YC S22) - Reliable browser automations as code

The crash that vanished: control and emergence in a five-model economy

Zig by Example

What was your local daily driver for coding last week?

Turning Cloudflare’s threat indicators into real-time WAF rules

llama.cpp PR #24277 avoids KV cell copies in kv-cache