Latest in AI

Showing: on-device-ai

iOS 27, Apple Intelligence, and Siri AI: Supported Device Notes
INSIDE 硬塞 AI · 2 days ago · Hardware
INSIDE’s brief compatibility note says device support for Apple Intelligence is almost identical to that for Siri AI. It highlights one exception, however: some features need a more advanced on-device model. Those higher-end Siri AI capabilities are currently supported only on iPhone 17 Pro, iPhone 17 Pro Max, and iPhone Air.
Benchmarking Google Eloquent Exposes Major On-Device Dictation Reliability Issues
r/LocalLLaMA top day · 3 days ago · Benchmark
A LocalLLaMA user tried to benchmark Google’s new fully local dictation app, Eloquent, against open ASR models such as Qwen3-ASR and NVIDIA Parakeet V3. The tester reported that roughly half of dictations returned only fragments, even during manual use. When Eloquent produced complete transcripts, its word error rate was competitive, but the missing-output behavior made the app unreliable for evaluation and practical use.
Reddit Debate: Apple and Microsoft Push Local-First AI
r/LocalLLaMA top day · 4 days ago · Opinion
A Reddit user claims Apple and Microsoft have both made strong moves toward local-first AI, pointing to Apple Core AI materials and Microsoft Surface Laptop Ultra announcements. The post argues that Apple’s emphasis on local, private, no-cost AI and Microsoft’s Surface/Nvidia direction could reshape expectations for consumer hardware. However, it is an opinion-driven market prediction, not a confirmed financial or technical analysis.
Apple Announced a New On-Device Inference Engine for Apple Silicon
r/LocalLLaMA top day · 5 days ago · Release
Apple announced Core AI at WWDC, which the post frames as a possible future replacement for Core ML and an alternative to MLX, llama.cpp, and PyTorch for optimized on-device inference. Models still need to be converted with Python scripts, and the currently supported models appear to date mostly from mid-2025. No performance data is available yet; the author expects it may trail MLX on GPU, but Apple’s claim of a 20B on-device foundation model suggests that larger app-bundled models could become possible.
Siri AI at WWDC 2026 ★ 72
Simon Willison's Weblog · 5 days ago · Commentary
Simon Willison says Apple’s 2024 Apple Intelligence rollout made him cautious, so he will believe the WWDC 2026 Siri AI claims only after seeing results. He notes the new features look more feasible, especially with a custom Gemini-derived model running on Private Cloud Compute. He also highlights vision LLM screen understanding and the new Core AI library for running PyTorch-derived models on Apple hardware.
Apple Core AI Framework ★ 76
Hacker News (AI keywords) · 5 days ago · Release
Apple’s Core AI framework is positioned as a developer stack for deploying AI models directly inside apps on Apple silicon. The documentation describes Swift APIs, `.aimodel` assets, model specialization, caching, Xcode profiling, and debugging tools. It appears aimed at developers building low-latency, privacy-conscious on-device inference workflows, though the documentation is marked as preliminary beta information.
Launch HN: General Instinct (YC P26) - Frontier models on edge devices
Hacker News (AI keywords) · 9 days ago · New Tool
General Instinct is a YC P26 company introduced through a Launch HN post. Its headline positioning is bringing frontier models to edge devices, suggesting local or embedded AI deployment rather than purely cloud-based inference. Since no article body is available, details such as supported models, hardware, benchmarks, pricing, and developer tooling cannot be verified from the provided source.
Gemma 4 QAT models: Optimizing model compression for mobile and laptop efficiency ★ 72
Hacker News (AI keywords) · 9 days ago · Release
Google released new Gemma 4 checkpoints optimized with Quantization-Aware Training to preserve quality after compression. The release includes Q4_0 checkpoints and a mobile-focused quantization format that can reduce Gemma 4 E2B memory use to about 1GB, or below 1GB for a text-only configuration. The models are available through Hugging Face and supported across llama.cpp, Ollama, LM Studio, LiteRT-LM, Transformers.js, SGLang, vLLM, MLX, and Unsloth.
Apple working to cram massive Gemini model into iPhone to power new Siri
Ars Technica AI · 17 days ago · Business
Ars Technica reports that Apple is working to compress Google’s massive Gemini model so it can run on iPhone and power a new Siri experience. The short summary emphasizes a key constraint: even with on-device ambitions, a cloud component is probably inevitable. Details remain limited, so the report is best read as a signal about Apple’s AI direction rather than a confirmed product launch.
Hugging Face Releases swift-huggingface: A Complete Hugging Face Client SDK for Swift Developers ★ 75
Hugging Face Blog · 191 days ago · Release
Hugging Face has officially released `swift-huggingface`, a complete Swift client SDK designed specifically for the Apple ecosystem (including iOS, macOS…
Google DeepMind Announces Nano Banana Pro: A New Lightweight Model Focused on On-Device and Edge Computing (Speculative)
Google DeepMind Blog · 206 days ago · Release
Google DeepMind published a blog post on November 20, 2025 titled "Introducing Nano Banana Pro." As the full content of the original article is not publicly…
Introducing AnyLanguageModel: A Unified API for Local and Remote LLMs on Apple Platforms ★ 75
Hugging Face Blog · 206 days ago · New Tool
Hugging Face has officially launched a new open-source Swift framework called "AnyLanguageModel," designed to address the pain points faced by developers on…
Granite 4.0 Nano: Exploring the Limits of On-Device AI, Just How Small Can a Model Get? ★ 75
Hugging Face Blog · 229 days ago · Release
This article, jointly published by IBM and Hugging Face, delves into the technical details and application scenarios of the brand-new ultra-lightweight model…
Swift Transformers Reaches 1.0: Opening a New Future for Local AI on Apple Platforms ★ 75
Hugging Face Blog · 261 days ago · Release
Hugging Face has announced that `swift-transformers`, its open-source library designed specifically for the Apple ecosystem, has officially reached the stable…
Arm and ExecuTorch 0.7 Team Up: Bringing Generative AI to the Mass Market ★ 80
Hugging Face Blog · 305 days ago · Release
As generative AI advances rapidly, deploying massive models to resource-constrained edge devices — such as smartphones, smart hardware, and AI PCs — has become…
Hugging Face Releases SmolLM3: A Lightweight, Multilingual, Long-Context Reasoning Model for On-Device Use ★ 80
Hugging Face Blog · 341 days ago · Release
Hugging Face has announced the release of a brand-new generation of lightweight open-source models — SmolLM3. As the latest member of the SmolLM family…
Falcon-Edge: TII Releases a Series of Powerful, Universal, Fine-Tunable 1.58-bit Language Models for the Edge ★ 82
Hugging Face Blog · 395 days ago · Release
The Technology Innovation Institute (TII) of the United Arab Emirates has officially released the "Falcon-Edge" model series on Hugging Face. This is a family…
Hugging Face Releases SmolVLM: A Lightweight yet Powerful Open-Source Vision Language Model That Runs Efficiently on Local Hardware ★ 80
Hugging Face Blog · 565 days ago · Release
Hugging Face has officially launched a lightweight vision language model (VLM) called **SmolVLM**, designed to bring powerful multimodal understanding…
Running Stable Diffusion with Core ML on Apple Silicon ★ 75
Hugging Face Blog · 1,291 days ago · Release
In late 2022, Apple and Hugging Face jointly announced that Stable Diffusion had officially gained support for Apple Silicon's Core ML framework. This update…
