Latest in AI

Bonsai LM 1-bit and 1.58-bit Benchmarks on Jetson Orin Nano Super
r/LocalLLaMA top day · 4 days ago · Benchmark
A LocalLLaMA post benchmarks five Bonsai LM models, from 1.7B to about 8B parameters, on a $250 Jetson Orin Nano Super 8GB using llama.cpp CUDA. The tests compare 7W, 15W, 25W, and MAXN modes across latency, throughput, energy per token, and thermals. The main takeaway is that 25W is usually the best efficiency/performance point for models up to 4B, while Bonsai-8B may favor 15W for lower power.
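Energy per token, one of the metrics compared above, is average power draw divided by throughput. A minimal sketch of that arithmetic follows, assuming power is measured in watts and throughput in tokens per second. The function names and the sample numbers are hypothetical placeholders, not figures from the post, and measured draw can differ from a power mode's nominal cap.

```python
def energy_per_token_joules(avg_power_watts: float, tokens_per_second: float) -> float:
    """Joules spent per generated token: watts (J/s) divided by tokens/s."""
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return avg_power_watts / tokens_per_second


def most_efficient_mode(measurements: dict[str, tuple[float, float]]) -> str:
    """Return the mode with the lowest energy per token.

    `measurements` maps a mode label to (avg_power_watts, tokens_per_second).
    """
    return min(
        measurements,
        key=lambda mode: energy_per_token_joules(*measurements[mode]),
    )


# Placeholder values for illustration only; NOT results from the benchmark.
sample = {
    "mode_a": (10.0, 5.0),   # 10 W at 5 tok/s  -> 2.0 J/token
    "mode_b": (20.0, 16.0),  # 20 W at 16 tok/s -> 1.25 J/token
}

for mode, (watts, tps) in sample.items():
    print(f"{mode}: {energy_per_token_joules(watts, tps):.2f} J/token")
print("most efficient:", most_efficient_mode(sample))
```

A higher power mode can still win on this metric when throughput rises faster than power draw, which is the kind of trade-off the post's comparison is about.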
From Desk-Side to Data Center: Leadtek Showcases On-Prem Agentic AI Computing Strategy at COMPUTEX 2026
INSIDE 硬塞 AI · 4 days ago · Hardware
The article says enterprise AI adoption is entering a new phase as security concerns, cloud latency, and model changes push compute needs on premises. At COMPUTEX 2026, Leadtek presented an AI compute spectrum from factory edge environments to data centers. The focus is helping companies keep tighter control over confidential data and inference responsiveness in agentic AI workloads.
Introducing Mistral 3 (★ 84)
Mistral AI News · 6 days ago · Release
Mistral AI introduced Mistral 3, a new open model family under Apache 2.0. It includes Mistral Large 3, a 675B-parameter sparse MoE with 41B active parameters, plus Ministral 3 models at 3B, 8B, and 14B. The release targets frontier open-weight use, multimodal and multilingual workflows, enterprise customization, and efficient local or edge deployments.
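As a rough illustration of what the sparse MoE figures above imply, the share of parameters active per token can be computed directly from the two numbers quoted in the summary. This is a sketch of the arithmetic only; the function name is hypothetical, and it says nothing about memory requirements, since all expert weights typically still have to be stored.

```python
def active_fraction(total_params_b: float, active_params_b: float) -> float:
    """Fraction of a sparse MoE model's parameters used per token."""
    if total_params_b <= 0 or not 0 <= active_params_b <= total_params_b:
        raise ValueError("expected 0 <= active <= total and total > 0")
    return active_params_b / total_params_b


# Figures quoted in the summary: 675B total, 41B active.
fraction = active_fraction(675.0, 41.0)
print(f"active share: {fraction:.1%}")
```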
Introducing Mistral 3 (★ 78)
Mistral AI News · 6 days ago · Release
Mistral AI introduced Mistral 3, a new open model family including Mistral Large 3 and Ministral 3 models at 3B, 8B, and 14B sizes. Large 3 is a 675B-parameter sparse MoE model with 41B active parameters, while Ministral 3 targets local and edge use cases. The models are released under Apache 2.0 and are available through Mistral AI Studio, Hugging Face, Amazon Bedrock, and other platforms.
Launch HN: General Instinct (YC P26) - Frontier models on edge devices
Hacker News (AI keywords) · 9 days ago · New Tool
General Instinct is a YC P26 company introduced through a Launch HN post. Its headline positioning is bringing frontier models to edge devices, suggesting local or embedded AI deployment rather than purely cloud-based inference. Since no article body is available, details such as supported models, hardware, benchmarks, pricing, and developer tooling cannot be verified from the provided source.
QNAP Showcases Ready & Recovery and Edge AI Enterprise IT Architecture at COMPUTEX 2026
INSIDE 硬塞 AI · 11 days ago · Hardware
QNAP appeared at COMPUTEX 2026 with “Ready & Recovery” and “Edge AI” as its two main themes. The showcase covered backup and recovery, anti-ransomware protection, high availability, on-prem generative AI, 100G networking, smart surveillance, and media workflows. The company also revealed multiple AI NAS products and enterprise switches, positioning its portfolio around data resilience, AI computing, and security.
Z-COM to Officially Launch NEW Platform at Computex 2026
INSIDE 硬塞 AI · 11 days ago · Hardware
Z-COM will officially introduce NEW Platform at Computex 2026. The edge-native infrastructure combines network control, AI operations, and energy management in a single architecture. Its stated goal is to support local AI computing and help enterprises reduce dependence on cloud providers and avoid cloud lock-in.
Microsoft Build 2026 unveils MAI-Thinking-1, Scout, and Project Solara (★ 76)
INSIDE 硬塞 AI · 11 days ago · Release
At Build 2026, Microsoft introduced an agent-first architecture that combines software and hardware into a broader AI platform. The announcement includes a unified Copilot app, self-developed MAI models, the persistent Scout agent, and the Project Solara device platform. The move frames AI agents as an end-to-end execution layer running from cloud services to user devices.
Nvidia chases $200B CPU market with AI agent PCs from Microsoft, Dell, and HP
TechCrunch AI · 12 days ago · Hardware
Nvidia is pursuing the $200 billion CPU market through AI agent PCs associated with Microsoft, Dell, and HP. The potential impact depends on whether AI agents can reach mainstream users in a simple, safe, and useful way. The provided excerpt does not specify hardware models, pricing, release dates, or performance details.
Qualcomm Unveils Dragonfly Data Center Brand for the Agentic AI Era
INSIDE 硬塞 AI · 13 days ago · Hardware
At Computex 2026, Qualcomm described AI agents as a major driver of cross-device hardware upgrades. The company unveiled Dragonfly, a new data center brand focused on inference computing. The announcement outlines a broader strategy spanning endpoint devices and cloud infrastructure, although the source does not provide specifications, performance figures, or deployment timelines.
Business Owners, Do You Really Know How to Use AI?
INSIDE 硬塞 AI · 18 days ago · Opinion
The article argues that many companies use AI mainly to improve efficiency, without creating meaningful revenue or strategic advantage. It proposes distributed AI, placing intelligence closer to where data is generated to reduce latency and support faster decisions. The key message is that firms should balance centralized and distributed architectures to strengthen competitiveness while preserving greater control over data and digital sovereignty.