GMKtec has announced its EVO-X3 mini PC with upgraded I/O, including OCuLink and Wi-Fi 7. More importantly for local AI enthusiasts, the company teased a future model powered by AMD's flagship "Strix Halo" Ryzen AI MAX+ 495 APU. The upcoming machine will support up to 192GB of LPDDR5X memory, positioning it as a lower-cost alternative to Apple Silicon for running large local LLMs.
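To put the 192GB figure in context, here is a rough back-of-the-envelope sketch of how much memory a model's weights need at a given quantization level. The function names, the example parameter counts, and the 85% usable-memory fraction are illustrative assumptions, not figures from GMKtec or AMD, and the estimate ignores KV cache and runtime overhead.

```python
def weights_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate size of the model weights in GB (1 GB = 1e9 bytes).

    Weights only: KV cache, activations, and runtime overhead are not counted.
    """
    return params_billion * bits_per_weight / 8


def fits(params_billion: float, bits_per_weight: float,
         memory_gb: float = 192, usable_fraction: float = 0.85) -> bool:
    """Check whether the weights fit in the assumed usable share of memory.

    usable_fraction is an assumed headroom for the OS and inference runtime,
    not a measured value.
    """
    return weights_gb(params_billion, bits_per_weight) <= memory_gb * usable_fraction


if __name__ == "__main__":
    # Hypothetical model sizes, in billions of parameters, at 4-bit quantization.
    for size in (70, 235, 405):
        print(f"{size}B @ 4-bit: {weights_gb(size, 4):.1f} GB, fits: {fits(size, 4)}")
```

Under these assumptions, a 4-bit model in the low hundreds of billions of parameters would fit, while a 405B model at 4 bits would not; actual limits depend on context length and the runtime used.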
A popular Reddit post on r/LocalLLaMA highlights a user's X99 motherboard finally dying. The Intel X99 platform, paired with cheap recycled Xeon CPUs, has long been a legendary budget choice for running local LLMs with multiple GPUs. The post triggered a wave of nostalgic "F" comments, marking the gradual end of these classic DIY budget rigs.
A developer has shared a practical guide on clustering three NVIDIA Jetson Orin Nano Super boards, leveraging their Ampere CUDA cores and unified memory. The project is part of 'smolcluster,' an initiative to make distributed AI training and inference accessible on everyday hardware like Macs, Raspberry Pis, and Jetsons. The series aims to explore whether heterogeneous clusters (those mixing different hardware architectures) can effectively run local LLMs.
Based only on the title, Nvidia appears to be proposing a high-end CPU system for Windows PCs. That could signal deeper ambitions beyond GPUs and AI accelerators into the core PC platform. However, no article text is available, so the architecture, specs, partners, timing, and product positioning remain unconfirmed.
At Computex, Marvell argued that connectivity is becoming a key bottleneck for AI infrastructure as systems scale. NVIDIA CEO Jensen Huang appeared at the event and described Marvell as the next trillion-dollar company. The presentation highlighted Marvell's AI connectivity stack, reflecting growing industry attention on the interconnects that support large-scale AI systems.
Well-known tech blogger Simon Willison recently shared and recommended an article by David Oks that provides an in-depth analysis of how the AI boom is…
According to the latest reports from foreign media, South Korean tech giant Samsung Electronics has reached a preliminary settlement agreement with employees…
The U.S. government recently announced that it will allocate $2 billion under the CHIPS and Science Act to directly invest in and fund nine domestic quantum…
AMD CEO Lisa Su recently shared her latest views on the AI hardware market, pointing out that the AI industry is approaching a critical inflection point…
Firefox is promoting a browser-based workflow for Adafruit users through Web Serial. The page says users can connect, code, and control compatible hardware devices from supported web tools without a separate desktop app. It points toward CircuitPython workflows and open web tooling, but it does not describe any AI model, benchmark, or generative AI feature.
The mysterious AI startup Hark has announced the successful completion of a Series A funding round totaling $700 million (approximately NT$22 billion), capital…
Nvidia CEO Jensen Huang has recently put forward a major market prediction, stating that Nvidia has its sights set on a brand-new market worth as much as $200…
AI chip design unicorn Cerebras Systems officially entered its long-awaited initial public offering (IPO) at a valuation of $60 billion. This company…
This edition of Import AI (Issue 444), written by Jack Clark, delves into the latest breakthroughs in artificial intelligence across three domains: social…
Against the backdrop of explosive global growth in artificial intelligence, compute has become the core resource that determines technological competitiveness…
Hugging Face's official blog has announced that its widely adopted open-source large model inference framework, Text Generation Inference (TGI), now officially…
AMD has officially launched its 5th-generation EPYC processor, codenamed "Turin," and Hugging Face has promptly published a blog post detailing the deep…
With the explosive growth of generative AI, demand for high-performance GPUs has reached an unprecedented level. To break hardware monopolies and reduce AI…
As enterprise demand for Retrieval-Augmented Generation (RAG) technology surges, how to maintain high performance while controlling hardware costs has become…
AMD and Hugging Face have jointly announced the "AMD Pervasive AI Developer Contest," a global competition designed to inspire developers worldwide to build…
This article presents the results of a collaboration between Hugging Face and the Intel Habana team, focusing on how to leverage Intel's Habana Gaudi2 deep…
As large language models (LLMs) and generative AI exploded in popularity, demand for computing power surged dramatically, leaving Nvidia GPUs (such as the…