Zhipu AI has released GLM 5.2, a point update to its flagship General Language Model series. GLM models are widely used for multilingual tasks, particularly in Chinese-language applications, and are available both as a commercial API and as open-weight downloads. The release was noted on Hacker News, though specific feature changes, benchmark results, and technical details for version 5.2 were not available from the source.
This is Issue #21 of the "Open Artifacts" column by well-known AI commentator Nathan Lambert, exploring the explosive growth in the open-weights and…
Hugging Face's official blog has announced that DeepInfra — a well-known high-performance, low-cost serverless inference platform — has officially joined…
Google DeepMind has officially released Gemini 3.1 Flash-Lite, the fastest and most cost-efficient version within its latest generation Gemini 3 series of…
Mixture of Experts (MoE) has become the mainstream architecture for current large language models (LLMs). This article takes an in-depth look at how MoE…
Hugging Face's official blog has announced exciting news for the open-source AI community: Hugging Face has formed a deep partnership with Unsloth — the…
In today's era of rapid AI advancement, major model vendors and research institutions are releasing all manner of "leaderboards" to claim their models surpass…
The Technology Innovation Institute (TII) of the United Arab Emirates has officially released the new "Falcon-H1-Arabic" model on the Hugging Face platform…
Google DeepMind has today officially introduced its latest generation AI model — Gemini 3 Flash. The model's core positioning is "built for speed," designed to…
With the successive emergence of models with powerful "reasoning" capabilities — such as OpenAI o1, o3, and DeepSeek-R1 — the challenge of reducing the…
Google DeepMind today officially unveiled its latest generation AI model family — Gemini 3 — and extended an invitation to developers worldwide, formally…
Google DeepMind today announced that Gemini 2.5 Flash-Lite — its lightweight AI model that had previously been in preview — has officially transitioned to a…
Wharton School professor Ethan Mollick has put together a highly personal and practical operating guide for the AI landscape of late 2025. He emphasizes that…
Hugging Face continues to expand its "Inference Providers" program, aimed at enabling developers to run open-source models from Hugging Face Hub in the…
Hugging Face and Together AI have announced a deep partnership, launching a new integration designed to streamline the fine-tuning workflow for open-source…
As generative AI advances rapidly, deploying massive models to resource-constrained edge devices — such as smartphones, smart hardware, and AI PCs — has become…
Vercel announced in its official Changelog that OpenAI's latest generation flagship model GPT-5, along with its lightweight version GPT-5-mini and…
The Hugging Face official blog has announced exciting news, formally welcoming OpenAI's newly launched open-source model family — "GPT OSS." This is undeniably…
Hugging Face has announced the release of a brand-new generation of lightweight open-source models — SmolLM3. As the latest member of the SmolLM family…
Hugging Face and the UAE's Technology Innovation Institute (TII, the organization behind the well-known open-source model Falcon) have jointly announced a new…
Google DeepMind today announced a major update to the Gemini 2.5 thinking models family, aimed at improving overall performance and accuracy while providing…
Google DeepMind today announced a major advancement for the Gemini 2.5 model family. First, the previously preview-stage Gemini 2.5 Flash and Gemini 2.5 Pro…
Replicate, the well-known AI model hosting and deployment platform, has announced a major update: it now officially supports OpenAI's latest-generation models…
Google DeepMind today announced important updates to its flagship model series, Gemini 2.5. The most noteworthy highlight of this update is a brand-new…
As large language models (LLMs) and vision language models (VLMs) continue to scale up, running these models on limited hardware resources — such as…
### Background and Pain Points: Moving Beyond the Overly Simple "Needle in a Haystack" Test In recent years, the context window length supported by large…
Although AINews characterized these two days as "a calm day," in reality, tech giants and the open-source community remained full of undercurrents. First, on…
On February 14, 2025, Hugging Face — the leading open-source AI community — officially announced the integration of high-performance AI inference platform…
Hugging Face has officially launched the "Inference Providers" feature on the Hugging Face Hub — a major update designed to address the pain points developers…
The Technology Innovation Institute (TII) of Abu Dhabi has officially launched the new Falcon 3 open-source model family on Hugging Face. This marks a major…