Wharton School professor Ethan Mollick has put together a highly personal and practical operating guide for the AI landscape of late 2025. He emphasizes that…
At a time when AI-assisted development is rapidly evolving, Vercel has published a deep technical breakdown exploring how to natively integrate its React…
University of Pennsylvania Wharton School professor Ethan Mollick, in his latest article, compares the experience of collaborating with generative AI (such as…
### Background and the LLM Inference Bottleneck When running large language models (LLMs), autoregressive generation is inherently "memory-bandwidth-bound"…
In this article exploring "Mass Intelligence," University of Pennsylvania Wharton School professor Ethan Mollick reveals an imminent future: high-level…
Hugging Face has recently introduced a new benchmark called "TextQuests," designed to evaluate the performance of large language models (LLMs) in text-based…
Renowned AI scholar and Wharton School professor Ethan Mollick published a forward-looking observation about GPT-5 on his blog "One Useful Thing," titled…
Vercel announced in its official Changelog that OpenAI's latest generation flagship model GPT-5, along with its lightweight version GPT-5-mini and…
The Hugging Face official blog has announced exciting news, formally welcoming OpenAI's newly launched open-source model family — "GPT OSS." This is undeniably…
As large multimodal models (LMMs) have achieved breakthroughs in image and short-video understanding, the industry has gradually shifted its attention to the…
### What is FutureBench? As large language models (LLMs) and AI agents have rapidly advanced, traditional static benchmarks (such as MMLU and GSM8K) face a…
With the rise of Anthropic's Claude 3.5 Sonnet "Computer Use" and various GUI-oriented multimodal models, "desktop agents" have become one of the hottest areas…
University of Pennsylvania Wharton School professor Ethan Mollick recently published an extremely practical AI quick guide, "Using AI Right Now: A Quick…
Replicate, the well-known AI model hosting and deployment platform, has announced a major update: it now officially supports OpenAI's latest-generation models…
Wharton School professor Ethan Mollick, in his latest article "Personality and Persuasion," delves into AI's persuasive power and the psychological mechanisms…
OpenAI recently held a live stream and published a blog post to officially announce the new reasoning model o3 and the lightweight reasoning model o4-mini…
### Background and Pain Points: Moving Beyond the Overly Simple "Needle in a Haystack" Test In recent years, the context window length supported by large…
OpenAI has officially released its new flagship model GPT 4.1, positioned as the next-generation "workhorse" designed to give developers and enterprises the…
Although AINews characterized these two days as "a calm day," in reality, tech giants and the open-source community remained full of undercurrents. First, on…
As large language model (LLM) technology has evolved, AI has transformed from a simple question-answering assistant into an "AI agent" capable of proactively…
On January 24, 2025, Hugging Face announced that smolagents — its open-source library designed for building lightweight, high-performance AI agents — now…
Hugging Face officially launched a lightweight AI agent development framework called `smolagents` at the end of 2024. The core philosophy of this tool is "Code…
### Background and Challenges: The Difficulty of Evaluating Non-English LLMs In the current landscape of large language model (LLM) development, evaluating…
This article from the Hugging Face blog introduces "The First Multilingual LLM Debate Competition." As large language models (LLMs) have rapidly advanced…
As large language models (LLMs) have rapidly advanced, traditional static benchmarks (such as MMLU) have increasingly faced saturation and gaming problems. As…
This case study provides a detailed account of how non-profit organization Digital Green, with support from Hugging Face's Expert Support team, optimized its…
As generative AI applications become more widespread, one of the biggest challenges developers face is the "non-deterministic" output of large language models…
### Background and Challenges Document Visual Question Answering (DocVQA) is an important application of multimodal AI, requiring models to simultaneously…
The Hugging Face team published a blog post announcing that their Code Agent, developed using the `transformers` library, achieved a breakthrough score on the…
As large language models (LLMs) have made tremendous strides in code generation, the long-standing industry gold standard — the HumanEval benchmark — has…