Google DeepMind has officially announced the launch of Gemini Robotics 1.5, marking the formal entry of AI Agent technology into the physical world and…
Google DeepMind has unveiled a new AI Agent called "CodeMender," designed to leverage advanced artificial intelligence to automatically remediate critical…
Google DeepMind has officially launched the new dedicated "Gemini 2.5 Computer Use" model, which is now available in preview via API. This model is built on…
Hugging Face has announced the launch of a new open-source project called "OpenEnv," aimed at collaboratively building an open and standardized execution…
University of Pennsylvania Wharton School professor Ethan Mollick, in his latest article, explores the far-reaching changes brought about by AI Agents…
As AI Agent applications become increasingly widespread, running large language models (LLMs) efficiently on personal computers (such as AI PCs powered by…
AI agents are currently the hottest research direction in the AI field, but how to objectively, safely, and reproducibly evaluate agent capabilities has long…
Vercel officially announced the launch of "402-mcp" in its Changelog — an important update that injects commercial capabilities into the Model Context Protocol…
### Background and Core Concepts Traditional large language models (LLMs), when faced with complex mathematics, data analysis, or programming tasks, can…
With Anthropic's introduction of the Model Context Protocol (MCP), the development paradigm for AI applications is undergoing a quiet revolution. In a recent…
As the use of AI in academic research becomes increasingly widespread, enabling large language models (LLMs) to access the latest scientific literature in real…
Hugging Face has recently introduced a new benchmark called "TextQuests," designed to evaluate the performance of large language models (LLMs) in text-based…
Renowned AI scholar and Wharton School professor Ethan Mollick published a forward-looking observation about GPT-5 on his blog "One Useful Thing," titled…
This article provides a detailed look at how NVIDIA is using its open-source Llama Nemotron series of models to evaluate and build top-performing, portable…
As AI applications become more widespread, how to allow large language models (LLMs) to securely and efficiently access enterprise internal data or external…
Vercel has announced a major update to its AI development tooling, launching a new service based on the Model Context Protocol (MCP) that allows developers to…
The Model Context Protocol (MCP) is an open standard introduced by Anthropic, designed to allow AI assistants (such as Claude) to interact securely and…
### What is FutureBench? As large language models (LLMs) and AI agents have rapidly advanced, traditional static benchmarks (such as MMLU and GSM8K) face a…
With the rise of Anthropic's Claude 3.5 Sonnet "Computer Use" and various GUI-oriented multimodal models, "desktop agents" have become one of the hottest areas…
Hugging Face has officially announced the launch of its dedicated MCP (Model Context Protocol) server — a major step in ecosystem integration. The Model…
With Anthropic's introduction of the Model Context Protocol (MCP) open standard, the way AI agents connect to external tools and data sources has become…
As large language models (LLMs) have evolved, AI applications have moved beyond simple "question-and-answer conversations" toward "AI Agents" capable of…
As artificial intelligence moves beyond simple "text-based conversation" into the era of Agents (intelligent agents) that actively execute tasks, enabling AI…
H (formerly Holistic AI), a highly regarded French AI startup, recently officially released a new family of vision-language models (VLMs) on the Hugging Face…
In this Hugging Face blog post, the team takes a deep dive into the evolution of AI agent architectures — specifically how to combine "structured constraints"…
Hugging Face recently published a highly practical technical tutorial demonstrating how to build a fully functional miniature AI agent in just around 70 lines…
With the explosion of multimodal technology, Vision Language Models (VLMs) have evolved from laboratory research prototypes into core tools for enterprises and…
Since Anthropic introduced the Model Context Protocol (MCP) open standard, connecting large language models (LLMs) to external tools has never been easier. The…
ServiceNow recently published a new open-source project called PipelineRL on the Hugging Face platform. As large language model (LLM) and AI agent systems move…
In this Hugging Face blog post, the team demonstrates how to implement a fully functional, lightweight AI agent (referred to as a "Tiny Agent") that supports…