In an era of exploding AI applications, the competition and evolution of underlying AI infrastructure (AI Infra) is equally compelling. The latest issue of…
As AI agents rise to prominence, traditional code editors can no longer meet developers' needs for debugging, observing, and orchestrating agents. Superset is…
Hugging Face's official blog has announced that DeepInfra — a well-known high-performance, low-cost serverless inference platform — has officially joined…
As the Notion platform has gradually evolved into a broader ecosystem, enabling users and developers to run custom scripts and automated workflows within…
The official Vercel Changelog announces that the bundle size limit for Python Vercel Functions (serverless functions) has been significantly raised to 500MB…
As the reasoning capabilities of Large Language Models (LLMs) improve, building a simple AI Agent has become easier than ever before. Developers can combine a…
Vercel has published a detailed Frequently Asked Questions (FAQ) guide covering "Agent Skills" — the core capability that empowers AI Agents. As AI…
With the explosion of generative AI applications, modern web development has increasingly converged on a golden combination: Next.js (React) on the frontend…
Hugging Face has announced a new partnership with OVHcloud, Europe's leading cloud infrastructure provider, officially incorporating OVHcloud into Hugging Face…
In November 2025, Replicate — the well-known AI model hosting and API services platform — announced that it was officially joining Cloudflare, the global…
This technical post from Vercel dives deep into the challenges they faced and the hands-on lessons they learned while developing and deploying AI Agents. As AI…
Cloud AI model hosting platform Replicate has announced official support for IBM's latest Granite 4.0 model family. This means developers and enterprise users…
Hugging Face officially announced a partnership with Featherless AI, a serverless GPU inference platform, integrating it into the Hugging Face Inference…
The AI-managed inference platform Replicate has announced a deep partnership with Hugging Face, the giant of the open-source AI community, officially bringing…
Vercel has officially announced support for deploying MCP (Model Context Protocol) servers. This update allows developers to use Vercel's Serverless…
Vercel has published an official announcement declaring that, effective immediately, "Fluid Compute" will become the default compute architecture for all newly…
On February 18, 2025, Hugging Face announced the addition of three new partners to its serverless inference ecosystem: Hyperbolic, Nebius AI Studio, and Novita…
On February 14, 2025, Hugging Face — the leading open-source AI community — officially announced the integration of high-performance AI inference platform…
Hugging Face has officially launched the "Inference Providers" feature on the Hugging Face Hub — a major update designed to address the pain points developers…
Hugging Face and NVIDIA announced a major partnership in late July 2024, officially launching a serverless inference service powered by NVIDIA NIM (NVIDIA…
Hugging Face and internet infrastructure giant Cloudflare have announced a major partnership that officially brings serverless GPU inference services to…
AI infrastructure startup Replicate announced the successful completion of a $40 million Series B funding round. This round was led by prominent Silicon Valley…
The Yi model series is a bilingual (Chinese and English) large language model trained from scratch by 01.AI, the AI startup founded by Kai-Fu Lee. Upon its…
Hugging Face has announced the launch of Gradio-Lite (@gradio/lite), a new library that enables Gradio applications to run entirely within the user's browser…
The Hugging Face official blog has announced a new "Inference for PROs" upgraded service for PRO subscribers (at $9 per month). This service is designed to…
AI cloud hosting platform Replicate has announced a major technical breakthrough for fine-tuned models: the "cold boot" time for fine-tuned models has been…
AI cloud hosting and API service platform Replicate announced a major billing structure overhaul on its official blog, centering on two core changes: price…