The Technology Innovation Institute (TII) of the UAE recently unveiled a new open-source language model series on the Hugging Face blog…
The Technology Innovation Institute (TII) of Abu Dhabi has officially released a new language model series called "Falcon-Arabic" on the Hugging Face platform…
At Google I/O 2025, Google DeepMind announced the launch of the new "SynthID Detector" portal. This tool is designed to address the increasingly serious…
Google announced new generative media models and tools at I/O 2025, led by Veo 3 for video, Imagen 4 for images, and Flow for AI filmmaking. Veo 3 adds audio generation, while Imagen 4 improves detail, typography, aspect ratios, and up to 2K output. Google also expanded Lyria 2 and Lyria RealTime access, while continuing SynthID watermarking and launching SynthID Detector.
Vercel recently announced a new "one-click AI bot managed ruleset" in its Changelog. With the rapid rise of generative AI, web crawlers operated by major AI…
With the explosion of multimodal technology, Vision Language Models (VLMs) have evolved from laboratory research prototypes into core tools for enterprises and…
In the history of artificial intelligence, the 2012 breakthrough on the ImageNet dataset is widely recognized as the key catalyst that ignited the deep…
Vercel recently made an important upgrade to its built-in Observability tools, introducing the new "Quick Actions" feature. This new capability is designed to…
Ideogram, a standout in the AI image generation space, has launched its latest-generation model Ideogram 3.0, which is now officially available on the AI model…
The AI development platform Replicate has announced official support for MiniMax's Speech-02 voice generation model API. MiniMax, a leading AI research team…
The Vercel official blog introduced how its AI web generation tool, v0, now incorporates Search Engine Optimization (SEO) best practices by default when…
Background: With the proliferation of vision-language models (VLMs), using VLMs for document OCR (e.g., converting PDFs to Markdown) has become mainstream…
AI video generation has reached a major milestone: Google's Veo 2 and Kuaishou's Kling 2, currently ranked at the top of the Artificial Analysis Video Arena…
Hugging Face's official blog announced that Cohere, the well-known enterprise AI research and development company, has officially joined Hugging Face's…
Well-known AI imaging brand Easel AI has officially announced that its advanced face swap and AI avatar generation models are now available on the popular…
The Language Technologies department (BSC-LT) of the Barcelona Supercomputing Center (BSC) recently released a new open-source multimodal model on Hugging Face…
At NVIDIA GTC 2025, NVIDIA unveiled a remarkable set of new open-source models and datasets for the field of "Physical AI" — also known as embodied…
Since its launch, Hugging Face's Open R1 project has been dedicated to replicating the reasoning capabilities of DeepSeek-R1 in a fully open-source manner. In…
As the hardware performance of mobile devices continues to improve, "edge inference" — running large language models (LLMs) directly on smartphones — has…
Alibaba's open-source Wan2.1 is a text-to-video model that has been attracting considerable attention. To help developers and creators get the most out of this…
Cohere For AI (C4AI) has officially launched "Aya Vision," a series of open-source multimodal models (available in 8B and 32B parameter versions) designed…
In the current era of generative AI sweeping the globe, many developers habitually feed all tasks — including simple text classification, sentiment analysis…
In modern web development workflows, there has long been an invisible gap between "designers" and "engineers." The pixels and interactions that designers…
Background and pain points: As large language models (LLMs) have become widespread, the file sizes hosted on the Hugging Face Hub have grown dramatically…
Hugging Face has officially published the second technical update (Update #2) for the Open R1 project, which aims to replicate DeepSeek-R1's reasoning model…
Physical Intelligence, a Physical AI startup founded by robotics luminary Sergey Levine and others, has officially open-sourced its flagship robot foundation…
Background and the goals of the Open-R1 project: Since the release of DeepSeek-R1, its powerful reasoning capability and remarkably low training cost have…
As DeepSeek-R1 swept through the AI landscape on the strength of its powerful reasoning capabilities, how to safely and efficiently deploy and fine-tune these…
v0, the AI-powered web generation tool from Vercel, has received a major workflow upgrade, officially supporting integration with Figma and Custom Design…
This official Hugging Face blog post takes an in-depth look at the current state of open-source video generation models within the Diffusers ecosystem. As…