Decart is launching Oasis 3, a real-time world model designed to generate photorealistic driving environments for autonomous vehicle testing. According to the headline, it can simulate hours of driving, though with caveats. The model is now available through an API, giving developers a way to build applications or testing workflows on top of it.
Anthropic released Claude Fable 5 and Claude Mythos 5 simultaneously; Fable 5 matches Mythos 5 in capability but adds strict safety classifiers, with new API fallback mechanisms for rejected requests. Both models offer a 1M-token context window, 128K max output, and a January 2026 knowledge cutoff, and both are priced at $10/$50 per million tokens, double the price of Opus 4.x. In Simon's knowledge-breadth test, Fable 5 substantially outperforms Opus 4.8, listing dozens of his open-source projects with approximate dates from memory alone.
Anthropic announced Claude Fable 5 on June 9, 2026, marking a new naming generation beyond the Claude 4.X family. The announcement URL also references 'Mythos 5,' suggesting a companion model may be included in this release. Available under the model ID claude-fable-5, it is Anthropic's most current model and is relevant to developers, researchers, and enterprise users integrating Claude APIs.
Anthropic introduced Claude Opus 4.8 as an upgrade over Opus 4.7, with stronger benchmark performance across coding, agentic skills, reasoning, and knowledge work. The release also adds dynamic workflows in Claude Code, effort controls in claude.ai and Cowork, and new Messages API support for system entries inside the messages array. Pricing for regular usage remains unchanged, while fast mode is now cheaper than previous models.
Google officially unveiled Gemini 3.5 Flash at its 2026 I/O conference. Unlike previous launches, this new model skipped the `-preview` stage and went directly…
OpenAI has continued to expand the reach of its GPT-5 technology, officially launching three new voice and audio APIs: GPT-Realtime-2, GPT-Translate, and…
Hugging Face's official blog has announced that DeepInfra — a well-known high-performance, low-cost serverless inference platform — has officially joined…
Vercel has published a detailed Frequently Asked Questions (FAQ) guide covering "Agent Skills" — the core capability that empowers AI Agents. As AI…
Google DeepMind has announced a major upgrade to its Gemini audio models, aimed at delivering a more natural, fluid, and low-latency voice interaction…
Google DeepMind has officially launched the new dedicated "Gemini 2.5 Computer Use" model, which is now available in preview via API. This model is built on…
Cloud AI model hosting platform Replicate has announced official support for IBM's latest Granite 4.0 model family. This means developers and enterprise users…
As AI applications become more widespread, how to allow large language models (LLMs) to securely and efficiently access enterprise internal data or external…
Grok 3, the flagship AI model from Elon Musk's xAI, has finally opened API access months after launch, and simultaneously surprised…
Although AINews characterized these two days as "a calm day," the tech giants and the open-source community were in fact far from quiet. First, on…
On February 18, 2025, Hugging Face announced the addition of three new partners to its serverless inference ecosystem: Hyperbolic, Nebius AI Studio, and Novita…
On February 14, 2025, Hugging Face — the leading open-source AI community — officially announced the integration of high-performance AI inference platform…
Hugging Face has officially launched the "Inference Providers" feature on the Hugging Face Hub — a major update designed to address the pain points developers…
Replicate has published its eighth issue of technical intelligence (Replicate Intelligence #8), bringing three major updates for developers: 1. **Top…
On July 23, 2024, Meta officially released the highly anticipated Llama 3.1 405B — one of the most powerful open-source large language models in the world…
Snowflake recently launched a brand-new open-source large language model called "Snowflake Arctic" — a Mixture of Experts (MoE) model designed for…
AI infrastructure startup Replicate announced the successful completion of a $40 million Series B funding round. This round was led by prominent Silicon Valley…
The Yi series is a family of bilingual (Chinese and English) large language models trained from scratch by 01.AI, the AI startup founded by Kai-Fu Lee. Upon its…
The official Hugging Face blog has announced "Inference for PROs," a new upgraded service for PRO subscribers (at $9 per month). This service is designed to…
AI cloud hosting and API service platform Replicate announced a major billing structure overhaul on its official blog, centering on two core changes: price…
Meta's Llama 2 represents a landmark milestone in the history of open-source large language model (LLM) development. Its performance was regarded at the time…
Meta officially launched the highly anticipated open-source large language model Llama 2 on July 18, 2023, immediately triggering a tsunami of cascading…
As the world's largest open-source AI model hub, Hugging Face not only provides model hosting but has also built a complete inference ecosystem. This article…
Hugging Face Inference Endpoints is a fully managed service designed for developers and enterprises, built to solve the pain points of deploying machine…
This blog post from Replicate provides a clear and accessible introduction to running text-to-image models using Replicate's cloud API service. It serves as an…