Mistral AI introduced Mistral Small 4 as the next major release in the Mistral Small family. It combines reasoning, multimodal, and agentic coding capabilities into one open model with configurable reasoning effort. The model uses a MoE architecture, supports a 256k context window and text-image inputs, and is available through Mistral API, AI Studio, Hugging Face, NVIDIA NIM, and common inference stacks.
Mistral Small 4 is the next major release in the Mistral Small family, unifying Magistral-style reasoning, Pixtral-style multimodality, and Devstral-style coding agents. It uses a MoE architecture with 119B total parameters, 6B active parameters per token, a 256k context window, and configurable reasoning effort. The model is available via Mistral API, AI Studio, Hugging Face, open-source serving stacks, and NVIDIA deployment options.
A community benchmark of Qwen 3.6 27B on DeepSWE yielded a score of 1.79% (18/20th place), slightly outperforming Haiku 4.5. Run on a single RTX 6000 Blackwell GPU via vLLM with reasoning enabled, the test averaged 32 minutes and 44k output tokens per task. The author notes that while Qwen 3.6 27B represents a 'poor man's local SOTA,' the massive gap compared to frontier closed models suggests local LLMs are struggling to keep pace in complex coding.
This AINews feature from Latent Space argues that the AI industry is undergoing a profound transformation — "all the model labs are now agent labs." Over the…
A historic and landmark breakthrough has arrived at the intersection of artificial intelligence and mathematics. According to Latent Space, OpenAI's…
Google DeepMind has announced the launch of its next-generation AI model, Gemini 3.5, positioned as "frontier intelligence with action." This announcement…
As AI technology continues to iterate at a rapid pace, the developer community is confronting a profound rethinking of the question: "Is fine-tuning heading…
This interview records an in-depth conversation between OpenAI theoretical physicist Alex Lupsasca and Latent Space, centered on how GPT-5.x — OpenAI's…
Wharton School professor Ethan Mollick, writing in his well-known newsletter "One Useful Thing," has published a profound analysis of GPT-5.5. He describes…
In this forward-looking article on the state of AI in mid-2026, Interconnects founder Nathan Lambert takes a deep dive into the dynamic gap between open-weight…
As generative AI technology has evolved, the industry's focus has shifted from pure "Large Language Models (LLMs)" to "AI Agents" capable of autonomously…
Google DeepMind has today officially released its latest generation of open-source model series — Gemma 4. The company positions it as "the smartest and most…
This article takes a deep dive into one of the most contentious topics in artificial intelligence: AI "self-improvement" and whether it will trigger a "fast…
Hugging Face has published its Spring 2026 "State of Open Source AI" report, offering a comprehensive review of the explosive growth and paradigm shifts that…
Wharton School professor Ethan Mollick, in his latest article "The Shape of the Thing," sketches out a clear picture of the current state of AI technological…
Google DeepMind officially released a brand-new model today (February 19, 2026): "Gemini 3.1 Pro." According to the initial official disclosure, the core…
Prominent scholar Ethan Mollick, in his latest article, points out that we have officially crossed beyond the era of simple "Chatbots" and entered what he…
This article by Nathan Lambert takes a deep dive into the tangled competitive dynamics between open-source and closed-source AI models. Lambert argues that…
On February 12, 2026, Google DeepMind announced the launch of its most advanced reasoning mode update — Gemini 3 Deep Think. This model is Google's…
Google DeepMind recently published an article exploring how its deep-reasoning model, "Gemini Deep Think," is transforming the landscape of mathematics and…
This blog post from Hugging Face reviews the full year of technical evolution since the "DeepSeek Moment" at the start of 2025 — the release of DeepSeek-V3 and…
The DeepSeek-V3 and R1 models released in January 2025 have been hailed as the "DeepSeek Moment" in the AI world. This upheaval not only shattered the myth…
NVIDIA and Hugging Face have jointly announced the launch of the new Cosmos Reason 2 model, marking a major breakthrough in the fields of Physical AI and…
As 2025 draws to a close, Google DeepMind has published its annual review, showcasing eight breakthrough research areas in artificial intelligence. This year…
With the successive emergence of models with powerful "reasoning" capabilities — such as OpenAI o1, o3, and DeepSeek-R1 — the challenge of reducing the…
Google DeepMind officially unveiled its latest flagship AI model — Gemini 3 — in November 2025. This marks a new milestone for Google in the field of…
Google DeepMind has officially introduced SIMA 2 (Scalable Instructable Multiworld Agent 2). Compared to its predecessor, the most significant transformation…
Google DeepMind has officially announced the launch of the "AI for Math Initiative," a major program aimed at deeply integrating artificial intelligence into…
The International Mathematical Olympiad (IMO) has been held annually since 1959 and is the most prestigious and difficult mathematics competition for high…
Google DeepMind has announced that its latest reasoning model, "Gemini 2.5 Deep Think," has achieved gold-medal-level performance at the International…