Mistral AI News | Jun 8, 2026, 9:02 AM | important 84

Introducing Mistral 3

Original: Introducing Mistral 3 (Research), December 2, 2025, Mistral AI

Mistral 3 brings Apache 2.0 open models spanning Large 3 and edge-focused Ministral variants.

Mistral AI introduced Mistral 3, a new open model family under Apache 2.0. It includes Mistral Large 3, a 675B-parameter sparse MoE with 41B active parameters, plus Ministral 3 models at 3B, 8B, and 14B. The release targets frontier open-weight use, multimodal and multilingual workflows, enterprise customization, and efficient local or edge deployments.

Mistral AI has officially launched Mistral 3, an open model family that spans large frontier models and small edge models. The release centers on Mistral Large 3 and on Ministral 3 in three sizes: 3B, 8B, and 14B. All models are released under the Apache 2.0 license. For developers, enterprises, and the research community, the key point is that the models can be deployed, fine-tuned, and built upon under a permissive license, not merely tried out.

Mistral Large 3 is currently Mistral's most powerful model. It uses a sparse mixture-of-experts (MoE) architecture with 675B total parameters, of which 41B are active per token, and it is offered in both base and instruction fine-tuned versions. The company states that it competes with major open-weight instruction models on general instruction tasks, that it supports image understanding and multilingual conversation, and that it ranks highly in the OSS non-reasoning category on LMArena.

On the deployment side, Mistral emphasizes partnerships with NVIDIA, vLLM, Red Hat, and others. It provides an NVFP4 checkpoint intended for more efficient execution on Blackwell NVL72 systems and on 8xA100 or 8xH100 nodes, and it mentions inference support in TensorRT-LLM and SGLang.

The smaller Ministral 3 series targets local and edge scenarios. Each size comes in base, instruct, and reasoning versions, all with multimodal and multilingual capabilities. The company particularly emphasizes cost-effectiveness: it states that the instruct models match or exceed comparable models in the same class while producing fewer output tokens, and that the reasoning versions can trade longer reasoning for higher accuracy.

Mistral 3 is already available on Mistral AI Studio, Amazon Bedrock, Azure Foundry, Hugging Face, Modal, IBM WatsonX, OpenRouter, Fireworks, Unsloth AI, and Together AI. NVIDIA NIM and AWS SageMaker are expected to support it later.

For Taiwanese developers and researchers, the value of this release is that Mistral 3 puts open weights, enterprise deployment, edge inference, and multimodal capabilities into a single product line, with choices ranging from small on-device models to a large MoE model.
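
To make the parameter figures above concrete, here is a rough back-of-the-envelope sketch of what "675B total, 41B active" implies. Only the two parameter counts come from the announcement. The helper names, the bit widths, and the 8 x 80 GB node size are illustrative assumptions. In particular, treating a 4-bit checkpoint as exactly 4 bits per weight is a simplification, since formats such as NVFP4 also store scaling factors, and the estimate ignores KV cache and activation memory.

```python
# Back-of-the-envelope sizing for a sparse MoE model.
# Parameter counts are from the announcement; everything else is an assumption.

TOTAL_PARAMS = 675e9   # total parameters (from the article)
ACTIVE_PARAMS = 41e9   # active parameters (from the article)

# Hypothetical node: 8 GPUs with 80 GB each (an assumption, not from the article).
ASSUMED_NODE_MEMORY_GB = 8 * 80


def active_fraction(total: float, active: float) -> float:
    """Share of the model's parameters that are active."""
    return active / total


def weight_memory_gb(params: float, bits_per_weight: float) -> float:
    """Approximate weight storage in GB (10^9 bytes).

    Ignores quantization scale factors, KV cache, and activations.
    """
    return params * bits_per_weight / 8 / 1e9


if __name__ == "__main__":
    frac = active_fraction(TOTAL_PARAMS, ACTIVE_PARAMS)
    print(f"active fraction: {frac:.1%}")
    for label, bits in [("16-bit", 16), ("8-bit", 8), ("4-bit", 4)]:
        gb = weight_memory_gb(TOTAL_PARAMS, bits)
        fits = "fits" if gb < ASSUMED_NODE_MEMORY_GB else "does not fit"
        print(f"{label}: ~{gb:,.0f} GB of weights, {fits} in {ASSUMED_NODE_MEMORY_GB} GB")
```

Under these assumptions, only about 6% of the weights are active, yet all 675B must still be held in memory. That is why a 4-bit checkpoint matters for single-node serving: the weights alone come to roughly 338 GB at 4 bits versus 675 GB at 8 bits.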

Summaries are AI-generated; the original article is authoritative.