Mistral introduced Devstral 2, a 123B-parameter coding model, and Devstral Small 2, a 24B-parameter variant for lighter deployment. The company reports SWE-bench Verified scores of 72.2% for Devstral 2 and 68.0% for Devstral Small 2, and both models ship under permissive open-source licenses. It also launched Mistral Vibe CLI, an open-source terminal agent for codebase exploration, multi-file edits, command execution, and IDE integration.
Mistral AI released Mistral Vibe 2.0, a terminal-native coding agent powered by the Devstral 2 model family. The update adds custom subagents, multi-choice clarifications, slash-command skills, unified agent modes, and automatic CLI updates. Vibe is available through Le Chat Pro and Team plans, with pay-as-you-go usage or bring-your-own-key (BYOK) options. Devstral 2 moves to paid API access, with free testing on the Experiment plan.
Mistral AI introduced Mistral Small 4 as the next major release in the Mistral Small family. It combines reasoning, multimodal, and agentic coding capabilities in one open model with configurable reasoning effort. The model uses a mixture-of-experts (MoE) architecture, supports a 256k-token context window and both text and image inputs, and is available through the Mistral API, AI Studio, Hugging Face, NVIDIA NIM, and common inference stacks.
Mistral announced Vibe as the successor to Le Chat, combining work and coding agents under one product and license. Work Mode connects to enterprise apps, documents, mail, calendars, data, and recurring workflows. Code Mode spans the web app, VS Code extension, and CLI, supporting sandboxed coding sessions, tests, diffs, and pull requests.
Boxes.dev appeared on Hacker News as a Show HN post, positioning itself as a way to move Claude Code and Codex workflows from localhost to the cloud. Based only on the title, it seems aimed at cloud development or remote agent execution. The provided source does not include details on architecture, pricing, security, integrations, or limitations.
The source title points to DeepSeek Reasonix, described as a native coding agent for the DeepSeek ecosystem. Its stated emphasis is a high cache-hit rate and low cost, suggesting a design aimed at reducing repeated inference expense during coding workflows. With no article body available, details such as features, benchmarks, pricing, supported IDEs, licensing, or availability cannot be confirmed.