Latest in AI

Vibe gets to work. ★ 74
Mistral AI News · 6 days ago · New Tool
Mistral announced Vibe as the successor to Le Chat, combining work and coding agents under one product and license. Work Mode connects to enterprise apps, documents, mail, calendars, data, and recurring workflows. Code Mode spans the web app, VS Code extension, and CLI, supporting sandboxed coding sessions, tests, diffs, and pull requests.
AI Now Summit 2026 ★ 72
Mistral AI News · 6 days ago · Business
Mistral’s AI Now Summit 2026 post highlights a broader enterprise AI push rather than a single model launch. It introduces Mistral for Industrial Engineering, including work with Airbus, BMW Group, and ASML, and updates Vibe as a unified long-horizon productivity and coding agent. The post also announces the Les Ulis 10 MW inference data center, scheduled for Q3 2026, emphasizing control, security, and infrastructure resilience.
Voxtral TTS ★ 76
Mistral AI News · 6 days ago · Release
Mistral AI introduced Voxtral TTS, its first text-to-speech model, targeting natural multilingual voice generation across nine languages. The 4B-parameter model supports voice adaptation from short references, emotional expressiveness, dialect handling, and low-latency streaming. It is available through the API, Mistral Studio, and Le Chat, with open weights on Hugging Face under the non-commercial CC BY-NC 4.0 license.
Introducing Mistral 3 ★ 78
Mistral AI News · 6 days ago · Release
Mistral AI introduced Mistral 3, a new open model family including Mistral Large 3 and Ministral 3 models at 3B, 8B, and 14B sizes. Large 3 is a 675B-parameter sparse MoE model with 41B active parameters, while Ministral 3 targets local and edge use cases. The models are released under Apache 2.0 and are available through Mistral AI Studio, Hugging Face, Amazon Bedrock, and other platforms.
Introducing Mistral Small 4 ★ 78
Mistral AI News · 6 days ago · Release
Mistral Small 4 is the next major release in the Mistral Small family, unifying Magistral-style reasoning, Pixtral-style multimodality, and Devstral-style coding agents. It uses a MoE architecture with 119B total parameters, 6B active parameters per token, a 256k context window, and configurable reasoning effort. The model is available via Mistral API, AI Studio, Hugging Face, open-source serving stacks, and NVIDIA deployment options.
Remote agents in Vibe. Powered by Mistral Medium 3.5. ★ 76
Mistral AI News · 6 days ago · Release
Mistral Medium 3.5 is a 128B dense flagship model with a 256k context window, combining instruction-following, reasoning, and coding. It becomes the default model for Le Chat and Mistral Vibe, enabling cloud-based remote coding agents launched from the CLI or chat. The release also adds Le Chat Work mode for multi-step, cross-tool workflows with visible actions and approval gates for sensitive operations.
LLM Research Papers: The 2026 List (January to May)
Ahead of AI (Raschka) · 8 days ago · Paper
Sebastian Raschka compiles a curated reference list of LLM papers he bookmarked from January through May 2026. The list is not comprehensive but is organized around topics useful for future articles, lectures, code examples, and research work. Public sections emphasize reasoning, RL, efficient inference, long context, agent systems, tool use, coding agents, diffusion language models, and serving infrastructure.
Arithmetic Without Numbers: How LLMs Do Math
Hacker News (AI keywords) · 9 days ago · Commentary
The article asks whether LLM arithmetic is memorization, heuristics, real computation, or experimental assistance. It summarizes Rune experiments that decode operations and operands from frozen Llama activations, then route them to Python under a no-parser rule. The strongest supported claim is narrow: activation-derived tool arguments worked in scoped audits, while residual-state JIT replacement, long-number generation, and cross-model transfer remain brittle.
How LLMs Actually Work
Hacker News (AI keywords) · 10 days ago · Tutorial
The article explains how modern LLMs convert text into token IDs, embeddings, and position-aware vectors before passing them through stacked transformer blocks. It covers attention, multi-head attention, KV cache, GQA, feed-forward networks, MoE, residual streams, normalization, and decoding. Its goal is educational: helping readers understand the common architecture behind many current model families and read model cards or papers more confidently.
New AI Infra Decacorns: Fireworks, Baseten, and OpenRouter ★ 78
Latent Space · 18 days ago · Business
AI infrastructure startups Fireworks and Baseten have reportedly reached decacorn valuations (US$10 billion or more), reflecting intense investor interest in developer-focused inference and deployment platforms. OpenRouter, the popular LLM API aggregator, is also on a rapid growth trajectory. This funding wave highlights a major capital shift toward cost-effective, developer-friendly API and hosting solutions.
Reachy Mini goes fully local
Hugging Face Blog · 18 days ago · Hardware
Hugging Face published a tutorial for running Reachy Mini conversations without cloud audio processing or API keys. The setup uses its speech-to-speech library as a cascaded VAD, STT, LLM, and TTS pipeline exposed through a Realtime API-compatible WebSocket. Recommended defaults include llama.cpp with Gemma 4, Silero VAD, Parakeet-TDT, and Qwen3-TTS, while allowing swaps to vLLM, MLX, Transformers, or hosted Responses API providers.
DeepInfra officially joins Hugging Face Inference Providers 🔥 ★ 72
Hugging Face Blog · 46 days ago · Release
Hugging Face's official blog has announced that DeepInfra — a well-known high-performance, low-cost serverless inference platform — has officially joined…
Hugging Face Open-Source Ecosystem Report: Spring 2026 Edition ★ 85
Hugging Face Blog · 89 days ago · Commentary
Hugging Face has published its Spring 2026 "State of Open Source AI" report, offering a comprehensive review of the explosive growth and paradigm shifts that…
Vercel Chat SDK launches an Adapter Directory, greatly simplifying multi-model and service integration
Vercel Changelog · 96 days ago · Release
Vercel has launched a new "Adapter Directory" for its Chat SDK. As…
Mixture of Experts (MoE) in Transformers, a technical analysis: principles, trade-offs, and implementation challenges ★ 82
Hugging Face Blog · 108 days ago · Tutorial
Mixture of Experts (MoE) has become the mainstream architecture for current large language models (LLMs). This article takes an in-depth look at how MoE…
Train AI models for free! Hugging Face teams up with Unsloth to launch free fine-tuning on Hugging Face Jobs ★ 85
Hugging Face Blog · 114 days ago · New Tool
Hugging Face's official blog has announced exciting news for the open-source AI community: Hugging Face has formed a deep partnership with Unsloth — the…
Mistral Large 3 is now available on Vercel AI Gateway
Vercel Changelog · 194 days ago · Release
Vercel announced in its official Changelog that Mistral AI's latest flagship large language model, Mistral Large 3, is now…
OVHcloud officially joins Hugging Face Inference Providers, emphasizing European data sovereignty and cost-effective compute ★ 72
Hugging Face Blog · 202 days ago · Release
Hugging Face has announced a new partnership with OVHcloud, Europe's leading cloud infrastructure provider, officially incorporating OVHcloud into Hugging Face…
Hugging Face Inference Providers welcomes a new partner: Public AI goes live 🔥 ★ 70
Hugging Face Blog · 270 days ago · Release
Hugging Face continues to expand its "Inference Providers" program, aimed at enabling developers to run open-source models from Hugging Face Hub in the…
Vercel publishes its "Open SDK Strategy": building a universal AI development standard across models and frameworks ★ 75
Vercel Changelog · 284 days ago · Opinion
Vercel recently published its "Open SDK Strategy," centered on shaping its widely popular Vercel AI SDK into an open, neutral, and highly interoperable…
Replicate launches a remote MCP server: explore and run models directly in Claude, Cursor, and VS Code ★ 75
Replicate Blog · 308 days ago · New Tool
Replicate has officially launched a remote MCP (Model Context Protocol) server. MCP is an open standard created by Anthropic that enables large language models…
Hugging Face launches AI Sheets: a spreadsheet tool for easily processing and labeling datasets with open-source AI models ★ 75
Hugging Face Blog · 310 days ago · New Tool
Hugging Face has officially launched a new tool called "AI Sheets," an intuitive spreadsheet tool designed specifically for dataset processing. It aims to make…
Accelerating diverse LLM deployments on Hugging Face with NVIDIA NIM ★ 80
Hugging Face Blog · 328 days ago · Release
Hugging Face and NVIDIA have announced a new collaboration to bring NVIDIA NIM (NVIDIA Inference Microservices) into the Hugging Face ecosystem, with the goal…
Groq officially joins Hugging Face Inference Providers, supporting ultra-fast open-source model inference ★ 75
Hugging Face Blog · 363 days ago · Release
Hugging Face announced a deep partnership with Groq, a chip company focused on ultra-fast AI inference, formally bringing Groq into the Hugging Face "Inference…
Dell Enterprise Hub helps enterprises easily build AI applications on-premises ★ 75
Hugging Face Blog · 387 days ago · Release
As enterprises place ever-increasing demands on data privacy, security, and regulatory compliance, deploying AI models on-premises has become the preferred…
Introducing HELMET: a next-generation benchmark for comprehensively evaluating long-context LLMs ★ 80
Hugging Face Blog · 424 days ago · Release
Background and pain points: moving beyond the overly simple "needle in a haystack" test. In recent years, the context window length supported by large…
Hugging Face's classic NLP Course officially becomes the LLM Course: a full upgrade for the large language model era ★ 85
Hugging Face Blog · 437 days ago · Tutorial
Hugging Face's "NLP Course" has long been a must-read classic for developers and researchers worldwide looking to enter the fields of Transformers and natural…
Accelerating large language model (LLM) inference with TGI on Intel Gaudi ★ 75
Hugging Face Blog · 443 days ago · Release
Hugging Face's official blog has announced that its widely adopted open-source large model inference framework, Text Generation Inference (TGI), now officially…
Groq, fal, and DeepInfra officially join Vercel Marketplace ★ 75
Vercel Changelog · 453 days ago · Release
Vercel has officially announced that three prominent AI infrastructure service providers — Groq, fal, and DeepInfra — have formally joined the Vercel…
Hugging Face Hub launches "Inference Providers": switch between multiple third-party high-performance inference providers in one click ★ 85
Hugging Face Blog · 502 days ago · Release
Hugging Face has officially launched the "Inference Providers" feature on the Hugging Face Hub — a major update designed to address the pain points developers…
