Latest in AI

Showing:reasoningResearchersClear ×

🔥 Trending today

anthropic7 export-controls4 model-access3 spacex3 amazon3 national-security2 open-source2 governance2 ai-policy2 ai-regulation2

Topic

Release New Tool Tutorial Business Paper Benchmark Opinion Regulation

For

General Developers Designers Product Founders Marketing Researchers Students

Introducing Mistral Small 4★ 76
Mistral AI News6 days agoRelease
Mistral AI introduced Mistral Small 4 as the next major release in the Mistral Small family. It combines reasoning, multimodal, and agentic coding capabilities into one open model with configurable reasoning effort. The model uses a MoE architecture, supports a 256k context window and text-image inputs, and is available through Mistral API, AI Studio, Hugging Face, NVIDIA NIM, and common inference stacks.
Introducing Mistral Small 4★ 78
Mistral AI News6 days agoRelease
Mistral Small 4 is the next major release in the Mistral Small family, unifying Magistral-style reasoning, Pixtral-style multimodality, and Devstral-style coding agents. It uses a MoE architecture with 119B total parameters, 6B active parameters per token, a 256k context window, and configurable reasoning effort. The model is available via Mistral API, AI Studio, Hugging Face, open-source serving stacks, and NVIDIA deployment options.
Qwen 3.6 27B DeepSWE Benchmark Results Highlight Gap Between Local and Closed-Source Models
r/LocalLLaMA top day6 days agoBenchmark
A community benchmark of Qwen 3.6 27B on DeepSWE yielded a score of 1.79% (18/20th place), slightly outperforming Haiku 4.5. Run on a single RTX 6000 Blackwell GPU via vLLM with reasoning enabled, the test averaged 32 minutes and 44k output tokens per task. The author notes that while Qwen 3.6 27B represents a 'poor man's local SOTA,' the massive gap compared to frontier closed models suggests local LLMs are struggling to keep pace in complex coding.
[AINews] 所有模型實驗室都已轉型為 Agent 實驗室★ 78
Latent Space22 days agoCommentary
This AINews feature from Latent Space argues that the AI industry is undergoing a profound transformation — "all the model labs are now agent labs." Over the…
OpenAI GPT-next 僅花費不到 1,000 美元，便證偽了高達 80 年歷史的 Erdős 平面單位距離猜想★ 90
Latent Space24 days agoRelease
A historic and landmark breakthrough has arrived at the intersection of artificial intelligence and mathematics. According to Latent Space, OpenAI's…
Gemini 3.5 發布：具備行動力的前沿智能，主打複雜代理型工作流 (Agentic Workflows)★ 85
Google DeepMind Blog29 days agoRelease
Google DeepMind has announced the launch of its next-generation AI model, Gemini 3.5, positioned as "frontier intelligence with action." This announcement…
[AINews] 微調的終結？探討 Fine-tuning 在大模型時代的未來與轉變★ 75
Latent Space32 days agoOpinion
As AI technology continues to iterate at a rapid pace, the developer community is confronting a profound rethinking of the question: "Is fine-tuning heading…
OpenAI 物理學家 Alex Lupsasca 專訪：GPT-5.x 如何在理論物理與量子重力領域推導出全新研究成果 (Doing Vibe Physics)★ 85
Latent Space39 days agoCommentary
This interview records an in-depth conversation between OpenAI theoretical physicist Alex Lupsasca and Latent Space, centered on how GPT-5.x — OpenAI's…
未來的預兆：GPT-5.5 與 AI 指數型成長的下一步★ 85
One Useful Thing (Mollick)51 days agoCommentary
Wharton School professor Ethan Mollick, writing in his well-known newsletter "One Useful Thing," has published a profound analysis of GPT-5.5. He describes…
預測 2026 年年中：我對開源 AI 模型的幾點賭注與開閉源差距分析★ 75
Interconnects (Nathan L.)60 days agoOpinion
In this forward-looking article on the state of AI in mid-2026, Interconnects founder Nathan Lambert takes a deep dive into the dynamic gap between open-weight…
深入解析 VAKRA：IBM Research 評估 AI Agent 推理、工具調用與失敗模式的全新基準測試★ 75
Hugging Face Blog60 days agoRelease
As generative AI technology has evolved, the industry's focus has shifted from pure "Large Language Models (LLMs)" to "AI Agents" capable of autonomously…
Gemma 4：同等參數規模下最強大的開源模型，專為進階推理與 Agent 工作流打造★ 85
Google DeepMind Blog73 days agoRelease
Google DeepMind has today officially released its latest generation of open-source model series — Gemma 4. The company positions it as "the smartest and most…
損耗性自我提升：為什麼 AI 自我改進是真的，但不會導致「急遽暴漲」★ 75
Interconnects (Nathan L.)83 days agoOpinion
This article takes a deep dive into one of the most contentious topics in artificial intelligence: AI "self-improvement" and whether it will trigger a "fast…
Hugging Face 開源生態報告：2026 春季版★ 85
Hugging Face Blog89 days agoCommentary
Hugging Face has published its Spring 2026 "State of Open Source AI" report, offering a comprehensive review of the explosive growth and paradigm shifts that…
事情的輪廓：我們目前所處的 AI 階段與未來展望 (The Shape of the Thing)★ 85
One Useful Thing (Mollick)94 days agoOpinion
Wharton School professor Ethan Mollick, in his latest article "The Shape of the Thing," sketches out a clear picture of the current state of AI technological…
Google DeepMind 發表 Gemini 3.1 Pro：專為複雜任務打造的更智慧模型★ 85
Google DeepMind Blog115 days agoRelease
Google DeepMind officially released a brand-new model today (February 19, 2026): "Gemini 3.1 Pro." According to the initial official disclosure, the core…
代理人時代的 AI 選擇指南：不再只是聊天機器人★ 85
One Useful Thing (Mollick)116 days agoTutorial
Prominent scholar Ethan Mollick, in his latest article, points out that we have officially crossed beyond the era of simple "Chatbots" and entered what he…
開源模型陷入「永久追趕」：開源與閉源的差距、蒸餾、創新週期與開源的勝算★ 80
Interconnects (Nathan L.)117 days agoOpinion
This article by Nathan Lambert takes a deep dive into the tangled competitive dynamics between open-source and closed-source AI models. Lambert argues that…
Google DeepMind 推出 Gemini 3 Deep Think：專為科學、研究與工程設計的深度推理模式★ 90
Google DeepMind Blog122 days agoRelease
On February 12, 2026, Google DeepMind announced the launch of its most advanced reasoning mode update — Gemini 3 Deep Think. This model is Google's…
Gemini Deep Think 加速數學與科學發現：學術研究展現其跨領域的強大推理影響力★ 82
Google DeepMind Blog125 days agoCommentary
Google DeepMind recently published an article exploring how its deep-reasoning model, "Gemini Deep Think," is transforming the landscape of mathematics and…
中國開源 AI 生態系的架構抉擇：超越 DeepSeek 的下一步★ 85
Hugging Face Blog138 days agoCommentary
This blog post from Hugging Face reviews the full year of technical evolution since the "DeepSeek Moment" at the start of 2025 — the release of DeepSeek-V3 and…
「DeepSeek 時刻」一週年：開源 AI 的典範轉移與變革回顧★ 85
Hugging Face Blog145 days agoCommentary
The DeepSeek-V3 and R1 models released in January 2025 have been hailed as the "DeepSeek Moment" in the AI world. This upheaval not only shattered the myth…
NVIDIA 推出 Cosmos Reason 2：為具身智能與物理 AI 注入先進推理能力★ 85
Hugging Face Blog159 days agoRelease
NVIDIA and Hugging Face have jointly announced the launch of the new Cosmos Reason 2 model, marking a major breakthrough in the fields of Physical AI and…
Google 2025 年度回顧：改變科學與 AI 未來的 8 大研究突破領域★ 85
Google DeepMind Blog173 days agoCommentary
As 2025 draws to a close, Google DeepMind has published its annual review, showcasing eight breakthrough research areas in artificial intelligence. This year…
Apriel-H1：揭示蒸餾高效推理模型的驚人關鍵★ 75
Hugging Face Blog207 days agoRelease
With the successive emergence of models with powerful "reasoning" capabilities — such as OpenAI o1, o3, and DeepSeek-R1 — the challenge of reducing the…
Google DeepMind 發表全新一代 Gemini 3：開啟主動式 AI 與超強推理的全新智能時代★ 98
Google DeepMind Blog208 days agoRelease
Google DeepMind officially unveiled its latest flagship AI model — Gemini 3 — in November 2025. This marks a new milestone for Google in the field of…
Google DeepMind 推出 SIMA 2：由 Gemini 驅動、能在 3D 虛擬世界中與你一同遊玩、推理與學習的 AI 代理★ 85
Google DeepMind Blog213 days agoRelease
Google DeepMind has officially introduced SIMA 2 (Scalable Instructable Multiworld Agent 2). Compared to its predecessor, the most significant transformation…
Google DeepMind 啟動「AI for Math Initiative」：攜手全球頂尖機構，加速數學科學新發現★ 75
Google DeepMind Blog228 days agoRelease
Google DeepMind has officially announced the launch of the "AI for Math Initiative," a major program aimed at deeply integrating artificial intelligence into…
搭載 Deep Think 的進階版 Gemini 正式在國際奧林匹亞數學競賽中達到金牌標準★ 90
Google DeepMind Blog233 days agoRelease
The International Mathematical Olympiad (IMO) has been held annually since 1959 and is the most prestigious and difficult mathematics competition for high…
Gemini 2.5 Deep Think 於 ICPC 國際大學生程式設計競賽世界總決賽中達到金牌水準★ 85
Google DeepMind Blog233 days agoRelease
Google DeepMind has announced that its latest reasoning model, "Gemini 2.5 Deep Think," has achieved gold-medal-level performance at the International…

Page 1Next →

Latest in AI

Introducing Mistral Small 4★ 76

Introducing Mistral Small 4★ 78

Qwen 3.6 27B DeepSWE Benchmark Results Highlight Gap Between Local and Closed-Source Models

[AINews] 所有模型實驗室都已轉型為 Agent 實驗室★ 78

OpenAI GPT-next 僅花費不到 1,000 美元，便證偽了高達 80 年歷史的 Erdős 平面單位距離猜想★ 90

Gemini 3.5 發布：具備行動力的前沿智能，主打複雜代理型工作流 (Agentic Workflows)★ 85

[AINews] 微調的終結？探討 Fine-tuning 在大模型時代的未來與轉變★ 75

OpenAI 物理學家 Alex Lupsasca 專訪：GPT-5.x 如何在理論物理與量子重力領域推導出全新研究成果 (Doing Vibe Physics)★ 85

未來的預兆：GPT-5.5 與 AI 指數型成長的下一步★ 85

預測 2026 年年中：我對開源 AI 模型的幾點賭注與開閉源差距分析★ 75

深入解析 VAKRA：IBM Research 評估 AI Agent 推理、工具調用與失敗模式的全新基準測試★ 75

Gemma 4：同等參數規模下最強大的開源模型，專為進階推理與 Agent 工作流打造★ 85

損耗性自我提升：為什麼 AI 自我改進是真的，但不會導致「急遽暴漲」★ 75

Hugging Face 開源生態報告：2026 春季版★ 85

事情的輪廓：我們目前所處的 AI 階段與未來展望 (The Shape of the Thing)★ 85

Google DeepMind 發表 Gemini 3.1 Pro：專為複雜任務打造的更智慧模型★ 85

代理人時代的 AI 選擇指南：不再只是聊天機器人★ 85

開源模型陷入「永久追趕」：開源與閉源的差距、蒸餾、創新週期與開源的勝算★ 80

Google DeepMind 推出 Gemini 3 Deep Think：專為科學、研究與工程設計的深度推理模式★ 90

Gemini Deep Think 加速數學與科學發現：學術研究展現其跨領域的強大推理影響力★ 82

中國開源 AI 生態系的架構抉擇：超越 DeepSeek 的下一步★ 85

「DeepSeek 時刻」一週年：開源 AI 的典範轉移與變革回顧★ 85

NVIDIA 推出 Cosmos Reason 2：為具身智能與物理 AI 注入先進推理能力★ 85

Google 2025 年度回顧：改變科學與 AI 未來的 8 大研究突破領域★ 85

Apriel-H1：揭示蒸餾高效推理模型的驚人關鍵★ 75

Google DeepMind 發表全新一代 Gemini 3：開啟主動式 AI 與超強推理的全新智能時代★ 98

Google DeepMind 推出 SIMA 2：由 Gemini 驅動、能在 3D 虛擬世界中與你一同遊玩、推理與學習的 AI 代理★ 85

Google DeepMind 啟動「AI for Math Initiative」：攜手全球頂尖機構，加速數學科學新發現★ 75

搭載 Deep Think 的進階版 Gemini 正式在國際奧林匹亞數學競賽中達到金牌標準★ 90

Gemini 2.5 Deep Think 於 ICPC 國際大學生程式設計競賽世界總決賽中達到金牌水準★ 85