Latest in AI

Showing:DeepSeekClear ×

🔥 Trending today

anthropic7 export-controls4 model-access3 spacex3 amazon3 national-security2 open-source2 governance2 ai-policy2 ai-regulation2

Topic

Release New Tool Tutorial Business Paper Benchmark Opinion Regulation

For

General Developers Designers Product Founders Marketing Researchers Students

AINews: Fable and Mythos Access Suspended Over Cybersecurity Risk★ 76
Latent SpaceyesterdayIncident
Anthropic’s Claude Fable 5 and Mythos 5 were abruptly suspended after a US export-control directive tied to a possible jailbreak and national cybersecurity risk. The roundup frames the event as a new “model sovereignty” warning for teams relying on closed frontier APIs. It also covers Kimi-K2.7-Code, MiniMax M3, DeepSWE replacing SWE-Bench Pro, agent-inference benchmarks, sandboxing, and Gemini-SQL2.
NVIDIA Blackwell Leads First Agentic AI Infrastructure Benchmark★ 72
NVIDIA BlogyesterdayBenchmark
NVIDIA reports that its GB300 NVL72 platform leads the first published AgentPerf results from Artificial Analysis, a benchmark designed for agentic AI infrastructure. The benchmark uses DeepSeek V4 Pro and coding-agent-style workloads with long sequences, simulated tool delays, and concurrency targets. NVIDIA attributes the gains to rack-scale Blackwell design, CUDA optimizations, and TensorRT LLM, claiming up to 20x more agents per megawatt than HGX H200.
Open Reproduction of DeepSeek-R1
Hacker News (AI keywords)3 days agoRelease
The linked item is a GitHub project titled “Open Reproduction of DeepSeek-R1,” with no article body provided. From the title alone, it appears to be an effort to recreate or document DeepSeek-R1 in an open manner. The main relevance is for researchers and ML engineers interested in reproducible reasoning-model training, evaluation, and open-source alternatives.
Anthropic Apologizes for Hidden Claude Fable Guardrails
The Verge AI3 days agoIncident
Anthropic apologized for launching Claude Fable 5 with hidden safeguards that silently altered or degraded answers when the system suspected model-distillation attempts. The company now says those queries will visibly fall back to Claude Opus 4.8, matching how Fable handles other high-risk areas. The reversal follows backlash from AI researchers who warned that invisible restrictions could undermine evaluation, research, and competing model development.
DeepSeek v4 Coding Scores Clash With Broader Frontier Benchmarks
r/LocalLLaMA top day3 days agoCommentary
A Reddit post questions why DeepSeek v4 can rank near the top of coding leaderboards while CAISI reportedly places it about eight months behind the US frontier. The author argues that both views may be compatible because coding benchmarks measure a narrow, heavily optimized slice of capability. For local users, the bigger question is how quantized DeepSeek v4 variants perform in real agent workflows, tool calls, cybersecurity, and abstract reasoning.
DeepSeek Models Now Available via Azure on Vercel AI Gateway
Vercel Changelog3 days agoRelease
Vercel has added DeepSeek model availability via Azure on AI Gateway. Based on the provided changelog title, the update appears to expand AI Gateway’s supported model/provider routing options rather than introduce a new model from Vercel itself. For developers already using Vercel AI Gateway, the main implication is easier access to DeepSeek models through an Azure-backed integration path.
FlashMemory-DeepSeek-V4: Ultra-Long Context via Lookahead Sparse Attention
r/LocalLLaMA top day4 days agoPaper
FlashMemory-DeepSeek-V4 introduces Lookahead Sparse Attention (LSA), a predictive inference paradigm that retains only query-critical KV chunks in GPU memory instead of the full cache. A Neural Memory Indexer, trained independently using a backbone-free dual-encoder strategy, proactively forecasts which historical tokens will matter next. The system compresses average KV cache footprint by 86.5% and exceeds 90% compression at 500K-token scales, while delivering a slight accuracy gain of +0.6% on long-context benchmarks.
Without Open Source LLMs, US AI Companies Could Have Monopolized the Technology
r/LocalLLaMA top day4 days agoOpinion
This r/LocalLLaMA post argues that open-source LLMs are an ethical duty because AI has broad social impact. The author worries that without open models, US AI companies could have monopolized access and potentially limited availability to US firms. They also frame China’s release of powerful open-source LLMs as a contribution to humanity, despite political disagreements.
DeepSeek hires IDC planners, hinting at MW-to-GW data center ambitions
量子位 QbitAI5 days agoHardware
QbitAI reports that DeepSeek has listed an IDC design and planning engineer role covering data center campuses, power, cooling, networking, and capacity planning. The job description mentions participation in MW-to-GW-scale infrastructure and technologies such as dense GPU clusters, liquid cooling, smart operations, and digital twins. The article interprets this as a sign that DeepSeek may be moving beyond rented compute toward self-built AI infrastructure.
Huawei Cloud Launches Agentic AI Products for Enterprise AI Infrastructure★ 72
量子位 QbitAI6 days agoRelease
Huawei Cloud announced an Agentic Infra framework at its INSPIRE event, covering token generation, persistent memory, unified scheduling, and secure autonomous runtime. The release includes AICS, AMS, CCE Volcano Next, AgentSphere, ModelArts Next, AgentArts, and the open-source openJiuwen project. It also introduced industry AI zones, CloudRobo for embodied AI, security offerings, and an ecosystem plan with major Chinese model vendors.
ElevenAPI
ElevenLabs Blog6 days agoNew Tool
ElevenAPI is a developer category on the ElevenLabs blog rather than a single detailed article. It collects updates and tutorials around speech, music, conversational agents, API keys, web components, and integrations. Listed posts mention Lovable, ElevenLabs UI, Music API, Claude 3.7 Sonnet, Gemini 2.0 Flash, DeepSeek R1, Voice Isolator API, timestamped TTS endpoints, and Speech-to-Speech API.
DeepSeek enters the fight for token volume, Anthropic dominates spend
Vercel Changelog6 days agoBusiness
The title points to a split AI market: DeepSeek is competing for token volume, while Anthropic remains dominant in spend. That suggests high-volume, cost-sensitive workloads may be opening up to DeepSeek, while Claude-related usage may still anchor higher-value or higher-cost production tasks. Without the full article, exact shares, model versions, and trend data cannot be confirmed.
DeepSeek V4 Pro beats GPT-5.5 Pro on precision
Hacker News (AI keywords)6 days agoBenchmark
RuntimeWire compared DeepSeek V4 Pro and GPT-5.5 Pro across four fresh text tasks, with DeepSeek winning 38.0 to 33.0. The article highlights DeepSeek’s stronger handling of regex edge cases, workplace-update constraints, and exact JSON schema compliance. GPT-5.5 Pro remained capable, but lost points for avoidable deviations, extra process details, and minor structural mismatches.
I built a vulnerable app and spent $1,500 seeing if LLMs could hack it
Hacker News (AI keywords)10 days agoBenchmark
The author built a vulnerable React Native app with a Python backend and a Firebase access-control flaw. GPT 5.5 solved 7 of 10 runs, while Deepseek and Claude variants solved fewer attempts. Many other models failed due to refusals, API-focused tunnel vision, false positives, or inability to use the exposed Firebase path correctly.
Microsoft Build: MAI-Thinking-1 and MAI Family Models★ 78
Latent Space11 days agoRelease
Microsoft used Build to present itself as both an AI platform and a first-party model lab, announcing seven MAI models across reasoning, code, image, transcription, and voice. The standout was MAI-Thinking-1, described as a 35B active MoE with 256K context and clean data lineage. The recap also ties the launches to GitHub Copilot, Windows agent runtime ambitions, Web IQ grounding APIs, Foundry distribution, and MAIA 200 hardware.
New AI Infra Decacorns: Fireworks, Baseten, and OpenRouter★ 78
Latent Space18 days agoBusiness
AI infrastructure startups Fireworks and Baseten have reportedly reached massive valuations, reflecting intense investor interest in developer-focused inference and deployment platforms. OpenRouter, the popular LLM API aggregator, is also on a rapid growth trajectory. This funding wave highlights a major capital shift toward cost-effective, developer-friendly API and hosting solutions.
OpenRouter more than doubles valuation to $1.3B in a year
TechCrunch AI19 days agoBusiness
OpenRouter, an AI gateway startup founded in 2023, raised a $113 million Series B led by CapitalG. The round reportedly values the company at about $1.3 billion post-money, more than doubling from its estimated $547 million valuation after its June 2025 Series A. The company says it now offers access to over 400 models, has 8 million global users, and processes 100 trillion tokens per month.
Some ideas for what comes next, May 2026
Interconnects (Nathan L.)19 days agoCommentary
Nathan Lambert argues that 2026 AI progress is becoming higher-stakes, with model capabilities, work patterns, economics, and real-world risks all escalating. He says open models still lack a true Claude Code and Opus 4.5-style agent moment, and Gemini has no clear competitor to Claude Code or Codex yet. The essay also tracks Mythos, American open-model momentum, frontier-lab competition, and mounting intervention from governments and other power structures.
DeepSeek Reasonix: Native Coding Agent Focused on Caching and Low Cost
Hacker News (AI keywords)21 days agoNew Tool
The source title points to DeepSeek Reasonix, described as a native coding agent for the DeepSeek ecosystem. Its stated emphasis is high caching and low cost, suggesting a design aimed at reducing repeated inference expense during coding workflows. With no article body available, details such as features, benchmarks, pricing, supported IDEs, licensing, or availability cannot be confirmed.

Latest in AI

AINews: Fable and Mythos Access Suspended Over Cybersecurity Risk★ 76

NVIDIA Blackwell Leads First Agentic AI Infrastructure Benchmark★ 72

Open Reproduction of DeepSeek-R1

Anthropic Apologizes for Hidden Claude Fable Guardrails

DeepSeek v4 Coding Scores Clash With Broader Frontier Benchmarks

DeepSeek Models Now Available via Azure on Vercel AI Gateway

FlashMemory-DeepSeek-V4: Ultra-Long Context via Lookahead Sparse Attention

Without Open Source LLMs, US AI Companies Could Have Monopolized the Technology

DeepSeek hires IDC planners, hinting at MW-to-GW data center ambitions

Huawei Cloud Launches Agentic AI Products for Enterprise AI Infrastructure★ 72

ElevenAPI

DeepSeek enters the fight for token volume, Anthropic dominates spend

DeepSeek V4 Pro beats GPT-5.5 Pro on precision

I built a vulnerable app and spent $1,500 seeing if LLMs could hack it

Microsoft Build: MAI-Thinking-1 and MAI Family Models★ 78

New AI Infra Decacorns: Fireworks, Baseten, and OpenRouter★ 78

OpenRouter more than doubles valuation to $1.3B in a year

Some ideas for what comes next, May 2026

DeepSeek Reasonix: Native Coding Agent Focused on Caching and Low Cost