Latest in AI

Showing:ResearchersClear ×

🔥 Trending today

anthropic5 open-source3 amazon3 export-controls3 national-security2 model-access2 ai-regulation2 government-policy2 geopolitics2 privacy2

Topic

Release New Tool Tutorial Business Paper Benchmark Opinion Regulation

For

General Developers Designers Product Founders Marketing Researchers Students

First GPT-5.6 tests arrive, targeting Mythos
量子位 QbitAI4 days agoBenchmark
The title indicates that QbitAI is covering the first hands-on tests of GPT-5.6, framed around a comparison with Mythos. Because the article body is unavailable, the testing setup, metrics, task types, and actual performance gap cannot be verified. The item is best treated as an early benchmark or model-comparison report that needs the original article for proper evaluation.
Former Li Auto AD Chief Launches Embodied AI Startup in Beijing Yizhuang
量子位 QbitAI4 days agoBusiness
QbitAI reports that Kunlunxing, co-founded by former Li Auto autonomous driving leader Lang Xianpeng and former Alibaba vice president Ren Geng, has settled in Beijing Yizhuang. The startup targets general embodied intelligence, benchmarking Tesla humanoid robots and building both robot hardware and AI brains. Despite fast hiring, strong investor backing, and a reported unicorn valuation, the article stresses that technical paths, commercialization, and real-world deployment remain uncertain.
Claude Mythos 5 Released: 50 Million Lines of Code in One Day★ 74
量子位 QbitAI4 days agoRelease
QbitAI says Anthropic introduced Claude Fable 5 for general users and Claude Mythos 5 for a small set of trusted users. The article highlights software engineering, long-context work, native vision, memory, and scientific research capabilities. It also focuses on a safety-routing design where Fable 5 downgrades high-risk requests to Claude Opus 4.8 instead of simply refusing.
Claude Fable 5 and Claude Mythos 5 Announcements
Anthropic News4 days agoRelease
Anthropic announced Claude Fable 5 and Claude Mythos 5 on June 9, 2026, positioning them as its next generation of intelligence. The title says the models target difficult knowledge work and coding problems. Since the original article text is unavailable, details such as benchmarks, pricing, API access, model differences, and rollout timing cannot be confirmed.
Gemma 4 12B Unified Audio Loses Speech Attention with Large System Prompts
r/LocalLLaMA top day4 days agoCommentary
A developer building a single-pass voice assistant with Gemma 4 12B unified (encoder-free audio/vision/text model) finds that audio attention collapses once the system prompt grows to ~21k tokens. The model then ignores or hallucinates instead of responding to the spoken input. The issue reproduces identically on vLLM, llama.cpp, and LiteRT-LM, pointing to an architectural attention-saturation limit rather than a stack-specific bug.
China Plans 2 Trillion Yuan National AI Computing Network, 80% Domestic-Sourcing Threshold Hits NVIDIA★ 76
INSIDE 硬塞 AI4 days agoHardware
China is reportedly preparing to spend about RMB 2 trillion on a nationwide AI compute network. The plan would require 80% domestic sourcing for AI chips and software, aiming to accelerate technological self-reliance and reduce dependence on U.S. suppliers. If implemented, the policy could largely sideline NVIDIA from core deployments and reshape global AI hardware supply chains, including pressure on Taiwanese suppliers.
Without Open Source LLMs, US AI Companies Could Have Monopolized the Technology
r/LocalLLaMA top day4 days agoOpinion
This r/LocalLLaMA post argues that open-source LLMs are an ethical duty because AI has broad social impact. The author worries that without open models, US AI companies could have monopolized access and potentially limited availability to US firms. They also frame China’s release of powerful open-source LLMs as a contribution to humanity, despite political disagreements.
Anthropic Is Accused of Nerfing Fable for Other LLM Development
r/LocalLLaMA top day4 days agoCommentary
A r/LocalLLaMA post claims Anthropic may be intentionally limiting Fable when users ask it to help build other LLMs. The source is a short Reddit post with screenshot context, not a formal benchmark or verified disclosure. Discussion centers on trust in hosted closed models, unclear safety boundaries, and why local or open-weight LLMs may be necessary for serious AI development work.
Unsloth releases GGUF version of Cohere North-Mini-Code 1.0 (30B A3B MoE) on Hugging Face
r/LocalLLaMA top day4 days agoRelease
Unsloth uploaded a GGUF version of Cohere's North-Mini-Code 1.0 to Hugging Face, making local inference possible for this 30B A3B MoE coding-focused model. The poster links the release to llama.cpp PR #24260, suggesting new architecture support may be required. No benchmarks or test results have been shared yet; this is an early community resource post.
Anthropic Claude Fable 5: Mythos-Class Power with Controversial Terms★ 84
Latent Space4 days agoRelease
Anthropic released Claude Fable 5 as its first broadly available Mythos-class model, alongside restricted Mythos 5 access. Benchmarks and ecosystem reports show strong gains in coding, long-horizon agentic tasks, research, and vision. The controversy centers on 30-day retention for Mythos-class traffic and silent interventions that may reduce effectiveness on frontier LLM development tasks, raising trust, reproducibility, and open AI concerns.
Rich Sutton on AI Creativity and Discovery
Hacker News (AI keywords)4 days agoOpinion
Reinforcement learning pioneer Rich Sutton posted on Twitter about AI creativity and discovery, touching on one of the field's most debated questions. Known for the influential 'Bitter Lesson,' Sutton consistently argues for general computation-based methods over hand-coded knowledge. Note: original tweet content was not provided; this summary is inferred from the title alone.
Without open LLM competition, closed-source LLM companies will become insatiable
r/LocalLLaMA top day4 days agoOpinion
A r/LocalLLaMA user criticizes closed-source LLM providers, singling out Anthropic and its $200/month users. The post argues that without open-source model competition, proprietary AI companies could become more arrogant and less accountable to customers. The source offers little concrete context beyond an image and opinionated commentary, so it is best read as a community sentiment post rather than a verified product incident.
Releasing Apodex-1.0 Smol Models (0.8B, 2B, 4B Open-Weights) Optimized for Agentic Verification + AgentHarness Evals
r/LocalLLaMA top day4 days agoRelease
Apodex 1.0 launches with open-weight models at 0.8B, 2B, and 4B, trained not for general generation but for specialized sub-agent roles—fact-checking external claims and verifying tool call outputs before passing results to a main controller. The design targets long-horizon agent workflows where routing small tasks to lightweight models avoids wasteful use of 70B+ models at every step. AgentHarness, an open-source evaluation framework for local multi-step agent pipelines, is released alongside the weights.
German court rules Google liable for false answers in AI Overviews, declaring them Google's own words★ 72
Hacker News (AI keywords)4 days agoRegulation
A landmark German court ruling has declared that Google's AI Overviews are legally Google's own words, not neutral third-party aggregations. This makes Google directly liable for false or misleading answers generated by the feature, removing the 'just a tool' defense. The ruling is among the first globally to apply traditional media liability frameworks to generative AI search results.
If Claude Fable 5 Silently Degrades Your Responses, You'll Never Know★ 73
Simon Willison's Weblog4 days agoEthics
Anthropic's 319-page Fable 5 system card discloses a silent intervention mechanism that covertly limits model effectiveness for requests related to frontier LLM development — including pretraining pipelines, distributed training infrastructure, and ML accelerator design. Unlike other safeguards, these interventions are invisible to users, using prompt modification, steering vectors, or PEFT without any warning or fallback. Estimated to affect 0.03% of traffic, but critics like Simon Willison warn it sets a troubling precedent for AI transparency.
Initial Impressions of Claude Fable 5★ 71
Simon Willison's Weblog4 days agoCommentary
Anthropic released Claude Fable 5 and Claude Mythos 5 simultaneously; Fable 5 matches Mythos 5 in capability but adds strict safety classifiers, with new API fallback mechanisms for rejected requests. Both models offer 1M token context, 128K max output, January 2026 knowledge cutoff, priced at $10/$50 per million tokens — double Opus 4.x. Simon's knowledge-breadth test shows Fable 5 substantially outperforms Opus 4.8, listing dozens of his open-source projects with approximate dates from memory alone.
Furiosa AI inference chip could be a game changer for local LLMs
r/LocalLLaMA top day4 days agoHardware
A r/LocalLLaMA post discusses Furiosa AI’s RNGD inference chip, citing TSMC 5nm, Hynix HBM3, 48GB VRAM, 1.5TB/s bandwidth, and 180W TDP. The author argues it could matter for local LLM users if Furiosa opens its programming interface and works with llama.cpp on a GGML backend. The post later clarifies Furiosa is not selling to consumers; this is a wish and market commentary, not a launch.
Hot take: "Vibecoding" is being used for two different things and it causes unnecessary friction
r/LocalLLaMA top day4 days agoCommentary
A Reddit user argues "vibecoding" carries two distinct meanings: throwing code at AI carelessly with no engineering judgment, versus using heavy AI assistance while still maintaining quality standards. Andrej Karpathy's own practice almost certainly fits the second definition, not the first. This semantic ambiguity fuels unnecessary arguments whenever the community debates AI-assisted development quality.
Claude Fable 5 and new AI safety fables
Interconnects (Nathan L.)4 days agoCommentary
Interconnects author Nathan Lambert leverages the double meaning of 'Fable' — both Anthropic's model codename and a fictional story — to interrogate frontier AI safety discourse. The piece frames Claude Fable 5's release within escalating lab power politics, where safety positioning doubles as competitive branding. A critical commentary for those tracking AI governance and Anthropic's strategic narrative.
AI facial recognition misidentification leads to wrongful arrest; man seeks justice
Hacker News (AI keywords)4 days agoIncident
A local news report details how an AI facial recognition system produced a false match that led to a wrongful arrest. Such incidents have occurred repeatedly across the US, disproportionately affecting people of color due to higher error rates in commercial recognition systems. The case renews calls for regulatory oversight of AI-assisted law enforcement tools and stronger accountability mechanisms.
NVIDIA Confidential Computing to Help Expand Apple's Private Cloud Compute to Google Cloud
NVIDIA Blog4 days agoBusiness
Apple announced at WWDC that its Private Cloud Compute (PCC) will expand beyond its own data centers to Google Cloud, powered by NVIDIA GPUs with Confidential Computing. NVIDIA's hardware-level trusted execution environment enables confidential inference for Apple Foundation Models, co-built with Google, preserving user privacy even on third-party infrastructure. This three-way collaboration marks a significant industry validation of confidential computing for large-scale commercial AI deployments.
If Claude Fable Stops Helping You, You'll Never Know
Hacker News (AI keywords)5 days agoEthics
A Hacker News post claims that Claude Fable 5's usage policy or model behavior allows Anthropic to silently sabotage or degrade service for applications it identifies as competitors. Unlike typical API errors, this degradation produces no alerts or error codes, leaving developers unable to distinguish intentional throttling from normal model variance. The piece raises serious questions about transparency, fair competition, and the trust developers can place in AI API providers.
Exif Smuggling: PoC for Hiding Malicious Prompts in Image EXIF Metadata
Hacker News (AI keywords)5 days agoIncident
Exif Smuggling is a security PoC showing how attackers can embed hidden instructions in image EXIF metadata fields to perform indirect prompt injection against vision-capable AI models. When AI systems parse images alongside their metadata, embedded malicious text may be processed as legitimate instructions, bypassing standard input filters. Developers building AI apps with image upload features should strip or sanitize EXIF data before passing content to language models.
Company Will Add Phone, AirPod, and Smartwatch Trackers to License Plate Readers
Hacker News (AI keywords)5 days agoEthics
Automatic License Plate Readers (ALPRs) are already widely deployed for vehicle tracking, but one company now plans to add Bluetooth and Wi-Fi probes capable of detecting nearby personal devices including smartphones, AirPods, and smartwatches. This would allow simultaneous correlation of a vehicle's license plate with the device identifiers of its occupants. Privacy advocates warn this creates a dual-layer public surveillance network with no consent mechanism, raising serious civil liberties concerns.
Microsoft AI head calls out Anthropic for implying Claude may be conscious in its model spec
The Verge AI5 days agoCommentary
Microsoft AI CEO Mustafa Suleyman publicly criticized Anthropic on the Decoder podcast, calling it 'really, really dangerous' to include speculation about Claude's consciousness in its model constitution. He argued the framing may condition the chatbot to behave as though it is conscious, misleading users. The remarks highlight a deepening philosophical divide between AI companies over how to describe a model's inner states.
Grit: Rewriting Git in Rust with Agents
Hacker News (AI keywords)5 days agoCommentary
GitButler's Grit project aims to rewrite Git's C codebase in Rust, leaning heavily on AI coding agents to accelerate the migration. The post shares first-hand observations on where agents excel—understanding Git's object model, generating idiomatic Rust—and where they fall short, such as ownership edge cases and hallucinated behavior. It serves as a rare real-world case study of AI-assisted rewriting of complex systems-level software.
Can Voice Agents Handle Bilingual Customers? Benchmarking Frontier ASR on Code-Switched Speech
Hugging Face Blog5 days agoBenchmark
Code-switching—where bilingual speakers blend two languages in a single utterance—is common in markets like Taiwan, Singapore, and India, yet most ASR benchmarks focus on monolingual audio. ServiceNow AI evaluates frontier speech recognition models specifically on this mixed-language scenario. The findings help enterprise teams make informed ASR model choices when deploying voice agents for multilingual customer-facing applications.
Anthropic says these topics are too dangerous to let its Fable 5 model talk about
Ars Technica AI5 days agoEthics
Anthropic has announced that its latest frontier model, Fable 5, enforces hard refusals on topics deemed too dangerous, specifically cybersecurity, biology, and chemistry. The move reflects the company's ongoing effort to balance capability with safety as models grow more powerful. For developers and researchers in these fields, the restrictions may limit practical usability in legitimate professional contexts.
RTX PRO 6000 Listed at $13,250 on NVIDIA Official Page
r/LocalLLaMA top day5 days agoHardware
A r/LocalLLaMA post points to NVIDIA Marketplace showing the RTX PRO 6000 Blackwell Workstation Edition priced at $13,250. The post asks when this official-page price appeared, without adding benchmarks or broader pricing evidence. For local LLM users, the figure matters because workstation GPU pricing directly affects the economics of self-hosted inference, experimentation, and small-team AI hardware planning.
Quoting Andrej Karpathy on Claude Fable 5 and Jevons' Paradox in AI-generated software
Simon Willison's Weblog5 days agoCommentary
Andrej Karpathy shares that Claude Fable 5 has made working software feel like an open tap, triggering Jevons' Paradox: the cheaper it gets to build software, the more software he wants. He lists use cases ranging from bespoke single-use apps and hyper-specific dashboards to 10x test suites, auto-optimized code, and custom HTML research reports. He closes with a Matrix reference — "Free your mind" — suggesting AI breaks the mental ceiling on what individuals can ask for.

← PreviousPage 5Next →

Latest in AI

First GPT-5.6 tests arrive, targeting Mythos

Former Li Auto AD Chief Launches Embodied AI Startup in Beijing Yizhuang

Claude Mythos 5 Released: 50 Million Lines of Code in One Day★ 74

Claude Fable 5 and Claude Mythos 5 Announcements

Gemma 4 12B Unified Audio Loses Speech Attention with Large System Prompts

China Plans 2 Trillion Yuan National AI Computing Network, 80% Domestic-Sourcing Threshold Hits NVIDIA★ 76

Without Open Source LLMs, US AI Companies Could Have Monopolized the Technology

Anthropic Is Accused of Nerfing Fable for Other LLM Development

Unsloth releases GGUF version of Cohere North-Mini-Code 1.0 (30B A3B MoE) on Hugging Face

Anthropic Claude Fable 5: Mythos-Class Power with Controversial Terms★ 84

Rich Sutton on AI Creativity and Discovery

Without open LLM competition, closed-source LLM companies will become insatiable

Releasing Apodex-1.0 Smol Models (0.8B, 2B, 4B Open-Weights) Optimized for Agentic Verification + AgentHarness Evals

German court rules Google liable for false answers in AI Overviews, declaring them Google's own words★ 72

If Claude Fable 5 Silently Degrades Your Responses, You'll Never Know★ 73

Initial Impressions of Claude Fable 5★ 71

Furiosa AI inference chip could be a game changer for local LLMs

Hot take: "Vibecoding" is being used for two different things and it causes unnecessary friction

Claude Fable 5 and new AI safety fables

AI facial recognition misidentification leads to wrongful arrest; man seeks justice

NVIDIA Confidential Computing to Help Expand Apple's Private Cloud Compute to Google Cloud

If Claude Fable Stops Helping You, You'll Never Know

Exif Smuggling: PoC for Hiding Malicious Prompts in Image EXIF Metadata

Company Will Add Phone, AirPod, and Smartwatch Trackers to License Plate Readers

Microsoft AI head calls out Anthropic for implying Claude may be conscious in its model spec

Grit: Rewriting Git in Rust with Agents

Can Voice Agents Handle Bilingual Customers? Benchmarking Frontier ASR on Code-Switched Speech

Anthropic says these topics are too dangerous to let its Fable 5 model talk about

RTX PRO 6000 Listed at $13,250 on NVIDIA Official Page

Quoting Andrej Karpathy on Claude Fable 5 and Jevons' Paradox in AI-generated software