Latest in AI

Showing:ResearchersOtherClear ×

🔥 Trending today

anthropic5 open-source3 amazon3 export-controls3 national-security2 model-access2 ai-regulation2 government-policy2 geopolitics2 privacy2

Topic

Release New Tool Tutorial Business Paper Benchmark Opinion Regulation

For

General Developers Designers Product Founders Marketing Researchers Students

I bundled a fully local LLM inside my Unity game
r/LocalLLaMA top day6 days agoRelease
A developer shared a Unity game, Simulation Simulator, that bundles a local LLM with no internet, cloud service, or API key required. The game is a campfire chat simulator about DMT, simulation theory, and a monitor-headed friend, with five endings driven by natural AI interaction. The author sees this as a path toward richer NPCs, while noting local TTS and translation are still too slow for smooth gameplay.
Xiaomi Claims 1,000+ TPS on a 1T Model Using a Standard 8-GPU Server★ 72
r/LocalLLaMA top day6 days agoBenchmark
Xiaomi announced MiMo-V2.5-Pro-UltraSpeed with TileRT, claiming over 1,000 tokens/s decode speed on a 1-trillion-parameter MoE model. The company says it runs on a single standard 8-GPU commodity node, not wafer-scale or SRAM-heavy specialized hardware. The claimed stack combines FP4 MoE expert quantization, DFlash speculative decoding, and TileRT low-latency inference kernels, but independent validation is still needed.
OpenEnv coordination expands to HF, PyTorch, Unsloth, Modal, and more
r/LocalLLaMA top day6 days agoNew Tool
OpenEnv is a tool for creating agentic execution environments such as terminals, browsers, or other systems an agent can interact with. The project will now be coordinated by a committee including Meta-PyTorch, Reflection, Unsloth, Modal, Prime Intellect, Nvidia, Mercor, Fleet AI, and Hugging Face. The post also lists many AI organizations supporting or adopting OpenEnv, positioning it as infrastructure for open-source agent training.
mtmd adds video input support in llama.cpp★ 72
r/LocalLLaMA top day6 days agoRelease
ggml-org/llama.cpp merged PR #24269, adding video input support to mtmd through mtmd-cli and /chat/completions, which also enables the web UI path. The implementation invokes a locally installed ffmpeg subprocess instead of bundling codec support, and currently extracts visual frames only, with no audio support yet. It was tested with Qwen3-VL-2B in CLI and Gemma 4 E4B in web UI, making local multimodal video experiments more accessible.
Gemma 4 Chat Template now has preserve thinking
r/LocalLLaMA top day6 days agoRelease
A r/LocalLLaMA post notes that Gemma 4’s chat template now has “preserve thinking.” The linked discussion points to google/gemma-4-31B-it on Hugging Face, suggesting a template-level change rather than a new model release or benchmark. The original post does not provide detailed usage notes, defaults, compatibility information, or measured effects.
The crash that vanished: control and emergence in a five-model economy
Hugging Face Blog6 days agoCommentary
With no source text provided, this can only be inferred from the title. The post appears to examine a five-model economy where a potential crash disappears under some form of control or changed system dynamics. Its likely relevance is in multi-agent or multi-model systems, where collective behavior can diverge from individual model behavior.
llama.cpp PR #24277 avoids KV cell copies in kv-cache
r/LocalLLaMA top day6 days agoRelease
ggml-org/llama.cpp merged PR #24277 by ggerganov, titled “kv-cache: avoid kv cells copies.” The Reddit post says the change improves MTP performance for Gemma-4 and was merged the previous day. It is available starting with the b9551 release, making it relevant for local inference users tracking llama.cpp performance updates.
Import AI 460: Reward hacking society, RSI data, and RL quadcopter racing★ 76
Import AI (Jack Clark)6 days agoCommentary
Import AI 460 covers SocioHack, a benchmark where RL-trained LLMs discover loopholes in institutional rule systems. It also discusses Anthropic evidence for a practical form of recursive self-improvement, reflected in sharply increased code merged during 2026. Other sections examine multi-agent RL drones outperforming a champion human pilot, plus research showing state-controlled media can shape LLM responses in local languages.
The Weather and Climate Science AI Revolution Isn't Revolutionary
Ars Technica AI6 days agoCommentary
While AI models like Google's GraphCast have dramatically accelerated weather forecasting, experts argue the "AI revolution" in climate science is overstated. Machine learning models struggle with unprecedented extreme events due to their reliance on historical training data, and they often violate fundamental physical laws. Consequently, AI is currently acting as an emulator to speed up traditional physics-based models rather than replacing them, pointing toward a hybrid future.
Leanstral: Open-Source Foundation for Trustworthy Vibe-Coding★ 76
Mistral AI News6 days agoRelease
Mistral AI introduced Leanstral, an open-source code agent designed for Lean 4 and formal proof engineering. The model is available through Apache 2.0 weights, Mistral Vibe, and a Labs API endpoint. Mistral positions it as a cost-efficient alternative for verified coding workflows, with FLTEval benchmarks comparing it against Claude family models and large open-source competitors.
Mistral AI partners with NVIDIA to accelerate open frontier models★ 74
Mistral AI News6 days agoBusiness
Mistral AI announced it is a founding member of the NVIDIA Nemotron Coalition, a global initiative for open frontier foundation models. The partnership combines Mistral AI’s model architecture, training techniques, multimodal capabilities, and enterprise fine-tuning tools with NVIDIA compute, development tools, and synthetic data pipelines. The coalition’s first initiative is a DGX Cloud-trained base model that will support the upcoming NVIDIA Nemotron 4 family and be open-sourced for specialization.
Physics AI research shaping industry
Mistral AI News6 days agoPaper
Mistral frames Physics AI as a strategic research direction for aerospace, automotive, semiconductors, and energy. The post links Emmi AI’s work to Mistral’s enterprise ambitions in industrial engineering. It highlights published papers on CFD foundation models, 3D wing simulation datasets, AB-UPT, GyroSwin, NeuralDEM, and Universal Physics Transformer rather than announcing one new product.
Introducing physics AI at Mistral for engineering acceleration★ 73
Mistral AI News6 days agoRelease
Mistral presents physics AI models that predict physical fields from geometry, boundary conditions, solver outputs, or measurement data. The company positions the approach as a high-throughput complement to traditional CFD and FEM solvers, not a universal replacement or an LLM trained on simulations. It targets product design, tooling optimization, and real-time digital twins across aerospace, automotive, semiconductors, energy, and industrial equipment.
A-share accounts can now buy into Robotaxi
量子位 QbitAI6 days agoBusiness
With no article body provided, the only safe reading is that QbitAI is framing Robotaxi as an investable A-share market theme. The headline likely points to a stock, fund, index, ETF, or related vehicle rather than buying physical robotaxis. Its significance is more about commercialization and capital-market packaging than a specific technical AI breakthrough.
VAST Raises Nearly $200M and Reveals Its Project Eden World Model Roadmap★ 74
量子位 QbitAI6 days agoBusiness
VAST completed nearly $200 million in A+ and A++ financing after its March 2026 Series A. The company also unveiled Project Eden, a world model approach that separates persistent state transition from generative visual rendering. The roadmap targets persistent virtual environments, multiplayer interaction, reusable scenes, AI-native sandbox creation, and embodied AI simulation, while acknowledging unresolved challenges in complex physics and autonomous state maintenance.
World’s First Robot Training Property: 300,000 Chinese Homes for Robots
量子位 QbitAI6 days agoRelease
Daxiao Robot and CUHK MMLab introduced Kairos-Homeworld, an open project with 300,000 Chinese residential floor plans and 5,000 interactive 3D home scenes. It can generate full household environments from prompts, including layouts, furniture, objects, and physical properties. The article frames it alongside Kairos 3.0-4B as part of a broader embodied AI stack: world model, data, and environment.
Altman, Amodei, and Hassabis Unite to Back DNA Safety Legislation
量子位 QbitAI6 days agoRegulation
Based on the headline and public reporting, the article covers a rare joint push by Sam Altman, Dario Amodei, Demis Hassabis, and other AI leaders for US biosecurity legislation. They are asking lawmakers to require synthetic DNA and RNA providers to screen customers, orders, and records. The concern is that advanced AI could lower the knowledge barrier for designing dangerous biological agents.
Spending 10B on Humanoids? Put 100,000 Robot Dogs in Homes First
量子位 QbitAI6 days agoCommentary
QbitAI questions the industry’s heavy focus on humanoid robots and argues that consumer quadrupeds may be the more practical near-term path. It frames homes as richer, messier training grounds than factories for embodied AI. The key point is that scalable robot dogs could enter households, collect real interaction data, and build a consumer flywheel before humanoids become broadly usable.
Huawei Cloud Launches Agentic AI Products for Enterprise AI Infrastructure★ 72
量子位 QbitAI6 days agoRelease
Huawei Cloud announced an Agentic Infra framework at its INSPIRE event, covering token generation, persistent memory, unified scheduling, and secure autonomous runtime. The release includes AICS, AMS, CCE Volcano Next, AgentSphere, ModelArts Next, AgentArts, and the open-source openJiuwen project. It also introduced industry AI zones, CloudRobo for embodied AI, security offerings, and an ecosystem plan with major Chinese model vendors.
Brainμ Supports Science Paper on Memory-Sleep Neural Mechanisms★ 74
量子位 QbitAI6 days agoPaper
BAAI and Tsinghua researchers published a Science study on bidirectional memory-sleep regulation. Brainμ0 supported analysis of sleep EEG and two-photon calcium imaging data, helping identify sleep states and memory-reactivation patterns. The study reports that negative memory reactivation can fragment sleep and increase alertness, while positive memory reactivation may improve sleep continuity and resistance to disturbance.
CVPR 2026 Highlights Guangdong as He Kaiming and GDUT Team Stand Out★ 76
量子位 QbitAI6 days agoPaper
CVPR 2026 named Google DeepMind’s D4RT as Best Paper for fast dynamic 4D scene reconstruction from video. Honorable mentions included Meta’s SAM 3D and NVIDIA’s NitroGen, while TRELLIS.2 won Best Student Paper. The article emphasizes Chinese researcher visibility, ResNet and YOLO receiving the Longuet-Higgins Prize, and a GDUT-led undergraduate-heavy ChordEdit team breaking through among major labs and elite universities.
Let Matrices Be Analog and Logic Be Digital: A Chinese Team Redefines Computers
量子位 QbitAI6 days agoHardware
The source text is unavailable, so only a conservative inference is possible. The title suggests a Chinese team is proposing a computer architecture that assigns matrix computation to analog hardware while keeping logic and control in digital systems. This likely relates to AI hardware or mixed-signal accelerators, but no team name, benchmark, product status, or technical validation can be confirmed.
For LLMs It Is Coding; for Embodied AI It Is Picking: Yuanli Lingji Enters Early
量子位 QbitAI6 days agoBusiness
Based only on the title, the article frames coding as a key testbed for large language models and picking as a key testbed for embodied AI. It appears to focus on Yuanli Lingji’s early move into robot manipulation or picking scenarios. No concrete product, benchmark, model detail, or performance claim can be verified without the original article body.
ChatGPT vs Doubao on Gaokao Math
量子位 QbitAI6 days agoBenchmark
The article appears to test ChatGPT and Doubao on Chinese Gaokao math problems. Since the original text is unavailable, the exact questions, prompts, scores, and winner cannot be verified. It should be treated as a media-style AI capability comparison rather than a rigorous, reproducible benchmark.
Honoring veterans and their voices: Lt Col Thomas Brittingham’s story
ElevenLabs Blog6 days agoCommentary
Based only on the title, this ElevenLabs Blog post centers on honoring veterans through the story of Lt Col Thomas Brittingham. It likely emphasizes voice, memory, and personal narrative rather than a technical release or benchmark. Since the original article text was not provided, no specific product details, technical claims, or outcomes can be confirmed.
Scaling multilingual diplomacy during the Polish presidency of the Council of the EU
ElevenLabs Blog6 days agoBusiness
Based only on the title, this ElevenLabs Blog post likely discusses multilingual diplomacy during Poland’s presidency of the Council of the EU. It may involve voice, translation, or audio workflows, but the original text is unavailable, so specific claims cannot be verified. The main signal is that AI voice tools are being positioned for public-sector and international communication use cases.
Bringing voice AI into the classroom with ElevenLabs
ElevenLabs Blog6 days agoNew Tool
ElevenLabs announced two education-focused initiatives: Impact Program x Professors and an Einstein voice-based learning experience. The professor program offers free Pro-tier access and time-bound student access for courses and projects. The Einstein experience brings his recreated voice to ElevenReader and an AI Agent, letting users listen to or conversationally explore his writings and scientific ideas.
Research
ElevenLabs Blog6 days agoCommentary
This source points to the Research category on the ElevenLabs Blog rather than a specific article. No body text, article list, date, author, model name, method, or result was provided. It should therefore be treated conservatively as a research-related index page, not as a confirmed release, paper, or benchmark.
Do agents.md files help coding agents?
Hacker News (AI keywords)6 days agoCommentary
The source only provides the title, so no conclusion or evidence can be verified. The topic appears to ask whether an agents.md file helps coding agents understand project conventions, commands, and constraints. This is relevant to developers adopting AI coding tools, but any claims about effectiveness would require the original post or supporting examples.
Google's Official Gemma 4 QAT Q4_0 GGUFs Have Higher Precision Than Unsloth's Q4_K_XL
r/LocalLLaMA top day6 days agoCommentary
An analysis of Gemma 4 QAT GGUF files reveals that Google's official 'Q4_0' releases actually employ a mixed-precision strategy. For smaller models like E2B and E4B, Google keeps critical token embeddings in Q6_K and certain projection weights in F16. This makes Google's Q4_0 files larger and more precise than Unsloth's 'Q4_K_XL' versions, which default to standard Q4_0 for almost all tensors.

← PreviousPage 4Next →

Latest in AI

I bundled a fully local LLM inside my Unity game

Xiaomi Claims 1,000+ TPS on a 1T Model Using a Standard 8-GPU Server★ 72

OpenEnv coordination expands to HF, PyTorch, Unsloth, Modal, and more

mtmd adds video input support in llama.cpp★ 72

Gemma 4 Chat Template now has preserve thinking

The crash that vanished: control and emergence in a five-model economy

llama.cpp PR #24277 avoids KV cell copies in kv-cache

Import AI 460: Reward hacking society, RSI data, and RL quadcopter racing★ 76

The Weather and Climate Science AI Revolution Isn't Revolutionary

Leanstral: Open-Source Foundation for Trustworthy Vibe-Coding★ 76

Mistral AI partners with NVIDIA to accelerate open frontier models★ 74

Physics AI research shaping industry

Introducing physics AI at Mistral for engineering acceleration★ 73

A-share accounts can now buy into Robotaxi

VAST Raises Nearly $200M and Reveals Its Project Eden World Model Roadmap★ 74

World’s First Robot Training Property: 300,000 Chinese Homes for Robots

Altman, Amodei, and Hassabis Unite to Back DNA Safety Legislation

Spending 10B on Humanoids? Put 100,000 Robot Dogs in Homes First

Huawei Cloud Launches Agentic AI Products for Enterprise AI Infrastructure★ 72

Brainμ Supports Science Paper on Memory-Sleep Neural Mechanisms★ 74

CVPR 2026 Highlights Guangdong as He Kaiming and GDUT Team Stand Out★ 76

Let Matrices Be Analog and Logic Be Digital: A Chinese Team Redefines Computers

For LLMs It Is Coding; for Embodied AI It Is Picking: Yuanli Lingji Enters Early

ChatGPT vs Doubao on Gaokao Math

Honoring veterans and their voices: Lt Col Thomas Brittingham’s story

Scaling multilingual diplomacy during the Polish presidency of the Council of the EU

Bringing voice AI into the classroom with ElevenLabs

Research

Do agents.md files help coding agents?

Google's Official Gemma 4 QAT Q4_0 GGUFs Have Higher Precision Than Unsloth's Q4_K_XL