Latest in AI

Showing:ResearchersClear ×

🔥 Trending today

open-source3 anthropic3 amazon3 ai-regulation2 government-policy2 export-controls2 geopolitics2 privacy2 python-packaging2 webassembly2

Topic

Release New Tool Tutorial Business Paper Benchmark Opinion Regulation

For

General Developers Designers Product Founders Marketing Researchers Students

Full Reverse Engineering of the TI-84 Plus Operating System
Hacker News (AI keywords)6 days agoHardware
This Hacker News item links to an article titled “Full Reverse Engineering of the TI-84 Plus Operating System.” Based on the provided material, the reliable takeaway is that it concerns reverse engineering the OS of Texas Instruments’ TI-84 Plus graphing calculator. The original text was not provided, so specific claims about methods, findings, code, memory layout, or security implications cannot be verified here.
LocalLLaMA post urges users not to join SpaceX, OpenAI, Anthropic IPOs
r/LocalLLaMA top day6 days agoOpinion
A popular r/LocalLLaMA post urges local LLM supporters not to invest in IPOs tied to SpaceX, OpenAI, or Anthropic. The author argues that frontier labs drive up demand and prices for GPUs, RAM, SSDs, HDDs, and NAS hardware, making local inference harder. The post also questions AI company valuations, but its claims are mostly opinion and speculation without cited evidence.
Show HN: Gitdot – a better GitHub, open-source, anti-AI, written in Rust
Hacker News (AI keywords)6 days agoNew Tool
Gitdot appeared on Hacker News as a Show HN project claiming to be “a better GitHub.” The title says it is open-source, written in Rust, and explicitly anti-AI. No article body was provided, so details about features, licensing, deployment, maturity, and how it differs from GitHub cannot be confirmed from the source.
An Implementation of NanoQuant: A Flexible Binary Quantization Method
r/LocalLLaMA top day6 days agoNew Tool
A r/LocalLLaMA post presents an unofficial PyTorch implementation of NanoQuant, a 2026 post-training quantization method for dense transformers. The method factorizes weights into scaling vectors and binary matrices, then quantizes and fine-tunes blocks sequentially to reduce hardware requirements. Early Qwen3-0.6B and Qwen3-4B experiments are promising for base models, but instruct quality remains weak and highly dependent on calibration data.
I bundled a fully local LLM inside my Unity game
r/LocalLLaMA top day6 days agoRelease
A developer shared a Unity game, Simulation Simulator, that bundles a local LLM with no internet, cloud service, or API key required. The game is a campfire chat simulator about DMT, simulation theory, and a monitor-headed friend, with five endings driven by natural AI interaction. The author sees this as a path toward richer NPCs, while noting local TTS and translation are still too slow for smooth gameplay.
NotebookLM’s Gemini 3.5 upgrade adds a cloud computer and help finding sources
The Verge AI6 days agoRelease
Google is rolling out broad updates to NotebookLM, its AI-powered note-taking and research app launched in 2023. The app now uses Google’s upgraded Gemini 3.5 model, which the company says should provide more accurate and reliable responses. The update also adds a cloud computer and help finding sources, expanding NotebookLM beyond source-based Q&A into a broader research assistant workflow.
Xiaomi Claims 1,000+ TPS on a 1T Model Using a Standard 8-GPU Server★ 72
r/LocalLLaMA top day6 days agoBenchmark
Xiaomi announced MiMo-V2.5-Pro-UltraSpeed with TileRT, claiming over 1,000 tokens/s decode speed on a 1-trillion-parameter MoE model. The company says it runs on a single standard 8-GPU commodity node, not wafer-scale or SRAM-heavy specialized hardware. The claimed stack combines FP4 MoE expert quantization, DFlash speculative decoding, and TileRT low-latency inference kernels, but independent validation is still needed.
Why are so many young people getting cancer? What researchers do and don't know
Hacker News (AI keywords)6 days agoCommentary
Nature reports that researchers are investigating why more young people are developing cancers once associated mainly with older age. Emerging explanations exist, but the article stresses that causes are likely to differ by tumor type. The visible article metadata frames the issue as cancer, public health, and epidemiology, with many uncertainties still unresolved.
AI Is Slowing Down
Hacker News (AI keywords)6 days agoCommentary
The article argues generative AI must keep accelerating to justify massive data center, cloud, and GPU commitments. Zitron says OpenAI, Anthropic, hyperscalers, and NVIDIA depend on AI services reaching extraordinary revenue levels by 2029-2030. He points to token-based billing, weak ROI visibility, enterprise spending caps, and customer pushback as signs that demand may be cooling before the infrastructure bet can pay off.
Luce Spark: a 35B MoE on a 16 GB GPU, without the offload tax★ 72
r/LocalLLaMA top day6 days agoNew Tool
Luce Spark is an open-source MoE offload system for running 33B-35B A3B models on 16GB-class GPUs. It keeps frequently routed experts on GPU, stores the long tail in system RAM, and swaps cold experts through a bounded async cache. The author reports 13.3 GiB for Qwen3.6 35B-A3B and about 100 tok/s with Spark optimizations, but notes real 16GB GPU testing is still missing.
World Capitals Voronoi: Redrawing the World Map by Nearest Capital
Hacker News (AI keywords)6 days agoCommentary
Jason Davies’ map divides the world into regions based on the closest national capital rather than political borders. The page says it uses a spherical Voronoi diagram, accounting for Earth’s curvature when computing distances. The data source is Natural Earth’s 1:10m Cultural Vectors for Admin-0 capitals, making this a geography and visualization item, not an AI release.
OpenEnv coordination expands to HF, PyTorch, Unsloth, Modal, and more
r/LocalLLaMA top day6 days agoNew Tool
OpenEnv is a tool for creating agentic execution environments such as terminals, browsers, or other systems an agent can interact with. The project will now be coordinated by a committee including Meta-PyTorch, Reflection, Unsloth, Modal, Prime Intellect, Nvidia, Mercor, Fleet AI, and Hugging Face. The post also lists many AI organizations supporting or adopting OpenEnv, positioning it as infrastructure for open-source agent training.
[3090] Gemma4 QAT + MTP quick TPS numbers
r/LocalLLaMA top day6 days agoBenchmark
A r/LocalLLaMA user shared quick throughput numbers for Gemma4 QAT with MTP speculative decoding on an RTX 3090 24GB setup. They report roughly 1.2-1.8x TPS improvement, with Gemma 4 31B moving from about 40 tok/s to 70-80 tok/s. The author frames this as a rough benchmark, using 11 task categories and noting stochastic variation from temp 1.0.
mtmd adds video input support in llama.cpp★ 72
r/LocalLLaMA top day6 days agoRelease
ggml-org/llama.cpp merged PR #24269, adding video input support to mtmd through mtmd-cli and /chat/completions, which also enables the web UI path. The implementation invokes a locally installed ffmpeg subprocess instead of bundling codec support, and currently extracts visual frames only, with no audio support yet. It was tested with Qwen3-VL-2B in CLI and Gemma 4 E4B in web UI, making local multimodal video experiments more accessible.
Gemma 4 Chat Template now has preserve thinking
r/LocalLLaMA top day6 days agoRelease
A r/LocalLLaMA post notes that Gemma 4’s chat template now has “preserve thinking.” The linked discussion points to google/gemma-4-31B-it on Hugging Face, suggesting a template-level change rather than a new model release or benchmark. The original post does not provide detailed usage notes, defaults, compatibility information, or measured effects.
The crash that vanished: control and emergence in a five-model economy
Hugging Face Blog6 days agoCommentary
With no source text provided, this can only be inferred from the title. The post appears to examine a five-model economy where a potential crash disappears under some form of control or changed system dynamics. Its likely relevance is in multi-agent or multi-model systems, where collective behavior can diverge from individual model behavior.
Google DeepMind RCT in Sierra Leone Shows Gemini's Guided Learning Boosts Education★ 72
Google DeepMind Blog6 days agoPaper
Google DeepMind released results from a randomized controlled trial (RCT) in Sierra Leone evaluating AI's impact on education. The study found that Gemini’s "Guided Learning" feature, which guides students instead of just giving answers, significantly boosted engagement. This research provides rigorous empirical evidence that AI tutoring can accelerate learning and help bridge educational gaps in resource-constrained regions.
What was your local daily driver for coding last week?
r/LocalLLaMA top day6 days agoCommentary
This r/LocalLLaMA post is a brief community poll asking users what their local coding daily driver was last week. The post asks commenters to share their favorite model and quant, but the provided text does not include poll options, results, or specific model names. Its value is mainly as a community signal for tracking local LLM coding preferences.
llama.cpp PR #24277 avoids KV cell copies in kv-cache
r/LocalLLaMA top day6 days agoRelease
ggml-org/llama.cpp merged PR #24277 by ggerganov, titled “kv-cache: avoid kv cells copies.” The Reddit post says the change improves MTP performance for Gemma-4 and was merged the previous day. It is available starting with the b9551 release, making it relevant for local inference users tracking llama.cpp performance updates.
Import AI 460: Reward hacking society, RSI data, and RL quadcopter racing★ 76
Import AI (Jack Clark)6 days agoCommentary
Import AI 460 covers SocioHack, a benchmark where RL-trained LLMs discover loopholes in institutional rule systems. It also discusses Anthropic evidence for a practical form of recursive self-improvement, reflected in sharply increased code merged during 2026. Other sections examine multi-agent RL drones outperforming a champion human pilot, plus research showing state-controlled media can shape LLM responses in local languages.
The Weather and Climate Science AI Revolution Isn't Revolutionary
Ars Technica AI6 days agoCommentary
While AI models like Google's GraphCast have dramatically accelerated weather forecasting, experts argue the "AI revolution" in climate science is overstated. Machine learning models struggle with unprecedented extreme events due to their reliance on historical training data, and they often violate fundamental physical laws. Consequently, AI is currently acting as an emulator to speed up traditional physics-based models rather than replacing them, pointing toward a hybrid future.
Cohere and Mila Partner to Advance Quebec French Language and Culture in AI
Cohere Blog6 days agoBusiness
Cohere has partnered with Mila, the Quebec AI Institute, to improve the representation of Quebec French (Québécois) and its cultural nuances in AI. The collaboration aims to address the European French bias in current models by leveraging Cohere's multilingual capabilities and Mila's research expertise. This initiative will help deliver more culturally accurate AI solutions for Quebec's public and private sectors.
Cohere Healthcare & Life Sciences: Transforming Clinical Research and Medical Data Retrieval with Enterprise AI
Cohere Blog6 days agoBusiness
Cohere highlights its enterprise AI solutions tailored for the healthcare and life sciences sectors. By utilizing its Command, Embed, and Rerank models, Cohere enables medical institutions and pharmaceutical companies to securely retrieve and analyze complex clinical data. This accelerates drug discovery, streamlines clinical trials, and improves administrative efficiency while ensuring strict regulatory compliance.
Introducing Cohere Transcribe: A New State-of-the-Art in Open-Source Speech Recognition★ 80
Cohere Blog6 days agoRelease
Cohere has announced "Cohere Transcribe," a new state-of-the-art open-source speech recognition model. Designed to deliver highly accurate and efficient speech-to-text capabilities, it represents Cohere's expansion into open-source audio AI. The model aims to challenge existing industry benchmarks like OpenAI's Whisper by offering superior multilingual performance.
Cohere Blog: Technology Tag Page Overview
Cohere Blog6 days agoCommentary
This page aggregates all technology-focused articles on the Cohere blog. As an enterprise-focused AI company, Cohere's technical content primarily covers its Command LLM family, industry-leading Embed and Rerank models, and practical RAG implementation guides. It serves as a key resource for developers and enterprise architects tracking Cohere's technical evolution.
Cohere Highlights Thomas Euyang as Research Visual Storyteller
Cohere Blog6 days agoBusiness
Cohere highlights the role of Thomas Euyang, a Research Visual Storyteller at the company. His work focuses on translating complex machine learning research and LLM concepts into intuitive, engaging visual narratives. This spotlight underscores the growing importance of design and visual communication in making advanced AI research accessible to developers and the public.
Co/plot: Supporting the Research Process Through Visualization
Cohere Blog6 days agoNew Tool
Cohere has announced "Co/plot," a tool dedicated to supporting the research process through advanced visualization. It aims to help researchers and developers better understand complex data structures, model behaviors, and research workflows. This launch highlights Cohere's expanding focus on building practical developer and researcher tools that complement their core LLM and embedding models.
Cohere's Commitment to Open Science and Collaborative AI Research
Cohere Blog6 days agoCommentary
Cohere's Open Science initiative, primarily driven by its non-profit research lab Cohere For AI (C4AI), focuses on democratizing AI research. By releasing open-weights models like Aya and fostering global research collaborations, Cohere aims to bridge the gap in multilingual AI representation. This approach highlights their commitment to community-driven, accessible AI development.
Cohere Product Launch Blog Hub
Cohere Blog6 days agoRelease
This link directs to Cohere's official "Product Launch" blog category. It serves as a centralized hub aggregating all major product announcements, including the Command LLM series, Embed models, Rerankers, and developer platform updates. It is a key resource for tracking Cohere's enterprise AI advancements.
Cohere Official Research Blog and Technical Publications
Cohere Blog6 days agoCommentary
The Cohere Research blog serves as the central hub for the company's academic papers and technical breakthroughs. It covers key areas including advanced Retrieval-Augmented Generation (RAG), multilingual embeddings, and robust tool-use capabilities for enterprise agents. This is a key resource for understanding the foundational technology behind Cohere's models.

← PreviousPage 8Next →

Latest in AI

Full Reverse Engineering of the TI-84 Plus Operating System

LocalLLaMA post urges users not to join SpaceX, OpenAI, Anthropic IPOs

Show HN: Gitdot – a better GitHub, open-source, anti-AI, written in Rust

An Implementation of NanoQuant: A Flexible Binary Quantization Method

I bundled a fully local LLM inside my Unity game

NotebookLM’s Gemini 3.5 upgrade adds a cloud computer and help finding sources

Xiaomi Claims 1,000+ TPS on a 1T Model Using a Standard 8-GPU Server★ 72

Why are so many young people getting cancer? What researchers do and don't know

AI Is Slowing Down

Luce Spark: a 35B MoE on a 16 GB GPU, without the offload tax★ 72

World Capitals Voronoi: Redrawing the World Map by Nearest Capital

OpenEnv coordination expands to HF, PyTorch, Unsloth, Modal, and more

[3090] Gemma4 QAT + MTP quick TPS numbers

mtmd adds video input support in llama.cpp★ 72

Gemma 4 Chat Template now has preserve thinking

The crash that vanished: control and emergence in a five-model economy

Google DeepMind RCT in Sierra Leone Shows Gemini's Guided Learning Boosts Education★ 72

What was your local daily driver for coding last week?

llama.cpp PR #24277 avoids KV cell copies in kv-cache

Import AI 460: Reward hacking society, RSI data, and RL quadcopter racing★ 76

The Weather and Climate Science AI Revolution Isn't Revolutionary

Cohere and Mila Partner to Advance Quebec French Language and Culture in AI

Cohere Healthcare & Life Sciences: Transforming Clinical Research and Medical Data Retrieval with Enterprise AI

Introducing Cohere Transcribe: A New State-of-the-Art in Open-Source Speech Recognition★ 80

Cohere Blog: Technology Tag Page Overview

Cohere Highlights Thomas Euyang as Research Visual Storyteller

Co/plot: Supporting the Research Process Through Visualization

Cohere's Commitment to Open Science and Collaborative AI Research

Cohere Product Launch Blog Hub

Cohere Official Research Blog and Technical Publications