Latest in AI

Showing:StudentsOtherClear ×

🔥 Trending today

anthropic7 export-controls5 model-access3 ai-infrastructure3 spacex3 amazon3 national-security2 open-source2 governance2 ai-policy2

Topic

Release New Tool Tutorial Business Paper Benchmark Opinion Regulation

For

General Developers Designers Product Founders Marketing Researchers Students

Alibaba Launches a Free College Application Agent for Gaokao Students
量子位 QbitAI3 days agoNew Tool
QbitAI reports that Alibaba has released a free Agent for Gaokao college application planning. Based on the title alone, the tool is aimed at China’s 12.9 million exam candidates as they choose universities and majors. No article body was provided, so details such as the product name, underlying model, capabilities, data sources, and usage limits are not stated.
Baidu Launches Industry-First AI Gaokao Application Service with Human Expert Verification
量子位 QbitAI3 days agoNew Tool
Baidu has upgraded its annual Gaokao support services with what it claims is an industry-first AI-driven college application preference filing system. The platform pairs AI-generated university and major recommendations with real human expert verification, directly addressing accuracy risks in high-stakes decisions. The service targets millions of Chinese students who must navigate the complex and irreversible 志愿填报 application process each exam season.
Meta Invests $115 Million to Train Technical Talent as AI Hits White-Collar Jobs
INSIDE 硬塞 AI3 days agoBusiness
Meta is investing $115 million in vocational training as AI disruption pressures white-collar workers. The effort aims to develop blue-collar skills such as electrical and construction-related work needed for AI data center buildouts. The move addresses Meta’s own labor needs while offering a reskilling path for workers affected by automation.
DiffusionGemma: 4x faster text generation★ 74
Google DeepMind Blog4 days agoRelease
Google’s DiffusionGemma is an Apache 2.0 experimental open model using text diffusion instead of standard autoregressive decoding. The 26B MoE model activates 3.8B parameters during inference and is designed for low-latency local workflows. Google claims up to 4x faster generation on dedicated GPUs, while noting that output quality is below standard Gemma 4 and production-quality use cases should still prefer Gemma 4.
Give GitHub Copilot CLI real code intelligence with language servers
GitHub Blog4 days agoTutorial
GitHub’s post shows how to install and configure language servers for GitHub Copilot CLI using the LSP Setup skill. The workflow selects a language, detects the OS, installs the right server, merges configuration, and verifies the setup. With LSP enabled, Copilot CLI can resolve types, jump to definitions, find references, and read hover docs with less reliance on grep or dependency scraping.
Emacs Appearances in Pop Culture
Hacker News (AI keywords)4 days agoCommentary
Based only on the title and metadata, this appears to be a curated or commentary-style post about Emacs references in pop culture. No article body was provided, so specific examples, interpretation, and scope cannot be verified. Its relevance is mainly cultural and historical for developers familiar with Emacs, rather than a current AI, model, or product update.
New to Local LLMs: Overwhelmed by Tool Choices, Model Naming, and Quantization
r/LocalLLaMA top day4 days agoTutorial
A first-time local LLM user installed ollama on Windows with gemma4 and qwen3.6, but quickly hit a wall of confusion around GUI tool selection, model size tradeoffs, and cryptic quantization naming like Q4_K_M and IQ4_XS. Despite owning high-end hardware (RTX 5090, 64GB DDR5, 9950X3D), the user lacks the foundational knowledge to make informed choices. The post highlights ongoing onboarding gaps in the local LLM ecosystem, where fragmented tooling and jargon-heavy documentation create steep barriers for newcomers.
Unsloth Gemma 4 QAT MTP assistant models now available
r/LocalLLaMA top day5 days agoRelease
A r/LocalLLaMA post notes that Unsloth’s Gemma 4 QAT MTP assistant models are now available in GGUF format. The root directories include q8_0 files named mtp-gemma-4-*.gguf, while MTP folders contain q8_0 and larger quantized variants. The listed releases cover 12B, 26B-A4B, 31B, E2B, E2B mobile, E4B, and E4B mobile it-qat-GGUF repositories.
Yu Ai Wei Wu Showcases Education AI Model and Learning Agent at Tencent Cloud Event
量子位 QbitAI5 days agoBusiness
According to the title, Yu Ai Wei Wu appeared at Tencent Cloud’s AI industry application conference with a focus on education models and learning Agents. The positioning suggests an effort to apply AI more deeply to personalized learning or teaching workflows. Since the original article text was not provided, specific product features, model architecture, partnerships, and real-world results cannot be verified.
llama.cpp PR adds MTP support for Gemma-4 E2B and E4B assistants
r/LocalLLaMA top day5 days agoRelease
The Reddit post links to ggml-org/llama.cpp Pull Request #24282, which adds MTP support for Gemma-4 E2B and E4B assistants. The submitter frames it as useful for tiny Gemma models on phones, low-end machines, Raspberry Pi, or similarly constrained devices. The post does not include benchmarks, merge status, or setup instructions, so it should be treated as a development signal rather than a finished release.
Was BitNet a dead end? What happened to ternary LLMs?
r/LocalLLaMA top day5 days agoCommentary
A r/LocalLLaMA user questions whether BitNet and ternary LLMs were a dead end after earlier promise around efficient low-bit models. The post notes that the largest ternary model appears to remain around 2B parameters. It asks why frontier open-weight AI labs are not visibly pursuing the approach, but provides no technical evidence or definitive answer.
Full Reverse Engineering of the TI-84 Plus Operating System
Hacker News (AI keywords)5 days agoHardware
This Hacker News item links to an article titled “Full Reverse Engineering of the TI-84 Plus Operating System.” Based on the provided material, the reliable takeaway is that it concerns reverse engineering the OS of Texas Instruments’ TI-84 Plus graphing calculator. The original text was not provided, so specific claims about methods, findings, code, memory layout, or security implications cannot be verified here.
Leanstral: Open-Source Foundation for Trustworthy Vibe-Coding★ 76
Mistral AI News6 days agoRelease
Mistral AI introduced Leanstral, an open-source code agent designed for Lean 4 and formal proof engineering. The model is available through Apache 2.0 weights, Mistral Vibe, and a Labs API endpoint. Mistral positions it as a cost-efficient alternative for verified coding workflows, with FLTEval benchmarks comparing it against Claude family models and large open-source competitors.
Brainμ Supports Science Paper on Memory-Sleep Neural Mechanisms★ 74
量子位 QbitAI6 days agoPaper
BAAI and Tsinghua researchers published a Science study on bidirectional memory-sleep regulation. Brainμ0 supported analysis of sleep EEG and two-photon calcium imaging data, helping identify sleep states and memory-reactivation patterns. The study reports that negative memory reactivation can fragment sleep and increase alertness, while positive memory reactivation may improve sleep continuity and resistance to disturbance.
CVPR 2026 Highlights Guangdong as He Kaiming and GDUT Team Stand Out★ 76
量子位 QbitAI6 days agoPaper
CVPR 2026 named Google DeepMind’s D4RT as Best Paper for fast dynamic 4D scene reconstruction from video. Honorable mentions included Meta’s SAM 3D and NVIDIA’s NitroGen, while TRELLIS.2 won Best Student Paper. The article emphasizes Chinese researcher visibility, ResNet and YOLO receiving the Longuet-Higgins Prize, and a GDUT-led undergraduate-heavy ChordEdit team breaking through among major labs and elite universities.
ChatGPT vs Doubao on Gaokao Math
量子位 QbitAI6 days agoBenchmark
The article appears to test ChatGPT and Doubao on Chinese Gaokao math problems. Since the original text is unavailable, the exact questions, prompts, scores, and winner cannot be verified. It should be treated as a media-style AI capability comparison rather than a rigorous, reproducible benchmark.
Bringing voice AI into the classroom with ElevenLabs
ElevenLabs Blog6 days agoNew Tool
ElevenLabs announced two education-focused initiatives: Impact Program x Professors and an Einstein voice-based learning experience. The professor program offers free Pro-tier access and time-bound student access for courses and projects. The Einstein experience brings his recreated voice to ElevenReader and an AI Agent, letting users listen to or conversationally explore his writings and scientific ideas.
SDSU Wired Its Dorms with 1,300 AI Cameras Without Telling Students
Hacker News (AI keywords)6 days agoEthics
San Diego State University reportedly deployed around 1,300 AI-enabled cameras across campus, including roughly 330 tied to student dorm areas. The controversy centers on whether students were adequately informed and whether residential common areas should be treated as ordinary surveillance zones. With no full article text provided, the strongest reading is that this is an AI governance and privacy incident, not a model or product launch story.
A Matter Wi-Fi Light Bulb in Rust on the Raspberry Pi Pico 2 W
Hacker News (AI keywords)6 days agoHardware
This GitHub repository collects Rust Embassy examples for Raspberry Pi Pico 2 and Pico 2 W. Its Matter Wi-Fi light example uses rs-matter, BLE commissioning, and Wi-Fi connectivity so the board can appear as a standard smart bulb in Home Assistant, Apple Home, or Google Home. The project is mainly relevant to embedded Rust and smart-home developers, not AI model users.
"Fully Hallucinated Operating System" Simulates an Entire OS via LLM Prompts
r/LocalLLaMA top day6 days agoCommentary
A popular Reddit post highlights a video demonstrating a "Fully Hallucinated Operating System" run entirely inside an LLM. By prompting the model to act as a terminal, it simulates file systems, network requests, and command execution purely through text generation. While impractical for production, this experiment showcases the impressive state-tracking and "world model" capabilities of modern LLMs.
Show HN: Oproxy - inspect and modify network traffic from the browser
Hacker News (AI keywords)7 days agoNew Tool
Oproxy is a local HTTP, HTTPS, and SOCKS5 proxy with a browser-based management UI. It captures requests and responses, supports replay and Compose workflows, and can export HAR, cURL, Fetch, and Python snippets. Advanced features include HTTPS MITM, mock responses, throttling, breakpoints, DNS overrides, Lua scripts, and an OpenAI-compatible assistant for preparing confirmed proxy changes.
LLM Research Papers: The 2026 List (January to May)
Ahead of AI (Raschka)8 days agoPaper
Sebastian Raschka compiles a curated reference list of LLM papers he bookmarked from January through May 2026. The list is not comprehensive, but organized around topics useful for future articles, lectures, code examples, and research work. Public sections emphasize reasoning, RL, efficient inference, long context, agent systems, tool use, coding agents, diffusion language models, and serving infrastructure.
Thousand Token Wood: shipping a multi-agent economy on a 3B model
Hugging Face Blog8 days agoTutorial
Based on the title, this Hugging Face Blog post presents Thousand Token Wood, a project shipping a multi-agent economy on a 3B model. The likely focus is practical system design under small-model constraints, rather than a new frontier-scale model release. Without the original text, details such as the exact model, architecture, benchmarks, code availability, and results cannot be confirmed.
Warren's Abstract Machine: A Tutorial Reconstruction
Hacker News (AI keywords)8 days agoTutorial
This repository preserves Hassan Ait-Kaci’s out-of-print tutorial on the Warren Abstract Machine, a key execution model for Prolog and logic programming systems. It is not a new AI model or product launch, but a useful historical and educational resource. The material is most relevant to developers and researchers interested in symbolic AI, compilers, unification, backtracking, and logic language runtimes.
Transformers are inherently succinct★ 74
Hacker News (AI keywords)8 days agoPaper
This paper studies transformer expressivity through succinctness: how compactly a formalism describes a language. It proves fixed-precision transformers can be exponentially more succinct than LTL and RNNs, and doubly exponentially more succinct than finite automata. The same succinctness makes verification hard, with basic problems such as emptiness and equivalence shown to be EXPSPACE-complete.
Gemma 4 QAT models: Optimizing model compression for mobile and laptop efficiency★ 72
Hacker News (AI keywords)9 days agoRelease
Google released new Gemma 4 checkpoints optimized with Quantization-Aware Training to preserve quality after compression. The release includes Q4_0 checkpoints and a mobile-focused quantization format that can reduce Gemma 4 E2B memory use to about 1GB, or below 1GB for a text-only configuration. The models are available through Hugging Face and supported across llama.cpp, Ollama, LM Studio, LiteRT-LM, Transformers.js, SGLang, vLLM, MLX, and Unsloth.
Ask HN: What is your (AI) dev tech stack / workflow?
Hacker News (AI keywords)9 days agoCommentary
An Ask HN thread asks developers to share their current AI-assisted development setup for upcoming in-person workshops. The author wants guidance for beginners and working developers, with use cases ranging from static sites to FastAPI tools and Linux home automation. Replies cover Claude Code, Cursor, GitHub Copilot, VSCode, spec-driven development, TDD, multi-agent workflows, reviews, and quality control.
Arithmetic Without Numbers: How LLMs Do Math
Hacker News (AI keywords)9 days agoCommentary
The article asks whether LLM arithmetic is memorization, heuristics, real computation, or experimental assistance. It summarizes Rune experiments that decode operations and operands from frozen Llama activations, then route them to Python under a no-parser rule. The strongest supported claim is narrow: activation-derived tool arguments worked in scoped audits, while residual-state JIT replacement, long-number generation, and cross-model transfer remain brittle.
MIT Develops Electrochemical CO2 Capture Using NHI Molecules
INSIDE 硬塞 AI9 days agoPaper
MIT has proposed a new electrochemical carbon capture approach that uses NHI molecules as the adsorbent. Instead of relying on energy-intensive heat-driven processes, the system is powered by electricity. The method could improve efficiency and scalability, but the provided source frames it as a promising research direction rather than a proven commercial deployment.
How LLMs Actually Work
Hacker News (AI keywords)10 days agoTutorial
The article explains how modern LLMs convert text into token IDs, embeddings, and position-aware vectors before passing them through stacked transformer blocks. It covers attention, multi-head attention, KV cache, GQA, feed-forward networks, MoE, residual streams, normalization, and decoding. Its goal is educational: helping readers understand the common architecture behind many current model families and read model cards or papers more confidently.

Page 1Next →

Latest in AI

Alibaba Launches a Free College Application Agent for Gaokao Students

Baidu Launches Industry-First AI Gaokao Application Service with Human Expert Verification

Meta Invests $115 Million to Train Technical Talent as AI Hits White-Collar Jobs

DiffusionGemma: 4x faster text generation★ 74

Give GitHub Copilot CLI real code intelligence with language servers

Emacs Appearances in Pop Culture

New to Local LLMs: Overwhelmed by Tool Choices, Model Naming, and Quantization

Unsloth Gemma 4 QAT MTP assistant models now available

Yu Ai Wei Wu Showcases Education AI Model and Learning Agent at Tencent Cloud Event

llama.cpp PR adds MTP support for Gemma-4 E2B and E4B assistants

Was BitNet a dead end? What happened to ternary LLMs?

Full Reverse Engineering of the TI-84 Plus Operating System

Leanstral: Open-Source Foundation for Trustworthy Vibe-Coding★ 76

Brainμ Supports Science Paper on Memory-Sleep Neural Mechanisms★ 74

CVPR 2026 Highlights Guangdong as He Kaiming and GDUT Team Stand Out★ 76

ChatGPT vs Doubao on Gaokao Math

Bringing voice AI into the classroom with ElevenLabs

SDSU Wired Its Dorms with 1,300 AI Cameras Without Telling Students

A Matter Wi-Fi Light Bulb in Rust on the Raspberry Pi Pico 2 W

"Fully Hallucinated Operating System" Simulates an Entire OS via LLM Prompts

Show HN: Oproxy - inspect and modify network traffic from the browser

LLM Research Papers: The 2026 List (January to May)

Thousand Token Wood: shipping a multi-agent economy on a 3B model

Warren's Abstract Machine: A Tutorial Reconstruction

Transformers are inherently succinct★ 74

Gemma 4 QAT models: Optimizing model compression for mobile and laptop efficiency★ 72

Ask HN: What is your (AI) dev tech stack / workflow?

Arithmetic Without Numbers: How LLMs Do Math

MIT Develops Electrochemical CO2 Capture Using NHI Molecules

How LLMs Actually Work