Latest in AI

Showing:ResearchersClear ×

🔥 Trending today

export-controls2 ai-policy2 anthropic2 open-source2 geopolitics2 python-packaging2 webassembly2 pyodide2 ipo1 ai-investment1

Topic

Release New Tool Tutorial Business Paper Benchmark Opinion Regulation

For

General Developers Designers Product Founders Marketing Researchers Students

Leak Reveals Microsoft Wants Its AI to Be 'Addictive'
Hacker News (AI keywords)9 days agoEthics
The provided source only includes the headline, so the claim should be treated cautiously. It suggests leaked material says Microsoft wants its AI products to become “addictive,” raising questions about engagement-driven AI design. Without the article text, the exact product, document context, Microsoft response, and meaning of “addictive” cannot be verified.
The Quiet Numbers Station: Decoding Nineteen Years of GPS Cryptography
Hacker News (AI keywords)9 days agoPaper
Published on UCL's Bentham's Gaze blog, this research analyzes GPS cryptographic signals over a 19-year span, likening the satellites to 'quiet numbers stations.' The authors explore the evolution of GPS encryption (such as military P(Y) code and civilian authentication), evaluating their cryptographic strength and potential vulnerabilities using modern computational analysis.
Did Claude Increase Bugs in rsync?
Hacker News (AI keywords)9 days agoBenchmark
The article analyzes rsync releases to test whether versions containing Claude commits had unusually high bug rates. It uses severity-weighted bugs per 10 commits, exact permutation testing, and Fisher's exact test. With only two Claude-exposed releases, the evidence is limited, but both releases appear within normal historical variation rather than clear negative outliers.
Unlocking dependable responses with Gemini Enterprise Agent Platform’s Agentic RAG★ 72
Google Research Blog9 days agoRelease
Google Research and Google Cloud introduced an agentic RAG framework hosted on Gemini Enterprise Agent Platform. It uses multiple agents to plan, rewrite, route, retrieve, verify sufficient context, iterate, and synthesize answers. Google reports up to 34% factuality accuracy gains over standard RAG, plus 90.1% accuracy in a cross-corpus FramesQA setting with similar latency to single-corpus retrieval.
Quoting Andreas Kling
Simon Willison's Weblog9 days agoEthics
Simon Willison quotes Andreas Kling explaining Ladybird’s decision to stop accepting public pull requests. Kling argues that large patches once implied substantial effort, which could serve as a proxy for good faith, but generative AI has weakened that assumption. His central point is not whether code was typed by hand, but who takes responsibility for code once it enters a browser intended for real users.
Anthropic Co-founder Ben Mann Visits Taiwan to Discuss AI Safety and Claude Strategy
INSIDE 硬塞 AI9 days agoBusiness
Anthropic co-founder and Anthropic Labs lead Ben Mann made his first visit to Taiwan, according to INSIDE. The report highlights his role in leading Claude Code and the Model Context Protocol, two key parts of Anthropic’s developer-focused product direction. The discussion centered on Claude strategy, AI safety boundaries, jobs, and Taiwan’s strategic role in the AI landscape.
Arithmetic Without Numbers: How LLMs Do Math
Hacker News (AI keywords)9 days agoCommentary
The article asks whether LLM arithmetic is memorization, heuristics, real computation, or experimental assistance. It summarizes Rune experiments that decode operations and operands from frozen Llama activations, then route them to Python under a no-parser rule. The strongest supported claim is narrow: activation-derived tool arguments worked in scoped audits, while residual-state JIT replacement, long-number generation, and cross-model transfer remain brittle.
MIT Develops Electrochemical CO2 Capture Using NHI Molecules
INSIDE 硬塞 AI9 days agoPaper
MIT has proposed a new electrochemical carbon capture approach that uses NHI molecules as the adsorbent. Instead of relying on energy-intensive heat-driven processes, the system is powered by electricity. The method could improve efficiency and scalability, but the provided source frames it as a promising research direction rather than a proven commercial deployment.
Show HN: Lowfat – pluggable CLI filter that saved 91.8% of my LLM tokens
Hacker News (AI keywords)9 days agoNew Tool
This Show HN post introduces Lowfat, described only by its title as a pluggable CLI filter. The stated value proposition is reducing LLM token usage, with the author claiming it saved 91.8% of their tokens. Without the original body text, implementation details, supported workflows, model compatibility, and the generality of the savings claim cannot be verified.
The Meta hack shows there’s more to AI security than Mythos★ 74
MIT Tech Review AI10 days agoIncident
Attackers reportedly used Meta’s AI customer support agent to hijack Instagram accounts by asking it to link accounts to attacker-controlled emails. MIT Technology Review frames the incident as a reminder that AI security is not only about powerful future systems like Mythos. The immediate risk is giving AI agents sensitive operational powers without strong authentication, permissions, review, and testing.
ESP32 Bit Pirate, a Hardware Hacking Tool with WebCLI
Hacker News (AI keywords)10 days agoHardware
ESP32 Bit Pirate is presented as a hardware hacking tool built around ESP32 with a WebCLI interface. The title suggests browser-accessible command-line control for interacting with hardware protocols. Because no article body is available, supported protocols, maturity, documentation quality, and practical use cases cannot be verified.
Show HN: I Derived a Pancake
Hacker News (AI keywords)10 days agoTutorial
This Show HN post is not an AI tool, but a science-heavy cooking project. It frames pancakes as a chemistry and ratio problem involving leavening, acid-base stoichiometry, protein structure, and CO2 generation. The apparent value is an interactive calculator that helps derive predictable pancake proportions from available ingredients.
Fine-tuning an LLM to write docs like it's 1995
Hacker News (AI keywords)10 days agoTutorial
The author builds a corpus from old Microsoft manuals, cleans OCR text, generates instruction-style JSONL examples, and fine-tunes Llama 3.1 8B and Qwen 2.5 7B with QLoRA. Tests cover malloc(), a fictional Win32 API, and a deliberately anachronistic REST API prompt. Qwen fine-tunes transfer the period documentation style best, but the experiment also shows hallucination risks, tuning complexity, and why these models augment rather than replace technical writers.
The Pentagon Is Running an AI Propaganda Mill Targeting Latin America
Hacker News (AI keywords)10 days agoEthics
The Intercept says a site called La Tilde presents itself as a Latin American media brand while publishing content aligned with U.S. military messaging. The outlet reportedly mixes lifestyle and finance articles with pieces praising U.S. actions in the region. The case raises concerns about AI-generated media, covert influence operations, source transparency, and the blurred line between journalism and state propaganda.
Magenta RealTime 2: An Open, Locally Runnable Real-Time Music Model★ 74
Hacker News (AI keywords)10 days agoRelease
Magenta RealTime 2 is an open-weights live music model designed for interactive performance rather than offline prompt-to-song generation. It supports real-time control through MIDI, audio, and text, and can run as standalone apps, DAW plugins, or embedded music software. Google Magenta also released a Python library, C++ MLX inference engine, models, and example applications for musicians and developers.
From Lab to Market: How FITI Helps Nearly 600 Startups Take Their First Step
INSIDE 硬塞 AI10 days agoBusiness
INSIDE introduces Taiwan’s FITI program as a bridge between academic research and startup commercialization. The program helps research teams build business thinking and market connection capabilities through mentors, courses, and supporting resources. Its focus is helping technology-driven teams shorten the gap between laboratory research and market entry, with the article highlighting FITI’s role in accompanying nearly 600 startups through their earliest entrepreneurial steps.
AI enthusiasts are in a race against time, AI skeptics are in a race against entropy
Simon Willison's Weblog10 days agoCommentary
Simon Willison highlights Charity Majors’ framing of AI enthusiasts and skeptics as both responding to real existential threats. Enthusiasts see teams gaining discontinuous capability by leaning into AI, making inaction dangerous in competitive markets. Skeptics see faster code production eroding shared understanding, reliability, institutional knowledge, and on-call sustainability. The core challenge is organizational: there is no natural feedback loop connecting these perspectives.
South Korean forums may need AI scans for uploaded images and videos
Hacker News (AI keywords)10 days agoRegulation
A Privacy Guides community post says South Korean forums and online communities may be required to scan user-uploaded images and videos with AI under telecom-related rules. The post claims operators must provide their own hardware, including costly Nvidia GPUs. The debate centers on illegal sexual imagery and CSAM prevention, but also raises concerns about prior censorship, false positives, free expression, and burdens on small domestic communities.
Ask HN: What was your "oh shit" moment with GenAI?
Hacker News (AI keywords)10 days agoCommentary
This Hacker News Ask HN post asks the community to share the moment GenAI felt unexpectedly powerful, disruptive, or concerning. Since no body text or comments were provided, only the topic can be summarized safely. Its value lies in surfacing practitioner reactions and lived experiences around GenAI’s impact, rather than reporting a concrete launch, paper, benchmark, or incident.
The Skeptic’s Guide to Humanoid Robots Going Viral on the Internet
Ars Technica AI10 days agoCommentary
The article warns that viral humanoid robot demonstrations can distort public perception of robotics progress. Carefully staged or selectively shown clips may make systems appear more autonomous, reliable, or deployment-ready than demonstrated evidence supports. The useful takeaway is to separate impressive demos from repeatable real-world capability, especially when evaluating hype, investment narratives, or product claims.
Show HN: Formally verified polygon intersection, Opus 4.8 one-shot
Hacker News (AI keywords)10 days agoNew Tool
This GitHub project presents a formally verified multipolygon intersection algorithm checked in Lean 4. The author argues trust comes from the Lean checker and a small human-reviewed specification, not from trusting LLM output directly. It also documents how Claude Opus versions improved on Lean proof work, with Opus 4.8 reportedly completing larger proof strategies that earlier attempts could not.
Co-Existence and the End of Co-Intelligence
One Useful Thing (Mollick)10 days agoCommentary
Ethan Mollick’s One Useful Thing post announces or frames Co-Existence, the follow-up to Co-Intelligence. The core shift is from prompting chatbots as collaborators toward living and working alongside increasingly embedded AI systems. It is best read as commentary and book positioning, not a technical release, benchmark, or tool tutorial.
These LLMs are the best at resisting Russian propaganda
Ars Technica AI10 days agoBenchmark
Ars Technica reports on an Estonian government benchmark evaluating how large language models handle Russian propaganda. The test focuses on whether dozens of models resist, repeat, or normalize Russia’s strategic narratives. The topic matters for governments, researchers, and AI builders because LLMs are increasingly used to summarize and mediate public information.
Reality: The Final Eval — Lukas Petersson and Axel Backlund of Andon Labs
Latent Space10 days agoBenchmark
Latent Space talks with Lukas Petersson and Axel Backlund of Andon Labs, the authors behind VendingBench. The episode focuses on evaluating Claude models across a range from Haiku to Mythos. It also discusses how they build frontier evals from scratch, with an emphasis on creating benchmarks that remain useful and meaningful over time.
Elon Musk tries again to escape FTC audits of X data handling
Ars Technica AI10 days agoRegulation
Ars Technica reports that Elon Musk is again seeking to escape FTC audits over how X handles user data. Public commenters warned the FTC that Musk cannot be trusted to protect X users’ privacy. The story centers on platform governance, privacy oversight, and whether external audits should remain in place for X’s data practices.
Nemotron 3.5 Content Safety: Customizable Multimodal Safety for Global Enterprise AI
Hugging Face Blog10 days agoRelease
NVIDIA’s Nemotron 3.5 Content Safety is positioned as a customizable multimodal safety layer for global enterprise AI. Based on the title, it appears focused on content moderation and policy enforcement across AI applications, potentially including text and visual contexts. Without the full article, details such as benchmarks, licensing, supported languages, deployment paths, and model specifications should not be assumed.
Google Requested a Slightly Different AI Statement
Simon Willison's Weblog10 days agoEthics
Simon Willison quotes Emanuel Maiberg of 404 Media about a post-publication request from Google. After the story ran, Google asked the outlet to publish a slightly different version of its statement. The notable change: the revised statement no longer said it was critical to maintain humans in the loop, raising questions about corporate AI accountability language.
The LLM warnings Google fired Timnit Gebru over have all come true
Hacker News (AI keywords)10 days agoEthics
The post frames Timnit Gebru’s dispute with Google as an early warning about large language model risks. Based on the available title, it appears to argue that concerns around bias, accountability, concentration of power, and deployment risks have since become visible in practice. This is best read as AI ethics commentary, not a model release or technical tutorial.
Is Silicon Valley ready to put robots in people’s homes? Hello Robot is.
TechCrunch AI10 days agoHardware
Hello Robot has released Stretch 4, the fourth generation of its home assistance robot. The company is taking a cautious, deployment-first approach, using a wheeled base, telescoping arm, sensors, and human-in-the-loop control rather than promising a general-purpose humanoid. TechCrunch frames Stretch as a practical bet on real household data, assistive use cases, and safer hardware for people with mobility challenges.
How some data center operators are tackling their water use problems
Ars Technica AI10 days agoBusiness
Ars Technica examines how hyperscalers and data center operators are facing pressure over water use. The issue centers on local water availability and quality as AI infrastructure expands. The provided excerpt says some operators are trying to address the problem, but does not specify companies, methods, or measured results.

← PreviousPage 13Next →

Latest in AI

Leak Reveals Microsoft Wants Its AI to Be 'Addictive'

The Quiet Numbers Station: Decoding Nineteen Years of GPS Cryptography

Did Claude Increase Bugs in rsync?

Unlocking dependable responses with Gemini Enterprise Agent Platform’s Agentic RAG★ 72

Quoting Andreas Kling

Anthropic Co-founder Ben Mann Visits Taiwan to Discuss AI Safety and Claude Strategy

Arithmetic Without Numbers: How LLMs Do Math

MIT Develops Electrochemical CO2 Capture Using NHI Molecules

Show HN: Lowfat – pluggable CLI filter that saved 91.8% of my LLM tokens

The Meta hack shows there’s more to AI security than Mythos★ 74

ESP32 Bit Pirate, a Hardware Hacking Tool with WebCLI

Show HN: I Derived a Pancake

Fine-tuning an LLM to write docs like it's 1995

The Pentagon Is Running an AI Propaganda Mill Targeting Latin America

Magenta RealTime 2: An Open, Locally Runnable Real-Time Music Model★ 74

From Lab to Market: How FITI Helps Nearly 600 Startups Take Their First Step

AI enthusiasts are in a race against time, AI skeptics are in a race against entropy

South Korean forums may need AI scans for uploaded images and videos

Ask HN: What was your "oh shit" moment with GenAI?

The Skeptic’s Guide to Humanoid Robots Going Viral on the Internet

Show HN: Formally verified polygon intersection, Opus 4.8 one-shot

Co-Existence and the End of Co-Intelligence

These LLMs are the best at resisting Russian propaganda

Reality: The Final Eval — Lukas Petersson and Axel Backlund of Andon Labs

Elon Musk tries again to escape FTC audits of X data handling

Nemotron 3.5 Content Safety: Customizable Multimodal Safety for Global Enterprise AI

Google Requested a Slightly Different AI Statement

The LLM warnings Google fired Timnit Gebru over have all come true

Is Silicon Valley ready to put robots in people’s homes? Hello Robot is.

How some data center operators are tackling their water use problems