Latest in AI

Five things you need to know about AI
MIT Tech Review AI | 5 days ago | Commentary
The article is based on a talk titled “Five things you need to know about AI,” delivered at SXSW London. The author frames it as a guide to the biggest AI themes right now, drawing partly from MIT Technology Review’s first AI10 list. From the provided excerpt, it reads as a trend-oriented editorial overview rather than a product release, paper, or technical tutorial.
Cleaning up after AI rockstar developers
Hacker News (AI keywords) | 5 days ago | Opinion
The post explores the phenomenon of "AI rockstar developers" who use AI tools to write code at breakneck speed. While appearing highly productive, they often introduce significant technical debt and architectural mess. The author highlights the growing burden on teams to clean up this AI-generated code, emphasizing the need for rigorous code review and architectural oversight.
NeuroBait: I fine-tuned a model to spark dopamine for ADHD brain
Hugging Face Blog | 5 days ago | New Tool
NeuroBait is a Hugging Face community project built to help with ADHD task-initiation freeze rather than diagnosis or to-do planning. It fine-tunes google/gemma-3-12b-it with LoRA to produce short, warm, context-aware nudges. The project uses Unsloth and Modal for training, then deploys on a Hugging Face Space with Gradio, transformers, peft, and a runtime LoRA adapter.
2026 Next-Gen AI (Shenzhen) Entrepreneurship & Innovation Competition Officially Launches
量子位 QbitAI | 5 days ago | Business
The "2026 Next-Gen AI (Shenzhen) Entrepreneurship & Innovation Competition" has officially kicked off, aiming to gather global AI startups and innovative projects. Leveraging Shenzhen's robust supply chain and hardware advantages, the event offers potential funding, policy support, and market matchmaking. This initiative marks another step forward for the Greater Bay Area in accelerating AI application deployment and ecosystem growth.
ByteDance Open-Sources Bernini, a Unified Framework for AI Video Editing ★ 74
量子位 QbitAI | 5 days ago | Release
ByteDance’s commercial technology team has open-sourced Bernini, a unified framework for AI video generation and editing. Its design separates semantic planning from visual rendering: an MLLM-based planner understands text, source videos, images, and video references, then a DiT-based renderer produces the final video. The released Bernini-R includes inference code and weights, while the full planner-enabled version is still being prepared.
Ant Group Launches Overseas AI Payment Solution to Enable Global AI Agent Operations ★ 70
量子位 QbitAI | 5 days ago | Release
Ant Group has introduced a new overseas AI payment solution designed to bridge the gap between AI agents and global transactions. The solution allows merchants to deploy AI agents that can directly process cross-border payments, creating a seamless transactional loop. This move is expected to accelerate the "Agent Economy" by turning AI assistants into revenue-generating entities.
Amap Releases ABot-Earth 0.5: Shifting from 2D Distillation to 3D Native for Consistent Scene Generation ★ 70
量子位 QbitAI | 5 days ago | Release
Amap has released ABot-Earth 0.5, its latest spatial intelligence model. Rather than relying on 2D distillation methods such as Score Distillation Sampling, the model adopts a 3D-native architecture. According to the article, this addresses multi-view inconsistency and distortion, enabling consistent 3D scene generation for autonomous driving simulation, smart cities, and digital twin mapping.
Tencent Wants Enterprises to Access AI in Only One Way
量子位 QbitAI | 5 days ago | Business
The original article text is unavailable, so this can only be inferred from the headline. It likely discusses Tencent’s attempt to make enterprise AI adoption revolve around a single platform, entry point, or workflow. The key implication is business-strategic rather than technical: enterprise AI competition may be shifting from standalone models to integrated, managed platforms.
A 4B Edge-Deployable Cognitive Model Built in China
量子位 QbitAI | 5 days ago | Release
QbitAI’s headline says a domestic Chinese team has built a 4B-parameter “cognitive model” suitable for edge deployment. The framing links it to a model direction previously associated with Andrej Karpathy. Since the article body was not provided, details such as the model name, architecture, benchmark results, hardware requirements, open-source status, and licensing remain unverified.
Xiaohongshu Is Growing a GitHub for AI Skills
量子位 QbitAI | 5 days ago | New Tool
QbitAI reports that Xiaohongshu is testing RED Skill, letting creators attach AI Skills directly under posts. Users can open a Skill page and copy it into assistants such as Codex, Claude Code, or OpenClaw. Nearly 1,000 original Skills have appeared during testing, spanning PPTs, interviews, papers, fitness, travel, and lifestyle use cases, with broader creator rollout expected in July.
DeepSeek hires IDC planners, hinting at MW-to-GW data center ambitions
量子位 QbitAI | 5 days ago | Hardware
QbitAI reports that DeepSeek has listed an IDC design and planning engineer role covering data center campuses, power, cooling, networking, and capacity planning. The job description mentions participation in MW-to-GW-scale infrastructure and technologies such as dense GPU clusters, liquid cooling, smart operations, and digital twins. The article interprets this as a sign that DeepSeek may be moving beyond rented compute toward self-built AI infrastructure.
Yu Ai Wei Wu Showcases Education AI Model and Learning Agent at Tencent Cloud Event
量子位 QbitAI | 5 days ago | Business
According to the title, Yu Ai Wei Wu appeared at Tencent Cloud’s AI industry application conference with a focus on education models and learning Agents. The positioning suggests an effort to apply AI more deeply to personalized learning or teaching workflows. Since the original article text was not provided, specific product features, model architecture, partnerships, and real-world results cannot be verified.
Is a New Player Joining China’s Top-Tier General AI Models?
量子位 QbitAI | 5 days ago | Commentary
Based only on the title, the article likely examines China’s domestic general-purpose AI model landscape and asks whether a new company or model is entering the top tier. It appears to be an industry observation rather than a technical paper or tutorial. Without the full text, the specific model, company, benchmark evidence, and business context cannot be verified.
Voice AI for Greece
ElevenLabs Blog | 5 days ago | Business
ElevenLabs published a blog post titled “Voice AI for Greece” on June 9, 2026. Without the article body, the confirmed scope is limited to ElevenLabs, Voice AI, and a Greece-related context. It may be relevant to readers tracking multilingual voice generation, localization, and regional AI adoption, but no specific feature, partnership, or model claim can be verified from the title alone.
ElevenLabs partners with UK Government on voice AI for public services
ElevenLabs Blog | 5 days ago | Business
ElevenLabs signed a Memorandum of Understanding with the UK’s DSIT to explore voice AI for public services, accessibility, AI security, and talent development. The work will examine access to government information for visually impaired users, older citizens, low-literacy groups, people with learning differences, and multilingual communities. The company is also expanding in London, moving to a larger HQ and aiming to double UK headcount to 200 this year.
Microsoft's open source tools were hacked to steal passwords of AI developers ★ 78
Hacker News (AI keywords) | 5 days ago | Incident
Microsoft temporarily removed several open source GitHub projects while investigating suspected malicious content. The affected repos were linked to Azure and developer workflows involving AI coding tools such as Claude Code, Gemini CLI, and VS Code. Security researchers said the malware could steal passwords and sensitive credentials when compromised tools were opened, though Microsoft has not disclosed how many users were affected.
Claude Fable 5 Now Available on Vercel AI Gateway
Vercel Changelog | 5 days ago | Release
Vercel has added Claude Fable 5 to its AI Gateway, enabling developers to call Anthropic's newest flagship model through a unified API proxy without managing separate credentials. AI Gateway handles cross-provider routing, monitoring, and cost tracking — upgrading to Fable 5 requires only a model ID change. This routine availability update lowers the barrier for Vercel-hosted AI apps to adopt the latest model capabilities in production.
FrontierCode: Benchmarking for Code Quality over Slop
Latent Space | 6 days ago | Benchmark
Latent Space briefly announced FrontierCode with the line “We made a thing!” From the title, FrontierCode appears to be a benchmark for frontier coding systems that prioritizes code quality rather than sheer code generation volume. The provided excerpt does not include methodology, model results, datasets, or tooling details, so conclusions should remain cautious.
Defending Against Frontier Cyber Models: Cloudflare's Project Glasswing Architecture ★ 70
Cloudflare Blog | 6 days ago | Commentary
Cloudflare introduces its defense architecture under Project Glasswing, arguing that robust architectural defense around vulnerabilities is more critical than patching speed. By acting as its own "customer zero," Cloudflare demonstrates how to mitigate autonomous frontier cyber models through edge-based isolation, zero-trust principles, and proactive traffic filtering.
L'Affaire Siloxane
Hacker News (AI keywords) | 6 days ago | Commentary
Pinboard founder and prominent tech critic Maciej Cegłowski published a piece titled in the style of historical French scandals, suggesting a serious controversy worth scrutiny. The word 'Siloxane' — a silicon-oxygen chemical compound and basis of silicone — likely serves as a metaphor or pseudonym for a tech or AI entity. Original article content was unavailable; details must be confirmed by reading the source directly.
Anyone seen benchmarks comparing Gemma 4 4-bit QAT vs. 8-bit standard quants?
r/LocalLLaMA top day | 6 days ago | Benchmark
An r/LocalLLaMA user is looking for benchmarks comparing Gemma 4 4-bit QAT models, via Unsloth, against standard 8-bit non-QAT quantized models. They understand that QAT is expected to preserve much of the BF16 baseline accuracy, but want hard numbers against traditional 8-bit PTQ. The post surfaces scattered feedback but no clear head-to-head evaluation yet.
ggml-webgpu improves prefill speeds for k-quants in llama.cpp PR
r/LocalLLaMA top day | 6 days ago | Benchmark
llama.cpp PR #24225 improves ggml-webgpu matrix multiplication performance for k-quants and refactors matmul paths for Q4/Q5/Q8 and k-quants. In pp512 tests on an M2 Pro, reported speedups range from about 1.33x to 3.78x across Q2_K, Q3_K, Q4_K, Q5_K, and Q6_K. The largest gains appear on Q3_K models, including Qwen and Gemma examples.
Why Apple’s slow-and-steady AI bet is starting to look pretty smart
TechCrunch AI | 6 days ago | Commentary
The piece revisits criticism that Apple has fallen behind in the AI race, especially around Siri and Apple Intelligence. It argues that Apple’s slower approach could look smarter as the industry moves beyond flashy demos toward reliable, integrated user experiences. The key idea is that Apple’s ecosystem, device control, privacy positioning, and developer reach may matter more than racing to ship standalone AI chatbots.
Packed twin inference doubles Qwen3.6-27B throughput on one MI50
r/LocalLLaMA top day | 6 days ago | Benchmark
An r/LocalLLaMA user shared an early packed-twin-inference experiment for local LLM acceleration. The idea resembles speculative decoding, but uses the same quantized model side-by-side instead of a smaller draft model. On a single AMD MI50, the author reports Qwen3.6-27B improving from 19.4 to 38.1 tokens/s (roughly a 1.96x speedup), with Q8-or-lower quantization as the main target.
JetBrains Mellum 2: a really good and performant model
r/LocalLLaMA top day | 6 days ago | Benchmark
An r/LocalLLaMA user shared informal impressions of JetBrains Mellum 2, focusing on local coding-style tasks and tool calls. On an AMD Radeon RX 7900 XT with llama.cpp Vulkan and 131K context, the model reportedly generated around 111 tokens/s and stayed above 100 tokens/s near full context. The author stresses this is not a scientific benchmark, but a practical workflow-oriented test.
Mercor’s Brendan Foody calls out Sequoia over dual-pricing valuation tricks
TechCrunch AI | 6 days ago | Business
TechCrunch reports that Mercor’s Brendan Foody called out Sequoia over alleged dual-pricing valuation practices. The article says Sequoia is one of several top firms that sell the same equity at two different prices. The story centers on transparency, valuation signaling, and how AI startup equity may be priced in venture markets.
Omi Med STT v1: Open-Weight Medical ASR Fine-Tuned from Parakeet 0.6B ★ 72
r/LocalLLaMA top day | 6 days ago | Release
Omi Health’s founder says he fine-tuned NVIDIA Parakeet TDT 0.6B v2 for clinical speech and released Omi Med STT v1 under CC-BY-4.0. The runtime supports Mac, Windows, and Linux, auto-selecting MLX, NeMo, or GGUF/parakeet.cpp backends. In the author’s held-out medical benchmark, it reports 2.37% medical-WER and 145× realtime on local A10 compute.
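For readers unfamiliar with the two figures quoted above, here is a minimal Python sketch of standard word error rate (word-level edit distance divided by reference length) and of a realtime factor (audio duration divided by processing time). The post does not define its "medical-WER" variant, so this shows plain WER only; the function names and sample sentences are illustrative assumptions, not part of the Omi Med STT release.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Plain WER: (substitutions + deletions + insertions) / reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference transcript must contain at least one word")

    # Word-level Levenshtein distance, keeping only the previous DP row.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


def realtime_factor(audio_seconds: float, processing_seconds: float) -> float:
    """How many seconds of audio are transcribed per second of compute."""
    if processing_seconds <= 0:
        raise ValueError("processing_seconds must be positive")
    return audio_seconds / processing_seconds
```

Under this definition, a 145× realtime factor would mean one hour of audio (3,600 s) is processed in roughly 3600 / 145 ≈ 24.8 seconds.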
A llama.cpp CLI Command Builder
r/LocalLLaMA top day | 6 days ago | New Tool
An r/LocalLLaMA post introduces a llama.cpp CLI Command Builder with no accounts, email, pop-ups, cookies, or ads. It stores information locally in the browser and includes editable fields for flags and arguments found in the documentation. Users can build CLI or server commands, log run information, and compare which configurations work best for their hardware; only Linux is currently supported.
Budgets for API Keys on Vercel AI Gateway
Vercel Changelog | 6 days ago | Release
Vercel has added per-API-key budget controls to its AI Gateway product, enabling developers to set hard spending limits on individual keys. Once a key hits its budget threshold, the gateway automatically blocks further requests, preventing unexpected cost overruns. This is especially useful for multi-tenant apps, team cost allocation, and isolating dev/test environments from production spending.
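The blocking behavior described above can be sketched as a small in-memory gate. This is a hypothetical illustration of the general idea, not Vercel's implementation or API: the `BudgetGate` class, its method names, and the choice to track spend in integer cents are all assumptions made for the example.

```python
from typing import Dict


class BudgetGate:
    """Hypothetical per-key spending gate (illustrative only, not Vercel's API)."""

    def __init__(self) -> None:
        self._limit_cents: Dict[str, int] = {}
        self._spent_cents: Dict[str, int] = {}

    def set_budget(self, api_key: str, limit_cents: int) -> None:
        """Attach a hard spending limit to a key."""
        if limit_cents < 0:
            raise ValueError("limit_cents must be non-negative")
        self._limit_cents[api_key] = limit_cents
        self._spent_cents.setdefault(api_key, 0)

    def record_spend(self, api_key: str, cost_cents: int) -> None:
        """Add the cost of a completed request to the key's running total."""
        self._spent_cents[api_key] = self._spent_cents.get(api_key, 0) + cost_cents

    def is_allowed(self, api_key: str) -> bool:
        """Return False once a budgeted key has reached its limit."""
        limit = self._limit_cents.get(api_key)
        if limit is None:
            # Assumption: keys without a budget are unrestricted in this sketch.
            return True
        return self._spent_cents.get(api_key, 0) < limit
```

For example, a key given a 100-cent budget stays allowed after 60 cents of spend and is blocked once a further 40 cents brings it to the limit. A production gateway would also need durable storage and atomic updates across concurrent requests, which this sketch leaves out.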
Domain Search is now available through the Vercel CLI
Vercel Changelog | 6 days ago | Release
Vercel has added domain search functionality to its CLI, enabling developers to query domain availability directly from the command line. Previously, this required switching to the Vercel web dashboard, adding friction to deployment workflows. The update keeps more actions within the terminal, reducing context-switching for keyboard-driven developers.
