Latest in AI

Unsloth Gemma 4 QAT MTP assistant models now available
r/LocalLLaMA top day · 5 days ago · Release
An r/LocalLLaMA post notes that Unsloth’s Gemma 4 QAT MTP assistant models are now available in GGUF format. The root directories include q8_0 files named mtp-gemma-4-*.gguf, while MTP folders contain q8_0 and larger quantized variants. The listed releases cover 12B, 26B-A4B, 31B, E2B, E2B mobile, E4B, and E4B mobile it-qat-GGUF repositories.
It's Not FAANG Anymore. It's MANGOS.
TechCrunch AI · 5 days ago · Commentary
The tech industry's shorthand for power is getting an update. As SpaceX, Anthropic, and OpenAI eye massive public market debuts, a new acronym — MANGOS — is emerging to replace the decade-old FAANG. The shift signals that AI and deep tech companies are becoming the new dominant forces in capital markets, displacing the platform and consumer internet era's giants.
TTS Benchmark Revamped with Objective Standards and Blind ELO Voting (46 Models)
r/LocalLLaMA top day · 5 days ago · Benchmark
Reddit user UkieTechie has revamped their TTS benchmark platform with objective scoring standards and live blind voting, now covering 46 speech synthesis models. Hosted as a Hugging Face Space, the arena lets users vote on audio quality without knowing the model name, and the votes feed a dynamic Elo leaderboard. The project is open source on GitHub and welcomes community submissions of new models.
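The post does not say which rating formula the arena uses. As a rough illustration of how blind pairwise votes can drive a leaderboard, here is a minimal sketch of the standard Elo update. The function names, the K-factor of 32, and the 1500 starting rating are assumptions chosen for the example, not details of the project.

```python
def expected_score(rating_a: float, rating_b: float) -> float:
    """Probability that model A wins a vote against model B under standard Elo."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


def update_elo(rating_a: float, rating_b: float, score_a: float, k: float = 32.0):
    """Return updated (rating_a, rating_b) after one blind vote.

    score_a is 1.0 if the voter preferred A, 0.0 if they preferred B,
    and 0.5 for a tie. k (assumed 32 here) controls how far one vote moves a rating.
    """
    exp_a = expected_score(rating_a, rating_b)
    exp_b = 1.0 - exp_a
    new_a = rating_a + k * (score_a - exp_a)
    new_b = rating_b + k * ((1.0 - score_a) - exp_b)
    return new_a, new_b
```

With two models that both start at 1500, a single vote for the first gives `update_elo(1500, 1500, 1.0)`, which returns `(1516.0, 1484.0)`. An upset win against a much higher-rated model moves both ratings further than an expected win does.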
From one-off prompts to workflows: How to use custom agents in GitHub Copilot CLI
GitHub Blog · 5 days ago · Tutorial
GitHub Copilot CLI now supports custom agents that understand your specific tech stack and team conventions. This feature transforms one-off natural language terminal prompts into standardized, repeatable workflows. It's especially useful for teams wanting consistent, auditable processes for deployments, code review prep, or environment setup.
'Sloppenheimer:' Amazon Employees Mock the Company's AI on Slack
Hacker News (AI keywords) · 5 days ago · Commentary
Amazon employees have been using the term 'Sloppenheimer'—a portmanteau of 'slop' and 'Oppenheimer'—to mock their company's AI products on internal Slack channels. The incident highlights a stark gap between Amazon's aggressive public AI messaging and internal employee skepticism about actual output quality. It reflects a broader industry backlash against AI-generated low-quality content across major tech platforms.
Judge Learns Both Sides Used AI, Cancels Trial, Kicks Everyone Off the Case
Hacker News (AI keywords) · 5 days ago · Incident
In a rare legal incident, a judge found that attorneys on both sides of a case had used AI tools in their legal work. The judge responded by canceling the trial entirely and dismissing all lawyers involved. The case highlights growing judicial frustration with unchecked AI use in court filings and the serious professional consequences that can follow.
Can LLMs Beat Classical Hyperparameter Optimization Algorithms?
Hacker News (AI keywords) · 5 days ago · Benchmark
This paper investigates whether LLMs can serve as effective hyperparameter optimization (HPO) agents, competing with established classical methods such as Bayesian optimization, TPE, and random search. The study likely employs a systematic evaluation framework where LLMs iteratively suggest hyperparameter configurations based on task descriptions and historical evaluation results. Findings aim to clarify the practical potential and limitations of LLMs in AutoML pipelines.
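For context on what the classical side of that comparison looks like, here is a minimal sketch of random search, the simplest of the baselines named above. It is not the paper's evaluation code. The `random_search` function, the search-space format, and the toy objective in the usage note are all hypothetical, chosen only to show the loop.

```python
import random


def random_search(objective, space, n_trials=50, seed=0):
    """Sample configurations uniformly at random and keep the best one.

    objective: callable taking a dict of hyperparameters, returning a loss (lower is better).
    space: dict mapping each hyperparameter name to a (low, high) range.
    """
    rng = random.Random(seed)  # local RNG so runs are reproducible
    best_cfg, best_loss = None, float("inf")
    for _ in range(n_trials):
        # Draw every hyperparameter independently from its range.
        cfg = {name: rng.uniform(lo, hi) for name, (lo, hi) in space.items()}
        loss = objective(cfg)
        if loss < best_loss:
            best_cfg, best_loss = cfg, loss
    return best_cfg, best_loss
```

A call might look like `random_search(lambda c: (c["lr"] - 0.1) ** 2, {"lr": (0.0, 1.0)})`. An LLM-based optimizer, as the summary describes it, would replace the uniform sampling step with configurations proposed from the task description and the history of past trials. Bayesian optimization and TPE replace it with a model fitted to that history.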
Apple’s AI promises are finally, almost, sort of, here ★ 72
The Verge AI · 5 days ago · Commentary
Apple kicked off its annual developer conference with bold AI promises centered around a revamped "Siri AI" and Apple Intelligence. While CEO Tim Cook touted these as boundary-pushing innovations, the announcements largely represent Apple playing catch-up in the generative AI race. The slow, phased rollout suggests Apple is still struggling to match the rapid pace of competitors like Microsoft and Google.
Google Introduces Gemma 4 12B: A Unified, Encoder-Free Multimodal Model ★ 85
Google DeepMind Blog · 5 days ago · Release
Google DeepMind has unveiled Gemma 4 12B, a next-generation open-weights model featuring a unified, encoder-free multimodal architecture. By eliminating the traditional separate vision encoder (such as ViT), it processes diverse modalities directly within a single Transformer network. This design simplifies training, reduces inference latency, and enhances cross-modal alignment, marking a significant milestone for open-source AI.
PR-CAD: Progressive Refinement for Text-to-CAD Generation with LLMs
Hacker News (AI keywords) · 5 days ago · Paper
This arXiv paper introduces PR-CAD, a framework for controllable and faithful text-to-CAD generation with large language models. It treats CAD creation and editing as one progressive refinement process rather than separate tasks. The authors curate an interaction dataset and report state-of-the-art controllability and faithfulness on public benchmarks.
Google DeepMind Launches Initiative to Power the Future of Robotics in Europe ★ 70
Google DeepMind Blog · 5 days ago · Business
Google DeepMind has unveiled a strategic initiative to power the future of robotics in Europe. The program focuses on advancing Embodied AI and physical AI through deep collaborations with European academic institutions and industry partners. By combining DeepMind's AI expertise with Europe's strong engineering foundation, the initiative aims to accelerate breakthroughs in robotic generalization and safety.
Apple Announced a New On-Device Inference Engine for Apple Silicon
r/LocalLLaMA top day · 5 days ago · Release
Apple announced CoreAI at WWDC, which the post frames as a possible future replacement for CoreML and an alternative to MLX, llama.cpp, and torch for optimized on-device inference. Models still need conversion through Python scripts, and the currently supported models appear to date mostly from mid-2025. No performance data is available yet; the author expects it may trail MLX on GPU, but Apple’s 20B on-device foundation model claim suggests larger app-bundled models could become possible.
Is Grep All You Need? How Agent Harnesses Reshape Agentic Search
Hacker News (AI keywords) · 5 days ago · Paper
Echoing the famous Transformer paper, this work asks whether grep alone is sufficient for agentic search scenarios. The study focuses on 'agent harnesses'—the scaffolding wrapping an LLM, including prompting strategy, tool access, and memory—as the primary driver of search quality. Findings suggest harness design may matter more than the underlying model, challenging the community's focus on model scaling.
Rust-native CPU-only LFM2.5-8B-A1B inference library "bebelm" published as cargo crate
r/LocalLLaMA top day · 5 days ago · New Tool
Community developer maximecb has published bebelm, a Rust-native, GPU-free inference implementation of Liquid AI's LFM2.5-8B-A1B model, available on crates.io. Decode speed reaches ~37 tokens/s on a Ryzen 7950x with ~7GB memory footprint; prefill is unoptimized and currently similar in speed to decode. The library supports tool-use callbacks, weight sharing across multiple Agent instances with independent KV caches, and Agent cloning to skip repeated prefill on shared prompts.
Apple Says Its AI is Still Private Even When Running on Google's Servers ★ 75
Ars Technica AI · 5 days ago · Ethics
Apple says that running some of its AI models on Google's cloud infrastructure does not compromise user privacy. According to the company, its Private Cloud Compute (PCC) architecture processes all data in secure enclaves with end-to-end encryption, so Google has no access to user data. The statement responds to privacy concerns over Apple's cloud partnerships.
Lovable Hits $500M Annualized Revenue with 1 Million New Projects Weekly ★ 82
TechCrunch AI · 5 days ago · Business
AI software development platform Lovable has surpassed $500 million in annualized run-rate revenue (ARR). The company reports that users are now launching over 1 million new projects per week on the platform. This rapid growth highlights a major shift, with users increasingly leveraging AI to build full-scale businesses and replace legacy internal software.
Jetson Orin NX Build for Hermes Agent + Benchmarking
r/LocalLLaMA top day · 5 days ago · Hardware
The post describes turning an unused Jetson Orin NX into a compact local LLM server for Hermes Agent testing. The goals were low noise, over 10 tok/s generation, 300 tok/s prompt processing, at least 65K context, and a custom case. After testing Gemma 4, Qwen 3.6, and many quant variants, the author reports Gemma 4 26B A4B UD Q2_K_XL reaching 66K context and 10.21 tok/s near 60K context.
Learning to lead in a hybrid human-AI enterprise
MIT Tech Review AI · 5 days ago · Business
MIT Technology Review says AI agent adoption could surge by as much as 300% over the next two years. Unlike traditional automation that depends on manual input, agents can autonomously coordinate complex tasks across tools and environments. The article frames this as a leadership challenge: organizations must rethink workflows, oversight, roles, and governance for hybrid human-AI enterprises.
NeuroBait: I fine-tuned a model to spark dopamine for ADHD brain
Hugging Face Blog · 5 days ago · New Tool
NeuroBait is a Hugging Face community project built to help with ADHD task-initiation freeze rather than diagnosis or to-do planning. It fine-tunes google/gemma-3-12b-it with LoRA to produce short, warm, context-aware nudges. The project uses Unsloth and Modal for training, then deploys on a Hugging Face Space with Gradio, transformers, peft, and a runtime LoRA adapter.
ByteDance Open-Sources Bernini, a Unified Framework for AI Video Editing ★ 74
量子位 QbitAI · 5 days ago · Release
ByteDance’s commercial technology team has open-sourced Bernini, a unified framework for AI video generation and editing. Its design separates semantic planning from visual rendering: an MLLM-based planner understands text, source videos, images, and video references, then a DiT-based renderer produces the final video. The released Bernini-R includes inference code and weights, while the full planner-enabled version is still being prepared.
Amap Releases ABot-Earth 0.5: Shifting from 2D Distillation to 3D Native for Consistent Scene Generation ★ 70
量子位 QbitAI · 5 days ago · Release
Amap has released ABot-Earth 0.5, its latest spatial intelligence model. Moving beyond 2D distillation methods such as Score Distillation Sampling, the model adopts a 3D-native architecture. The approach targets multi-view inconsistency and distortion, aiming for highly consistent 3D scene generation for autonomous driving simulation, smart cities, and digital twin mapping.
Yu Ai Wei Wu Showcases Education AI Model and Learning Agent at Tencent Cloud Event
量子位 QbitAI · 5 days ago · Business
According to the title, Yu Ai Wei Wu appeared at Tencent Cloud’s AI industry application conference with a focus on education models and learning Agents. The positioning suggests an effort to apply AI more deeply to personalized learning or teaching workflows. Since the original article text was not provided, specific product features, model architecture, partnerships, and real-world results cannot be verified.
Xiaohongshu Is Growing a GitHub for AI Skills
量子位 QbitAI · 5 days ago · New Tool
QbitAI reports that Xiaohongshu is testing RED Skill, letting creators attach AI Skills directly under posts. Users can open a Skill page and copy it into assistants such as Codex, Claude Code, or OpenClaw. Nearly 1,000 original Skills have appeared during testing, spanning PPTs, interviews, papers, fitness, travel, and lifestyle use cases, with broader creator rollout expected in July.
A 4B Edge-Deployable Cognitive Model Built in China
量子位 QbitAI · 5 days ago · Release
QbitAI’s headline says a domestic Chinese team has built a 4B-parameter “cognitive model” suitable for edge deployment. The framing links it to a model direction previously associated with Andrej Karpathy. Since the article body was not provided, details such as the model name, architecture, benchmark results, hardware requirements, open-source status, and licensing remain unverified.
Is a New Player Joining China’s Top-Tier General AI Models?
量子位 QbitAI · 5 days ago · Commentary
Based only on the title, the article likely examines China’s domestic general-purpose AI model landscape and asks whether a new company or model is entering the top tier. It appears to be an industry observation rather than a technical paper or tutorial. Without the full text, the specific model, company, benchmark evidence, and business context cannot be verified.
Voice AI for Greece
ElevenLabs Blog · 5 days ago · Business
ElevenLabs published a blog post titled “Voice AI for Greece” on June 9, 2026. Without the article body, the confirmed scope is limited to ElevenLabs, Voice AI, and a Greece-related context. It may be relevant to readers tracking multilingual voice generation, localization, and regional AI adoption, but no specific feature, partnership, or model claim can be verified from the title alone.
ElevenLabs partners with UK Government on voice AI for public services
ElevenLabs Blog · 5 days ago · Business
ElevenLabs signed a Memorandum of Understanding with the UK’s DSIT to explore voice AI for public services, accessibility, AI security, and talent development. The work will examine government information access for visually impaired users, older citizens, low-literacy groups, learning differences, and multilingual communities. The company is also expanding in London, moving to a larger HQ and aiming to double UK headcount to 200 this year.
ggml-webgpu improves prefill speeds for k-quants in llama.cpp PR
r/LocalLLaMA top day · 5 days ago · Benchmark
llama.cpp PR #24225 improves ggml-webgpu matrix multiplication performance for k-quants and refactors matmul paths for Q4/Q5/Q8 and k-quants. In pp512 tests on an M2 Pro, reported speedups range from about 1.33x to 3.78x across Q2_K, Q3_K, Q4_K, Q5_K, and Q6_K. The largest gains appear on Q3_K models, including Qwen and Gemma examples.
Why Apple’s slow-and-steady AI bet is starting to look pretty smart
TechCrunch AI · 5 days ago · Commentary
The piece revisits criticism that Apple has fallen behind in the AI race, especially around Siri and Apple Intelligence. It argues that Apple’s slower approach could look smarter as the industry moves beyond flashy demos toward reliable, integrated user experiences. The key idea is that Apple’s ecosystem, device control, privacy positioning, and developer reach may matter more than racing to ship standalone AI chatbots.
JetBrains Mellum 2: a really good and performant model
r/LocalLLaMA top day · 5 days ago · Benchmark
An r/LocalLLaMA user shared informal impressions of JetBrains Mellum 2, focusing on local coding-style tasks and tool calls. On an AMD Radeon RX 7900 XT with llama.cpp Vulkan and 131K context, the model reportedly generated around 111 tokens/s and stayed above 100 tokens/s near full context. The author stresses this is not a scientific benchmark, but a practical workflow-oriented test.
