Latest in AI

Showing:DevelopersClear ×

🔥 Trending today

open-source3 anthropic3 amazon3 ai-regulation2 government-policy2 export-controls2 geopolitics2 privacy2 python-packaging2 webassembly2

Topic

Release New Tool Tutorial Business Paper Benchmark Opinion Regulation

For

General Developers Designers Product Founders Marketing Researchers Students

SCAIL-2: Open-Source End-to-End Character Animation Without Intermediate Pose Representations
r/LocalLLaMA top day5 days agoRelease
SCAIL-2 by zai-org removes the reliance on skeleton maps and inpainting masks common in prior character animation pipelines, driving characters directly from video in an end-to-end manner. Trained on 60K synthesized motion pairs using SCAIL-Preview, Wan-Animate, and MoCha via a Unified Motion Transfer Interface with RoPE design, the model develops emergent abilities beyond its teacher models. Capabilities include cross-identity character replacement, animal-driving scenarios, and zero-shot support for SAM3D-Body mesh rendering.
GPT-2: Too Dangerous To Release — A 2019 Retrospective
Hacker News (AI keywords)5 days agoCommentary
In 2019, OpenAI staged the release of GPT-2, citing fears it could enable large-scale disinformation and spam generation. The move sparked debate: was it responsible AI safety practice or a savvy PR stunt? Written in late 2022, this blog post revisits the episode now that GPT-2 looks quaint compared to GPT-3/4, asking whether the original fears were justified.
Releasing Cohere North Mini Code
r/LocalLLaMA top day5 days agoRelease
Cohere’s Jay Alammar announced the official release of North Mini Code after early community feedback from r/LocalLLaMA. Weights are available on Hugging Face, including an fp8 version, and the model can be tried for free through OpenCode. For vLLM deployment, Cohere recommends using vLLM main for now and installing cohere_melody for accurate response parsing, while noting community requests for quantization and llama.cpp support.
Anthropic Requires Fable and Mythos Models to Retain Data for 30 Days★ 74
Hacker News (AI keywords)5 days agoEthics
Anthropic says Mythos-class models require limited prompt and output retention for trust and safety work across platforms where they are offered. The policy took effect on June 9, 2026 and mainly affects organizations using Zero Data Retention through Claude Console, Claude Code Enterprise, AWS Bedrock, Google Cloud Agent Platform, or Microsoft Foundry. Consumer Claude Free, Pro, and Max plans are unchanged, while Anthropic describes restricted human review and automatic deletion after 30 days.
Watch agents fight: a live challenge to speed up Gemma 4 E4B inference on a single A10G
r/LocalLLaMA top day5 days agoBenchmark
A public HuggingFace Spaces dashboard hosts a live competition where AI agents race to optimize Gemma 4 E4B inference throughput on a single NVIDIA A10G GPU. The challenge gamifies ML inference engineering, letting anyone watch agents explore quantization and scheduling strategies in real time. Optimization recipes surfaced by the competition offer practical value for developers targeting single-GPU self-hosted Gemma 4 deployments.
What it feels like to work with Mythos
One Useful Thing (Mollick)5 days agoCommentary
Ethan Mollick of One Useful Thing shares his personal experience working with Mythos, a project tied to Claude Fable. His central claim is that Claude Fable represents another significant, qualitative leap in AI capability rather than an incremental update. Writing from a knowledge-worker perspective rather than a purely technical one, Mollick's assessment serves as an early signal for practitioners evaluating whether this model meaningfully changes how they work.
Anthropic Releases Claude Fable 5, Its First Public Mythos-Class Model, With Guardrails for High-Risk Domains★ 76
TechCrunch AI5 days agoRelease
Anthropic has released Claude Fable 5, marking the first time a model from its high-capability Mythos family is available to the general public. The model includes built-in guardrails that restrict responses in high-risk domains such as cybersecurity and biology to mitigate misuse potential. The launch comes just days after Anthropic publicly warned that AI technology is becoming increasingly and alarmingly dangerous.
Anthropic Releases Claude Fable 5, Its First Mythos-Class Model★ 78
The Verge AI5 days agoRelease
Anthropic has released Claude Fable 5, the company's most powerful model ever made widely available and its first under the new 'Mythos' model class. The model shows exceptional performance across software engineering, knowledge work, and vision tasks. Its advantage over competing models reportedly grows wider as tasks increase in length and complexity, making it particularly suited for demanding, multi-step workloads.
System Card: Claude Fable 5 and Claude Mythos 5★ 82
Hacker News (AI keywords)5 days agoRelease
Anthropic has published system cards for its two newest flagship models, Claude Fable 5 and Claude Mythos 5, following its standard responsible-release practice. These documents cover dangerous capability evaluations, ASL safety-level determinations, red-teaming results, and alignment assessments under the company's Responsible Scaling Policy. They serve as primary references for safety researchers, enterprise buyers, regulators, and developers assessing model risk and deployment suitability.
Anthropic Launches Claude Fable 5★ 85
Hacker News (AI keywords)5 days agoRelease
Anthropic announced Claude Fable 5 on June 9, 2026, marking a new naming generation beyond the Claude 4.X family. The announcement URL also references 'Mythos 5,' suggesting a companion model may be included in this release. With model ID claude-fable-5, this is Anthropic's most current model and relevant to developers, researchers, and enterprise users integrating Claude APIs.
Launch HN: Transload (YC P26) – Measuring Freight Items with CCTV
Hacker News (AI keywords)5 days agoNew Tool
Transload is a Y Combinator P26 startup that applies computer vision to existing CCTV footage to automatically calculate freight item dimensions, eliminating manual measurement or expensive dedicated hardware. The approach lowers adoption barriers for warehouses and logistics operators by repurposing infrastructure already in place. The team launched on Hacker News to gather early feedback from the developer and logistics community.
Cohere North Mini Code 1.0
r/LocalLLaMA top day5 days agoRelease
CohereLabs’ North Mini Code 1.0 appears to have moved from early access to final release, with weights available on Hugging Face. The Reddit post describes it as a 30B A3B coding model. Its Artificial Analysis overall score of 28 trails Qwen 3.6 35B at 43, but its coding index score of 33 is close to Qwen’s 35 and above Gemma 4 26B’s 22.
Unsloth Gemma 4 QAT MTP assistant models now available
r/LocalLLaMA top day5 days agoRelease
A r/LocalLLaMA post notes that Unsloth’s Gemma 4 QAT MTP assistant models are now available in GGUF format. The root directories include q8_0 files named mtp-gemma-4-*.gguf, while MTP folders contain q8_0 and larger quantized variants. The listed releases cover 12B, 26B-A4B, 31B, E2B, E2B mobile, E4B, and E4B mobile it-qat-GGUF repositories.
TTS Benchmark Revamped with Objective Standards and Blind ELO Voting (46 Models)
r/LocalLLaMA top day5 days agoBenchmark
Reddit user UkieTechie has revamped their TTS benchmark platform with objective scoring standards and live blind voting, now covering 46 speech synthesis models. Hosted on Hugging Face Space, the arena lets users vote on audio quality without knowing the model name, generating a dynamic ELO leaderboard. The project is open-source on GitHub and welcomes community submissions of new models.
From one-off prompts to workflows: How to use custom agents in GitHub Copilot CLI
GitHub Blog5 days agoTutorial
GitHub Copilot CLI now supports custom agents that understand your specific tech stack and team conventions. This feature transforms one-off natural language terminal prompts into standardized, repeatable workflows. It's especially useful for teams wanting consistent, auditable processes for deployments, code review prep, or environment setup.
Introducing North Mini Code: Cohere's First Model For Developers
Hugging Face Blog5 days agoRelease
Cohere officially introduces North Mini Code, the first model in its North lineup explicitly aimed at developers rather than enterprise API customers. The 'Mini' designation signals a compact, cost-efficient model suited for IDE integrations, CLI tools, and real-time code completion. This marks a strategic expansion for Cohere beyond B2B into the broader developer tooling ecosystem.
FCC Wants to Kill Burner Phones by Forcing Telecoms to Verify All Customers' IDs
Hacker News (AI keywords)5 days agoRegulation
The FCC is proposing rules that would require telecom carriers to verify the identity of every customer before activating service. This move would eliminate anonymous prepaid 'burner phones,' long used by journalists, domestic abuse survivors, and privacy-conscious individuals. Critics warn the policy could undermine digital privacy and disproportionately harm vulnerable populations, while proponents argue it would curb fraud and criminal activity.
Fluid, natural voice translation with Gemini 3.5 Live Translate
Google DeepMind Blog5 days agoRelease
Google DeepMind has released Gemini 3.5 Live Translate, bringing near real-time and naturally flowing voice translation to three major Google platforms. The feature integrates into Google AI Studio for developers, Google Translate for general users, and Google Meet for remote collaboration. The emphasis on naturalness — not just speed — marks a meaningful step forward for AI-powered multilingual communication.
Can LLMs Beat Classical Hyperparameter Optimization Algorithms?
Hacker News (AI keywords)5 days agoBenchmark
This paper investigates whether LLMs can serve as effective hyperparameter optimization (HPO) agents, competing with established classical methods such as Bayesian optimization, TPE, and random search. The study likely employs a systematic evaluation framework where LLMs iteratively suggest hyperparameter configurations based on task descriptions and historical evaluation results. Findings aim to clarify the practical potential and limitations of LLMs in AutoML pipelines.
Warning: OpenCode Go/Zen Has No Account or Data Deletion Feature
r/LocalLLaMA top day5 days agoIncident
A Reddit user warns that OpenCode Go/Zen provides no mechanism for users to delete their account or personal data. Several GitHub issues have been filed but mostly ignored; one official response only said deletion would 'probably' be added eventually. For privacy-conscious developers, this is a significant red flag before signing up to the platform.
Build a Basic AI Agent from Scratch: Long Task Planning
Hacker News (AI keywords)5 days agoTutorial
This source appears to be a tutorial about constructing a basic AI agent from scratch. Based only on the title, its focus is likely long-task planning: how an agent breaks a larger objective into steps and works through them over time. No article body was provided, so specific implementation choices, model providers, tools, code examples, or evaluation results cannot be confirmed.
Single-slot half-height PCIe V100 with NVLink appears in China
r/LocalLLaMA top day5 days agoHardware
A r/LocalLLaMA post says a Bilibili creator has shown a single-slot, half-height PCIe V100 with NVLink on a custom PCB. The card is described as 16 cm long, passively cooled by default, capped at 75W, with another version supporting up to 300W. The 16GB model is expected around or below ¥1500, with a 32GB version reportedly planned, but it is not yet available for purchase.
Apple’s AI promises are finally, almost, sort of, here★ 72
The Verge AI5 days agoCommentary
Apple kicked off its annual developer conference with bold AI promises centered around a revamped "Siri AI" and Apple Intelligence. While CEO Tim Cook touted these as boundary-pushing innovations, the announcements largely represent Apple playing catch-up in the generative AI race. The slow, phased rollout suggests Apple is still struggling to match the rapid pace of competitors like Microsoft and Google.
Rick & Morty
r/LocalLLaMA top day5 days agoCommentary
This r/LocalLLaMA top-day post is a short image meme titled “Rick & Morty.” The only accompanying text says, “nobody expected HF there,” suggesting surprise at HF appearing in the image’s context. There are no technical claims, model details, releases, or benchmarks, so its value is mainly as a small signal of community culture around Hugging Face / HF and local LLM discussions.
Google Introduces Gemma 4 12B: A Unified, Encoder-Free Multimodal Model★ 85
Google DeepMind Blog5 days agoRelease
Google DeepMind has unveiled Gemma 4 12B, a next-generation open-weights model featuring a unified, encoder-free multimodal architecture. By eliminating the traditional separate vision encoder (such as ViT), it processes diverse modalities directly within a single Transformer network. This design simplifies training, reduces inference latency, and enhances cross-modal alignment, marking a significant milestone for open-source AI.
PR-CAD: Progressive Refinement for Text-to-CAD Generation with LLMs
Hacker News (AI keywords)5 days agoPaper
This arXiv paper introduces PR-CAD, a framework for controllable and faithful text-to-CAD generation with large language models. It treats CAD creation and editing as one progressive refinement process rather than separate tasks. The authors curate an interaction dataset and report state-of-the-art controllability and faithfulness on public benchmarks.
PSA: Throttle GPU Power Limits for Major Energy Savings with Minimal Inference Performance Loss
r/LocalLLaMA top day5 days agoHardware
A Reddit user reminds the local LLM community that throttling GPU power limits offers outsized energy savings with minimal performance cost. On dual Radeon VII cards, cutting power from 250W to 100W per card resulted in less than 10% drop in inference speed. LLM inference is memory-bound rather than compute-bound, making it uniquely tolerant of reduced GPU clock speeds compared to training or rendering tasks.
Apple’s Best AI Idea Looks a Lot Like Vibe Coding★ 75
The Verge AI5 days agoCommentary
While Apple's standard AI features like chatbots and image generation play catch-up, its integration of AI with Shortcuts stands out. By allowing users to generate complex multi-app workflows and automate Safari tabs using simple natural language, Apple is bringing "vibe coding" to the masses. This approach shifts the focus from generic AI assistants to highly personalized, OS-level task automation.
Apple Announced a New On-Device Inference Engine for Apple Silicon
r/LocalLLaMA top day5 days agoRelease
Apple announced CoreAI at WWDC, which the post frames as a possible future replacement for CoreML and an alternative to MLX, llama.cpp, and torch for optimized on-device inference. Models still need conversion through Python scripts, and current supported models appear mostly from mid-2025. No performance data is available yet; the author expects it may trail MLX on GPU, but Apple’s 20B on-device foundation model claim suggests larger app-bundled models could become possible.
Is Grep All You Need? How Agent Harnesses Reshape Agentic Search
Hacker News (AI keywords)5 days agoPaper
Echoing the famous Transformer paper, this work asks whether grep alone is sufficient for agentic search scenarios. The study focuses on 'agent harnesses'—the scaffolding wrapping an LLM, including prompting strategy, tool access, and memory—as the primary driver of search quality. Findings suggest harness design may matter more than the underlying model, challenging the community's focus on model scaling.

← PreviousPage 8Next →

Latest in AI

SCAIL-2: Open-Source End-to-End Character Animation Without Intermediate Pose Representations

GPT-2: Too Dangerous To Release — A 2019 Retrospective

Releasing Cohere North Mini Code

Anthropic Requires Fable and Mythos Models to Retain Data for 30 Days★ 74

Watch agents fight: a live challenge to speed up Gemma 4 E4B inference on a single A10G

What it feels like to work with Mythos

Anthropic Releases Claude Fable 5, Its First Public Mythos-Class Model, With Guardrails for High-Risk Domains★ 76

Anthropic Releases Claude Fable 5, Its First Mythos-Class Model★ 78

System Card: Claude Fable 5 and Claude Mythos 5★ 82

Anthropic Launches Claude Fable 5★ 85

Launch HN: Transload (YC P26) – Measuring Freight Items with CCTV

Cohere North Mini Code 1.0

Unsloth Gemma 4 QAT MTP assistant models now available

TTS Benchmark Revamped with Objective Standards and Blind ELO Voting (46 Models)

From one-off prompts to workflows: How to use custom agents in GitHub Copilot CLI

Introducing North Mini Code: Cohere's First Model For Developers

FCC Wants to Kill Burner Phones by Forcing Telecoms to Verify All Customers' IDs

Fluid, natural voice translation with Gemini 3.5 Live Translate

Can LLMs Beat Classical Hyperparameter Optimization Algorithms?

Warning: OpenCode Go/Zen Has No Account or Data Deletion Feature

Build a Basic AI Agent from Scratch: Long Task Planning

Single-slot half-height PCIe V100 with NVLink appears in China

Apple’s AI promises are finally, almost, sort of, here★ 72

Rick & Morty

Google Introduces Gemma 4 12B: A Unified, Encoder-Free Multimodal Model★ 85

PR-CAD: Progressive Refinement for Text-to-CAD Generation with LLMs

PSA: Throttle GPU Power Limits for Major Energy Savings with Minimal Inference Performance Loss

Apple’s Best AI Idea Looks a Lot Like Vibe Coding★ 75

Apple Announced a New On-Device Inference Engine for Apple Silicon

Is Grep All You Need? How Agent Harnesses Reshape Agentic Search