Latest in AI

Showing:DesignersOpen-sourceClear ×

🔥 Trending today

anthropic7 export-controls5 model-access3 ai-infrastructure3 spacex3 amazon3 national-security2 open-source2 governance2 ai-policy2

Topic

Release New Tool Tutorial Business Paper Benchmark Opinion Regulation

For

General Developers Designers Product Founders Marketing Researchers Students

SenseNova U1 Adds an Infographic-Specific Fine-Tune
r/LocalLLaMA top day4 days agoRelease
A Reddit post highlights a new infographic-specific fine-tune for SenseNova U1-8B-MoT, trained with an extended multi-task phase for structured visual output. The reported benchmarks show large gains in IGenBench infographic accuracy and chart understanding, with smaller improvement in text rendering. Aesthetic score appears roughly unchanged, suggesting the update mainly improves information structure and visual reasoning rather than overall visual polish.
SCAIL-2: Open-Source End-to-End Character Animation Without Intermediate Pose Representations
r/LocalLLaMA top day4 days agoRelease
SCAIL-2 by zai-org removes the reliance on skeleton maps and inpainting masks common in prior character animation pipelines, driving characters directly from video in an end-to-end manner. Trained on 60K synthesized motion pairs using SCAIL-Preview, Wan-Animate, and MoCha via a Unified Motion Transfer Interface with RoPE design, the model develops emergent abilities beyond its teacher models. Capabilities include cross-identity character replacement, animal-driving scenarios, and zero-shot support for SAM3D-Body mesh rendering.
How an Agent Built a 3D Paris Gallery by Chaining Two Hugging Face Spaces★ 72
Hugging Face Blog5 days agoTutorial
This Hugging Face blog post demonstrates how AI agents can use Spaces as modular tools. By chaining an image generation Space with a 3D rendering Space, an agent automatically generated art assets and placed them inside a virtual 3D gallery. This highlights the power of Hugging Face's ecosystem, where any Space can serve as an API for agentic workflows.
ByteDance Open-Sources Bernini, a Unified Framework for AI Video Editing★ 74
量子位 QbitAI5 days agoRelease
ByteDance’s commercial technology team has open-sourced Bernini, a unified framework for AI video generation and editing. Its design separates semantic planning from visual rendering: an MLLM-based planner understands text, source videos, images, and video references, then a DiT-based renderer produces the final video. The released Bernini-R includes inference code and weights, while the full planner-enabled version is still being prepared.
Building Pakistan Notice Helper: A Small AI Tool for a Very Local Safety Problem
Hugging Face Blog6 days agoNew Tool
Pakistan Notice Helper is a Build Small Hackathon project focused on suspicious notices in Pakistan, including bank, courier, tax, telecom, police, and government-style messages. It accepts text or screenshots, supports English and Urdu, and returns risk labels, red flags, explanations, and safer next steps. The author discusses choosing Qwen3.5 4B Q8 with llama.cpp, Modal, Gradio, and Hugging Face Spaces after balancing quality, cost, latency, cold starts, and safety constraints.
JoyAI-Echo open-source framework targets stable 5-minute AI long videos★ 72
量子位 QbitAI6 days agoNew Tool
QbitAI reports that JD’s team has open-sourced JoyAI-Echo, a long audio-video generation framework for multi-minute AI videos. It targets character drift, unstable voice, slow inference, and blurry output through cross-modal memory, memory-driven post-training, and lightweight real-time super-resolution. The system also includes a Director Agent for script planning, shot-level generation, localized edits, and iterative video production.
Office-open-xml-viewer: Office XML document viewer rendering to HTML Canvas
Hacker News (AI keywords)7 days agoNew Tool
office-open-xml-viewer is an open-source browser viewer for Office Open XML documents, rendering DOCX, XLSX, and PPTX files to HTML Canvas. Its parsers are written in Rust and compiled to WebAssembly, while rendering uses the Canvas 2D API. The README also says the full codebase was implemented by Claude through iterative prompting, making it notable as an AI-assisted software development case.
Magenta RealTime 2: An Open, Locally Runnable Real-Time Music Model★ 74
Hacker News (AI keywords)9 days agoRelease
Magenta RealTime 2 is an open-weights live music model designed for interactive performance rather than offline prompt-to-song generation. It supports real-time control through MIDI, audio, and text, and can run as standalone apps, DAW plugins, or embedded music software. Google Magenta also released a Python library, C++ MLX inference engine, models, and example applications for musicians and developers.
Reve 2 and Ideogram 4: Layouts in Imagegen
Latent Space10 days agoRelease
Latent Space’s roundup frames image composition as a major barrier now being tackled by layout-aware image models. Reve 2.0 emphasizes precise generation and editing with layouts, while Ideogram 4.0 uses bounding boxes tied to region descriptions. The issue also covers MAI-Thinking-1, Gemma 4 12B, open audio models, agent execution layers, and model-routing cost debates.
As browser wars heat up, top Chrome and Safari alternatives in 2026
TechCrunch AI15 days agoCommentary
TechCrunch frames 2026’s browser competition around alternatives to Chrome and Safari. The roundup covers AI-centric browsers like Perplexity Comet, Dia, Opera Neon, OpenAI Atlas, and Aside, alongside privacy-focused options such as Brave, DuckDuckGo, Ladybird, and Vivaldi. It also highlights niche products including Opera Air, SigmaOS, and Zen Browser, showing how browsers are becoming AI assistants, productivity hubs, privacy layers, and wellness-oriented tools.
如何使用 Seedance 2.0 製作出色的 AI 影片★ 70
Replicate Blog60 days agoTutorial
### A New Era of AI Video Generation: Why Now Is the Best Time With the rapid evolution of generative AI technology, video generation has transformed from an…
用任何自訂前端搭配 Gradio 後端：全新 Gradio Server 登場★ 78
Hugging Face Blog74 days agoRelease
### Gradio's Major Transformation: From Prototyping Tool to Production Backend Gradio has long been the go-to tool for machine learning developers to quickly…
使用 Gradio 的 gr.HTML 元件，一鍵生成並運行任何網頁應用程式 (One-Shot Web App)★ 80
Hugging Face Blog116 days agoNew Tool
With the explosion of large language models (LLMs) in the code generation space, features like Claude Artifacts that can "generate a complete web application…
Diffusers 正式支援 FLUX-2：下一代開源圖像生成模型降臨★ 85
Hugging Face Blog201 days agoRelease
The Hugging Face official blog has announced that the popular diffusion model library `diffusers` now officially supports FLUX-2, the next-generation…
FLUX.2 正式上架 Replicate：支援多重參考與專業級圖像生成★ 80
Replicate Blog201 days agoRelease
The cloud AI hosting platform Replicate has officially announced support for FLUX.2, the next-generation image generation model developed by Black Forest Labs…
Retro Diffusion 像素藝術模型正式上架 Replicate：輕鬆生成遊戲素材、角色精靈與地圖瓷磚
Replicate Blog207 days agoNew Tool
The well-known pixel art AI model suite Retro Diffusion has officially launched on the cloud AI hosting platform Replicate. For indie game developers, game…
VibeGame：探索「氛圍編碼」（Vibe Coding）遊戲的全新可能★ 75
Hugging Face Blog258 days agoRelease
"Vibe Coding" is one of the hottest topics in the AI world right now (popularized by figures like former Tesla AI Director Andrej Karpathy). It refers to a…
該選擇哪款影像編輯模型？Replicate 最新影像編輯模型終極評比指南★ 78
Replicate Blog264 days agoTutorial
With the rapid advancement of generative AI, image editing is no longer limited to simple text-to-image generation. Replicate has published a comprehensive…
使用 Claude 與 Hugging Face 產生圖片：透過 MCP 協定無縫整合★ 85
Hugging Face Blog299 days agoNew Tool
The Hugging Face official blog has announced an exciting new integration: through Anthropic's Model Context Protocol (MCP), users can now generate images…
開源影片生成模型回歸：Replicate 推出最快、最便宜的 Wan 2.2 模型★ 75
Replicate Blog318 days agoRelease
Replicate has announced official support for the brand-new open-source video generation model Wan 2.2 on its platform, declaring that "open-source video…
如何生成一致的角色？Replicate 評測主流圖像模型的一致性生成能力★ 75
Replicate Blog328 days agoTutorial
In the field of AI image generation, maintaining visual consistency for the same character across different scenes, actions, and expressions — known as…
Replicate 評測：如何選擇最適合你的 AI 影片生成模型★ 78
Replicate Blog342 days agoTutorial
AI video generation technology has made breakthrough advances over the past year — from closed-source systems like Sora and Runway to a flourishing open-source…
Replicate 與 Black Forest Labs 聯手舉辦 FLUX.1 Kontext 黑客松：探索下一代 AI 圖像生成應用
Replicate Blog348 days agoRelease
Cloud AI deployment platform Replicate recently announced that the "FLUX.1 Kontext Hackathon," co-hosted with renowned open-source image generation model…
在消費級硬體上微調 FLUX.1-dev：使用 QLoRA 技術指南★ 80
Hugging Face Blog360 days agoTutorial
FLUX.1-dev is a state-of-the-art open-source text-to-image model with 12 billion parameters (12B), developed by Black Forest Labs. However, due to its enormous…
在 Arm 架構上實現即時 AI 聲音生成：賦予創意自由的個人工具
Hugging Face Blog376 days agoRelease
As generative AI technology becomes more widespread, AI Sound Generation has become an indispensable part of modern multimedia creation, game development, and…
FLUX.1 Kontext 社群應用大盤點：探索 AI 圖像「上下文生成」的全新創意
Replicate Blog377 days agoCommentary
### FLUX.1 Kontext Sparks a New Wave of "In-Context Image Generation" Since Black Forest Labs introduced FLUX.1, this open-source image generation model has…
使用 FLUX.1 Kontext 用文字輕鬆編輯圖片：Black Forest Labs 最新圖像編輯模型實戰指南★ 80
Replicate Blog381 days agoRelease
Black Forest Labs (the development team behind the FLUX series of models) has launched a new image editing model called "FLUX.1 Kontext." This model is…
使用 Wan2.1 進行風格化影片生成：結合 LoRA 實現獨特視覺風格★ 75
Replicate Blog439 days agoTutorial
Alibaba's open-source Wan2.1 is a video generation model that has been receiving widespread attention, and Replicate's latest guide focuses on how to use LoRA…
Replicate 創意精選：頭像生成、光劍特效與 LoRA 實用技巧
Replicate Blog443 days agoRelease
Replicate recently published the latest edition of its "Creative Roundup," showcasing fun experiments and practical tools built by community members using…
Wan2.1 影片生成模型上線 Replicate：透過 API 單行程式碼即可生成高品質影片★ 75
Replicate Blog466 days agoNew Tool
With the rapid advancement of open-source AI, the field of video generation has seen a major breakthrough. Cloud-based AI hosting platform Replicate recently…

Page 1Next →

Latest in AI

SenseNova U1 Adds an Infographic-Specific Fine-Tune

SCAIL-2: Open-Source End-to-End Character Animation Without Intermediate Pose Representations

How an Agent Built a 3D Paris Gallery by Chaining Two Hugging Face Spaces★ 72

ByteDance Open-Sources Bernini, a Unified Framework for AI Video Editing★ 74

Building Pakistan Notice Helper: A Small AI Tool for a Very Local Safety Problem

JoyAI-Echo open-source framework targets stable 5-minute AI long videos★ 72

Office-open-xml-viewer: Office XML document viewer rendering to HTML Canvas

Magenta RealTime 2: An Open, Locally Runnable Real-Time Music Model★ 74

Reve 2 and Ideogram 4: Layouts in Imagegen

As browser wars heat up, top Chrome and Safari alternatives in 2026

如何使用 Seedance 2.0 製作出色的 AI 影片★ 70

用任何自訂前端搭配 Gradio 後端：全新 Gradio Server 登場★ 78

使用 Gradio 的 gr.HTML 元件，一鍵生成並運行任何網頁應用程式 (One-Shot Web App)★ 80

Diffusers 正式支援 FLUX-2：下一代開源圖像生成模型降臨★ 85

FLUX.2 正式上架 Replicate：支援多重參考與專業級圖像生成★ 80

Retro Diffusion 像素藝術模型正式上架 Replicate：輕鬆生成遊戲素材、角色精靈與地圖瓷磚

VibeGame：探索「氛圍編碼」（Vibe Coding）遊戲的全新可能★ 75

該選擇哪款影像編輯模型？Replicate 最新影像編輯模型終極評比指南★ 78

使用 Claude 與 Hugging Face 產生圖片：透過 MCP 協定無縫整合★ 85

開源影片生成模型回歸：Replicate 推出最快、最便宜的 Wan 2.2 模型★ 75

如何生成一致的角色？Replicate 評測主流圖像模型的一致性生成能力★ 75

Replicate 評測：如何選擇最適合你的 AI 影片生成模型★ 78

Replicate 與 Black Forest Labs 聯手舉辦 FLUX.1 Kontext 黑客松：探索下一代 AI 圖像生成應用

在消費級硬體上微調 FLUX.1-dev：使用 QLoRA 技術指南★ 80

在 Arm 架構上實現即時 AI 聲音生成：賦予創意自由的個人工具

FLUX.1 Kontext 社群應用大盤點：探索 AI 圖像「上下文生成」的全新創意

使用 FLUX.1 Kontext 用文字輕鬆編輯圖片：Black Forest Labs 最新圖像編輯模型實戰指南★ 80

使用 Wan2.1 進行風格化影片生成：結合 LoRA 實現獨特視覺風格★ 75

Replicate 創意精選：頭像生成、光劍特效與 LoRA 實用技巧

Wan2.1 影片生成模型上線 Replicate：透過 API 單行程式碼即可生成高品質影片★ 75