r/LocalLLaMA · top (day) · Jun 9, 2026, 6:43 PM · /u/pmttyji

SCAIL-2: Open-Source End-to-End Character Animation Without Intermediate Pose Representations

Original: zai-org/SCAIL-2 · Hugging Face

SCAIL-2 is an open-source model that animates characters from driving videos end-to-end, eliminating skeleton or mask intermediates.

SCAIL-2 by zai-org removes the reliance on skeleton maps and inpainting masks common in prior character animation pipelines, driving characters directly from video end to end. Trained on 60K motion pairs synthesized with SCAIL-Preview, Wan-Animate, and MoCha, through a Unified Motion Transfer Interface with a RoPE design, the model develops emergent abilities beyond these teacher models. Capabilities include cross-identity character replacement, animal-driving scenarios, and zero-shot support for SAM3D-Body mesh rendering.
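
The summary names a RoPE design without describing it. For readers unfamiliar with the term, the following is a minimal sketch of standard 1D rotary position embedding in plain Python. It is not SCAIL-2's implementation: how the Unified Motion Transfer Interface applies RoPE is not stated here, and the names `rope_rotate` and `dot` and the sample vectors are illustrative assumptions.

```python
import math


def rope_rotate(vec, position, base=10000.0):
    """Apply standard rotary position embedding to one even-length vector.

    Generic RoPE only; an illustration, not SCAIL-2's actual design.
    """
    dim = len(vec)
    if dim % 2 != 0:
        raise ValueError("RoPE needs an even number of dimensions")
    out = []
    for i in range(0, dim, 2):
        # Each consecutive pair (x, y) is rotated by an angle that grows
        # with the position and shrinks with the pair index.
        theta = position * base ** (-i / dim)
        c, s = math.cos(theta), math.sin(theta)
        x, y = vec[i], vec[i + 1]
        out.extend([x * c - y * s, x * s + y * c])
    return out


def dot(a, b):
    """Plain dot product, standing in for an attention score."""
    return sum(x * y for x, y in zip(a, b))


# Hypothetical query and key vectors.
q = [1.0, 0.0, 0.5, -0.5]
k = [0.2, 0.7, -0.3, 0.4]

# Both pairs of positions are 3 apart, so the two scores should agree
# up to floating-point error: the score depends on the offset only.
score_a = dot(rope_rotate(q, 5), rope_rotate(k, 2))
score_b = dot(rope_rotate(q, 13), rope_rotate(k, 10))
print(abs(score_a - score_b) < 1e-9)
```

The property this illustrates is that rotating queries and keys by position makes their dot product depend on relative rather than absolute position, which is the usual motivation for RoPE in attention layers.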

SCAIL-2 is a character animation model open-sourced by zai-org and released on Hugging Face. Its core advance is end-to-end character animation: the driving video controls the character directly, with no dependence on intermediate pose representations.

Summaries are AI-generated; the original article is authoritative.