PersonaLive! Expressive Portrait Image Animation for Live Streaming
Summary
PersonaLive is a diffusion-based framework for real-time expressive portrait animation in live streaming, achieving significant speedups through hybrid implicit signals and autoregressive streaming generation.
View Cached Full Text
Cached at: 05/08/26, 08:42 AM
Paper page - PersonaLive! Expressive Portrait Image Animation for Live Streaming
Source: https://huggingface.co/papers/2512.11253
Abstract
PersonaLive is a diffusion-based portrait animation framework that improves real-time performance through hybrid implicit signals, appearance distillation, and autoregressive streaming generation.
Currentdiffusion-based portrait animationmodels predominantly focus on enhancing visual quality and expression realism, while overlooking generation latency and real-time performance, which restricts their application range in the live streaming scenario. We propose PersonaLive, a novel diffusion-based framework towards streaming real-time portrait animation with multi-stage training recipes. Specifically, we first adopthybrid implicit signals, namelyimplicit facial representationsand3D implicit keypoints, to achieve expressive image-level motion control. Then, a fewer-stepappearance distillationstrategy is proposed to eliminate appearance redundancy in the denoising process, greatly improving inference efficiency. Finally, we introduce anautoregressive micro-chunk streaming generationparadigm equipped with asliding training strategyand ahistorical keyframe mechanismto enable low-latency and stable long-term video generation. Extensive experiments demonstrate that PersonaLive achieves state-of-the-art performance with up to 7-22x speedup over priordiffusion-based portrait animationmodels.
View arXiv pageView PDFGitHub3.05kAdd to collection
Get this paper in your agent:
hf papers read 2512\.11253
Don’t have the latest CLI?curl \-LsSf https://hf\.co/cli/install\.sh \| bash
Models citing this paper4
#### huaichang/PersonaLive Image-to-Video• UpdatedDec 26, 2025 • 133
#### suryatmodulus/PersonaLive Image-to-Video• Updated2 days ago • 2
#### Darell0009/SuperCam_Models Image-to-Video• UpdatedMar 4
#### ballemann/PersonaLive Image-to-Video• Updatedabout 1 month ago
Datasets citing this paper0
No dataset linking this paper
Cite arxiv.org/abs/2512.11253 in a dataset README.md to link it from this page.
Spaces citing this paper1
Collections including this paper11
Similar Articles
Kyohansha
Kyohansha is a web-based product delivering 60FPS Live2D AI avatars equipped with Lite-RAG long-term memory.
PersonaArena: Dynamic Simulation for Evaluating and Enhancing Persona-Level Role-Playing in Large Language Models
PersonaArena is a dynamic simulation framework that uses a large corpus of social content and a multi-agent debating judge to evaluate and improve LLMs' ability to maintain coherent and authentic persona-level role-playing in realistic social scenarios.
Persona.js
Persona.js is a JavaScript library that adds WebMCP-native AI chat to any frontend.
One Policy, Infinite NPCs: Persona-Traceable Shared RL Policies for Scalable Game Agents
Introduces PCSP, a single RL policy conditioned on frozen LLM embeddings of persona descriptions, enabling scalable, real-time persona-traceable NPC control in life simulation games. Experiments show zero-shot persona identification and behavioral alignment, with faster inference than LLM baselines.
ProactiveLLM: Learning Active Interaction for Streaming Large Language Models
ProactiveLLM introduces a method for streaming LLMs to actively decide when to generate output based on endogenous cues, using mask-based streaming modeling and synchronized privileged self-distillation, reducing latency without external annotations.