
StreamAvatar: Streaming Diffusion Models for Real-Time Interactive Human Avatars

arXiv, March 31, 2026


Authors: Zhiyao Sun, Ziqiao Peng, Yifeng Ma, Yi Chen, Zhengguang Zhou, Zixiang Zhou, Guozhen Zhang, Youliang Zhang, Yuan Zhou, Qinglin Lu, Yong-Jin Liu


Abstract: Real-time, streaming interactive avatars represent a critical yet challenging goal in digital human research. Although diffusion-based human avatar generation methods achieve remarkable success, their non-causal architecture and high computational costs make them unsuitable for streaming. Moreover, existing interactive approaches are typically restricted to the head-and-shoulder region, limiting their ability to produce gestures and body motions. To address these challenges, we propose a two-stage autoregressive adaptation and acceleration framework that applies autoregressive distillation and adversarial refinement to adapt a high-fidelity human video diffusion model for real-time, interactive streaming. To ensure long-term stability and consistency, we introduce three key components: a Reference Sink, a Reference-Anchored Positional Re-encoding (RAPR) strategy, and a Consistency-Aware Discriminator. Building on this framework, we develop a one-shot, interactive, human avatar model capable of generating both natural talking and listening behaviors with coherent gestures. Extensive experiments demonstrate that our method achieves state-of-the-art performance, surpassing existing approaches in generation quality, real-time efficiency, and interaction naturalness. Project page: this https URL.

Comments: Accepted by CVPR 2026. Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)

Cite as: arXiv:2512.22065 [cs.CV]

(or arXiv:2512.22065v2 [cs.CV] for this version)

https://doi.org/10.48550/arXiv.2512.22065

arXiv-issued DOI via DataCite

Submission history

From: Zhiyao Sun
[v1] Fri, 26 Dec 2025 15:41:24 UTC (14,125 KB)
[v2] Sat, 28 Mar 2026 07:02:24 UTC (14,520 KB)
