About me

I am currently a Researcher at the International Digital Economy Academy (IDEA). Previously, I received my bachelor's and master's degrees from South China University of Technology, where I was advised by Prof. Yuhui Quan and Prof. Yong Xu.
My research has evolved from image processing to image and video segmentation, and then to AI-generated content (AIGC) and digital humans. My current interests focus on human-centric content understanding and generation, and their integration with embodied systems, including multimodal understanding and generation, humanoid robot motion tracking, reinforcement learning policies, and world models.

News

[2025.07] CanonSwap was accepted to ICCV 2025.

[2023.04] Joined IDEA as a Researcher.

[2022.12] HWFI was published in IJCV 2022.

Work and Internship Experience

[2023.04 - Present] Senior Researcher at the International Digital Economy Academy (IDEA).

[2021.07 - 2023.04] Researcher at Tencent ARC Lab.

[2020.04 - 2021.06] Research Intern at Tencent ARC Lab.

Research

My interests span low-level image processing, image and video segmentation, AI-generated content (AIGC), digital humans, multimodal understanding and generation, and embodied intelligence, including humanoid robot motion tracking, reinforcement learning policies, and world models.
* equal contribution, # corresponding author

PEAR: Pixel-aligned Expressive humAn mesh Recovery

Jiahao Wu, Yunfei Liu, Lijian Lin, Ye Zhu, Lei Zhu, Jingyi Li, Yu Li

SIGGRAPH 2026

Pixel-aligned expressive human mesh recovery for high-fidelity dynamic human reconstruction.

Qffusion: Controllable Portrait Video Editing via Quadrant-Grid Attention Learning

Maomao Li, Lijian Lin, Yunfei Liu, Ye Zhu, Yu Li

TVCG 2025

Controllable portrait video editing with quadrant-grid attention for precise semantic manipulation.

Identity-Preserving Video Dubbing Using Motion Warping

Runzhen Liu, Qinjie Lin, Yunfei Liu, Lijian Lin, Ye Zhu, Yu Li, Chuhua Xian, Fa-Ting Hong

IJCV 2025

Identity-preserving video dubbing with motion warping for expressive and temporally consistent speech transfer.

CanonSwap: High-Fidelity and Consistent Video Face Swapping via Canonical Space Modulation

Xiangyang Luo, Ye Zhu#, Yunfei Liu, Lijian Lin, Cong Wan, Zijian Cai, Shao-Lun Huang#, Yu Li

ICCV 2025

Canonical-space modulation for high-fidelity and temporally consistent video face swapping.

GUAVA: Generalizable Upper Body 3D Gaussian Avatar

Dongbin Zhang, Yunfei Liu, Lijian Lin, Ye Zhu, Yang Li, Minghan Qin, Yu Li, Haoqian Wang

ICCV 2025

A generalizable 3D Gaussian avatar framework for upper-body human reconstruction and animation.

HRAvatar: High-Quality and Relightable Gaussian Head Avatar

Dongbin Zhang, Yunfei Liu, Lijian Lin, Ye Zhu, Kangjie Chen, Minghan Qin, Yu Li, Haoqian Wang

CVPR 2025

A relightable Gaussian head avatar method for high-quality dynamic portrait reconstruction.

TEASER: Token Enhanced Spatial Modeling for Expressions Reconstruction

Yunfei Liu, Lei Zhu, Lijian Lin, Ye Zhu, Ailing Zhang, Yu Li

ICLR 2025

Token-enhanced spatial modeling for detailed and robust expression reconstruction.

Robust Human Matting via Semantic Guidance

Xiangguang Chen*, Ye Zhu*, Yu Li, Bingtao Fu, Lei Sun, Ying Shan, Shan Liu

ACCV 2022

Semantic guidance improves robustness in challenging human matting scenarios.

Composite Photograph Harmonization with Complete Background Cues

Yazhou Xing, Yu Li, Xintao Wang, Ye Zhu, Qifeng Chen

ACM MM 2022

Background-aware harmonization for realistic composite photograph editing.

HWFI: Hybrid Warping Fusion for Video Frame Interpolation

Yu Li*, Ye Zhu*, Ruoteng Li, Xintao Wang, Yue Luo, Ying Shan

IJCV 2022

Hybrid warping fusion for accurate and temporally coherent video frame interpolation.

Attentive Deep Network for Blind Motion Deblurring on Dynamic Scenes

Yong Xu, Ye Zhu, Yuhui Quan, Hui Ji

CVIU 2021

An attentive deep architecture for blind motion deblurring in dynamic scenes.

Enforcing Temporal Consistency in Video Depth Estimation

Siyuan Li, Yue Luo, Ye Zhu, Xun Zhao, Yu Li, Ying Shan

ICCVW 2021

Temporal consistency constraints for stable and accurate video depth estimation.