Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/175524
Title: VScenimefy: fast and stable anime video stylization via diffusion prior distillation
Authors: Jiang, Yuxin
Keywords: Computer and Information Science
Issue Date: 2024
Publisher: Nanyang Technological University
Source: Jiang, Y. (2024). VScenimefy: fast and stable anime video stylization via diffusion prior distillation. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/175524
Project: SCSE23-0378 
Abstract: Automatic rendering anime scenes from complex real-world images is of significant practical value. Prior works, from GANs to Diffusion-based methods, have shown impressive results in image-level translation. However, their extension to video exposes issues such as weak stylization, obvious flickering, and high computational demands. The core challenges of this task lie in 1) maintaining semantic preservation for complex scenes; 2) ensuring temporal consistency; 3) capturing the unique features of anime style. In this study, we propose VScenimefy, a novel three-stage video-to-video translation pipeline leveraging pretrained diffusion model’s prior as a pseudo supervision to narrow the domain gap. A low-level alignment technique with pseudo paired data is introduced to improve stylization and ensure structural integrity. Besides, we present an optical-flow-based video finetuning framework, yielding temporally consistent and stylistically coherent videos. Our extensive experiments demonstrate the superiority of our method over state-of-the-art baselines in terms of perceptual quality and quantitative performance. Remarkably, our model runs in real-time, confirming its feasibility for practical application and highlighting its potential to empower GANs for efficient video stylization through diffusion model distillation.
URI: https://hdl.handle.net/10356/175524
Schools: School of Computer Science and Engineering 
Fulltext Permission: embargo_restricted_20260401
Fulltext Availability: With Fulltext
Appears in Collections:SCSE Student Reports (FYP/IA/PA/PI)

Files in This Item:
File Description SizeFormat 
Jiang Yuxin_SCSE23-0378.pdf
  Until 2026-04-01
20.51 MBAdobe PDFUnder embargo until Apr 01, 2026

Page view(s)

129
Updated on Mar 25, 2025

Google ScholarTM

Check

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.