Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/184060
Title: Smooth and long video generation
Authors: Wu, Bryan Jiahe
Keywords: Computer and Information Science
Issue Date: 2025
Publisher: Nanyang Technological University
Source: Wu, B. J. (2025). Smooth and long video generation. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/184060
Project: CCDS24-0491
Abstract: Diffusion models have gained significant traction in recent years for image generation tasks due to their ability to produce high-quality outputs while avoiding common pitfalls of adversarial approaches like Generative Adversarial Networks. Their probabilistic framework, which progressively denoises random noise, ensures stability during training and allows precise control over the generative process. More recently, researchers have extended diffusion models to video generation, with promising results from methods such as AnimateDiff and LaVie. However, generating long videos remains an open challenge, as issues such as temporal coherence deterioration and high computational costs limit scalability. This final year project explores training-free approaches to address these challenges, leveraging pre-trained video generation models to improve the length and temporal consistency of generated videos. Building on the capabilities of AnimateDiff, the project aims for computational efficiency, avoiding resource-intensive training while delivering advancements in video length and temporal coherence. These contributions pave the way for practical applications of video diffusion models in fields such as film, virtual reality, education, gaming, and the advertising industry.
URI: https://hdl.handle.net/10356/184060
Schools: College of Computing and Data Science 
Fulltext Permission: restricted
Fulltext Availability: With Fulltext
Appears in Collections:CCDS Student Reports (FYP/IA/PA/PI)

Files in This Item:
File Description SizeFormat 
FYP Final Report.pdf
  Restricted Access
47.3 MBAdobe PDFView/Open

Page view(s)

30
Updated on May 7, 2025

Google ScholarTM

Check

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.