Please use this identifier to cite or link to this item:
https://hdl.handle.net/10356/183926
Title: | Adapting learning-based video retargeting to mainstream video application | Authors: | Ng, Donovan Chao Yu | Keywords: | Computer and Information Science | Issue Date: | 2025 | Publisher: | Nanyang Technological University | Source: | Ng, D. C. Y. (2025). Adapting learning-based video retargeting to mainstream video application. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/183926 | Project: | CCDS24-0525 | Abstract: | Generation-based learning methods have achieved great success in image retargeting. Additionally, utilizing both segmentation and inpainting has achieved great success in user satisfaction. However, the current limitations arise when the user wishes to manually select which part to segment, and also to make the object more realistic after the retargeting. This paper aims to address the above two limitations. We aim to explore segmentation models to facilitate a more flexible paradigm for each video frame which can help improve the accuracy of image retargeting and improve the quality of the retargeted image and additionally add more ways of relocating the object. The proposed pipeline combining the approach of the ORPVR, Segment anything 2 (SAM2) as well as the harmonizer has yielded promising results. A limitation was discovered for the object relocation algorithm and addressed accordingly with the improved widescreen to portrait mode for the object relocation algorithm to retarget images from widescreen (16:9) to (9:16). We also evaluate the components of the pipeline to see how it could be applied in a real world setting and discovered that the inpainting segment and the number of frames was the main contributing factor for the total time taken to run the proposed pipeline. We also discuss the possible limitations which was revealed on custom image retargeting dimentsions such as portrait (9:16) to widescreen (16:9) and future work to make the relocation algorithm more suitable for a wider variety of dimensions. | URI: | https://hdl.handle.net/10356/183926 | Schools: | College of Computing and Data Science | Fulltext Permission: | restricted | Fulltext Availability: | With Fulltext |
Appears in Collections: | CCDS Student Reports (FYP/IA/PA/PI) |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
FYP_final_report (1).pdf Restricted Access | FYP-CCDS24-0525 | 13.97 MB | Adobe PDF | View/Open |
Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.