Please use this identifier to cite or link to this item:
https://hdl.handle.net/10356/172067
Title: | Image and video generation via deep learning | Authors: | Jiang, Liming | Keywords: | Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision | Issue Date: | 2023 | Publisher: | Nanyang Technological University | Source: | Jiang, L. (2023). Image and video generation via deep learning. Doctoral thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/172067 | Project: | NTU NAP | Abstract: | Despite the immense success in image and video generation, several important problems still exist. This thesis aims at addressing the remaining challenges through advanced deep learning techniques. The first attempt is to construct a large-scale facial video dataset, DeeperForensics-1.0, to facilitate the following research and prevent the negative impact of generated data via better video manipulation. After securing the countermeasures, a versatile Two-Stream Image-to-image Translation (TSIT) framework is proposed, which has high practical value. Besides, the thesis tackles the remaining issues through a more fundamental and theoretical study, focal frequency loss (FFL), a frequency-level loss function that is complementary to existing spatial losses. The thesis further introduces Adaptive Pseudo Augmentation (APA) for GAN training with limited data, reducing the data requirements. Extensive experiments and analyses showcase the effectiveness of the proposed methods in both perceptual quality and quantitative evaluations. Finally, the thesis envisions potential future work, offering more insights into this field. | URI: | https://hdl.handle.net/10356/172067 | DOI: | 10.32657/10356/172067 | Schools: | School of Computer Science and Engineering | Rights: | This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0). | Fulltext Permission: | open | Fulltext Availability: | With Fulltext |
Appears in Collections: | SCSE Theses |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Liming_Jiang_PhD_Thesis_Final.pdf | 12.75 MB | Adobe PDF | ![]() View/Open |
Page view(s)
64
Updated on Dec 6, 2023
Download(s)
13
Updated on Dec 6, 2023
Google ScholarTM
Check
Altmetric
Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.