Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/156440
Title: Be a cartoonist : editing anime images using generative adversarial network
Authors: Koh, Tong Liang
Keywords: Generative Adversarial Networks
Issue Date: 2022
Publisher: Nanyang Technological University
Source: Koh, T. L. (2022). Be a cartoonist : editing anime images using generative adversarial network. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/156440
Project: SCSE21-0365
Abstract: With the rise in popularity of generative models, many studies have started to look at furthering its applicability as well as its performance. One such application is in image-to-image translation which can be used to transform an image from domain A to domain B. However, in a scenario where the domain differs greatly in structure such as between real faces and cartoon faces, it can be difficult to perform high quality translation while retaining original identities. Currently, some existing works suggested the use of cycle consistency, few-shot training in image-to-image translation pipelines, while others recommend layer swapping and freezing lower-resolution generator layers on top of a well pretrained StyleGAN. However, these solutions are ineffective in translating real faces to anime images due to the difference in face structure. To address this problem, we introduce perceptual loss and featurebased multi-discriminators to supervise the training process with the help of the offthe-shelf StyleGAN trained on real image domain. This way we would be able to retain the original identity of the face after translating the image into another anime domain. We then explore anime image editing using closed-form factorisation to edit semantic details such as expression, pose and hair styles. In this project, we also explore StyleGAN compression by using knowledge distillation, since the StyleGAN has millions of parameter and it is difficult to utilise StyleGAN model on edge devices which have low computational budget.
URI: https://hdl.handle.net/10356/156440
Schools: School of Computer Science and Engineering 
Fulltext Permission: restricted
Fulltext Availability: With Fulltext
Appears in Collections:SCSE Student Reports (FYP/IA/PA/PI)

Files in This Item:
File Description SizeFormat 
FYP_Report_SCSE21_0365_Final_Report.pdf
  Restricted Access
21.11 MBAdobe PDFView/Open

Page view(s)

171
Updated on Dec 3, 2023

Download(s) 50

58
Updated on Dec 3, 2023

Google ScholarTM

Check

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.