Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/148083
Title: Training deep network models for accurate recognition of texts in scenes
Authors: Chen, Cheng
Keywords: Engineering::Computer science and engineering
Issue Date: 2021
Publisher: Nanyang Technological University
Source: Chen, C. (2021). Training deep network models for accurate recognition of texts in scenes. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/148083
Project: SCSE20-0118
Abstract: Scene Text Recognition is an important task because of its many potential applications in the industries. However, Scene Text Recognition is also a challenging task in Computer Vision because of the irregularity and diversity of scene text images. Among these difficulties, low-resolution images are one of the major problems still yet to be perfectly solved. In this paper, a deep learning neural network specialised in scene text recognition is studied and implemented. Multiple ways to improvement model performance on low-resolution images are also investigated and compared. More specifically, two different strategies of handling low-resolution images are investigated: 1) Super-resolving images from feature level by incorporating a Super- Resolution Unit in the end-to-end trainable model; 2) Super-resolve images from image level through three different state-of-art super-resolution models. To ensure fair comparison, the TextZoom dataset is used throughout different rounds of experiments as it contains real-life low-resolution and high-resolution image pairs.
URI: https://hdl.handle.net/10356/148083
Schools: School of Computer Science and Engineering 
Fulltext Permission: restricted
Fulltext Availability: With Fulltext
Appears in Collections:SCBE Student Reports (FYP/IA/PA/PI)

Files in This Item:
File Description SizeFormat 
Chen Cheng FYP Report.pdf
  Restricted Access
3.39 MBAdobe PDFView/Open

Page view(s)

354
Updated on Sep 21, 2023

Download(s) 50

31
Updated on Sep 21, 2023

Google ScholarTM

Check

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.