Please use this identifier to cite or link to this item:
https://hdl.handle.net/10356/148083
Title: | Training deep network models for accurate recognition of texts in scenes | Authors: | Chen, Cheng | Keywords: | Engineering::Computer science and engineering | Issue Date: | 2021 | Publisher: | Nanyang Technological University | Source: | Chen, C. (2021). Training deep network models for accurate recognition of texts in scenes. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/148083 | Project: | SCSE20-0118 | Abstract: | Scene Text Recognition is an important task because of its many potential applications in the industries. However, Scene Text Recognition is also a challenging task in Computer Vision because of the irregularity and diversity of scene text images. Among these difficulties, low-resolution images are one of the major problems still yet to be perfectly solved. In this paper, a deep learning neural network specialised in scene text recognition is studied and implemented. Multiple ways to improvement model performance on low-resolution images are also investigated and compared. More specifically, two different strategies of handling low-resolution images are investigated: 1) Super-resolving images from feature level by incorporating a Super- Resolution Unit in the end-to-end trainable model; 2) Super-resolve images from image level through three different state-of-art super-resolution models. To ensure fair comparison, the TextZoom dataset is used throughout different rounds of experiments as it contains real-life low-resolution and high-resolution image pairs. | URI: | https://hdl.handle.net/10356/148083 | Schools: | School of Computer Science and Engineering | Fulltext Permission: | restricted | Fulltext Availability: | With Fulltext |
Appears in Collections: | SCBE Student Reports (FYP/IA/PA/PI) |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Chen Cheng FYP Report.pdf Restricted Access | 3.39 MB | Adobe PDF | View/Open |
Page view(s)
354
Updated on Sep 21, 2023
Download(s) 50
31
Updated on Sep 21, 2023
Google ScholarTM
Check
Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.