Please use this identifier to cite or link to this item:
Full metadata record
DC FieldValueLanguage
dc.contributor.authorZhang, Weilunen_US
dc.identifier.citationZhang, W. (2022). Training deep network models for accurate recognition of texts in scene images. Final Year Project (FYP), Nanyang Technological University, Singapore.
dc.description.abstractWith the advent of the industrial 4.0 era, computer vision and its applications have played an increasingly significant role in the digitization of industries. Images that contain text can be used as sources of information for automating processes. Text recognition from scene images constitutes still a very challenging task because of various backgrounds, variations in size and space, as well as irregular arrangements. The first half of this paper will study and analyse a deep learning model for word recognition in scene images using deep learning. As part of this work, CRNN, one of the most extensively used state-of-the-art deep learning models for text recognition, will be implemented and trained. A number of hyperparameters will be tweaked to obtain optimum performance. In addition, further improvements to CRNN recognition accuracy will be explored. Different image super-resolution techniques and the ImageFilter models will be utilized and compared as a pre-processing stage for recognition in the second part of this paper.en_US
dc.publisherNanyang Technological Universityen_US
dc.subjectEngineering::Computer science and engineering::Computing methodologies::Image processing and computer visionen_US
dc.titleTraining deep network models for accurate recognition of texts in scene imagesen_US
dc.typeFinal Year Project (FYP)en_US
dc.contributor.supervisorLu Shijianen_US
dc.contributor.schoolSchool of Computer Science and Engineeringen_US
dc.description.degreeBachelor of Engineering (Computer Science)en_US
item.fulltextWith Fulltext-
Appears in Collections:SCSE Student Reports (FYP/IA/PA/PI)
Files in This Item:
File Description SizeFormat 
Zhang Weilun FYP Report.pdf
  Restricted Access
2.48 MBAdobe PDFView/Open

Page view(s)

Updated on Jul 1, 2022


Updated on Jul 1, 2022

Google ScholarTM


Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.