Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/76861
Title: Scene text recognition
Authors: Muhammad Afiq Osman
Keywords: DRNTU::Engineering::Computer science and engineering
Issue Date: 2019
Abstract: Scene text recognition problem has recently seen interest within the deep learning community. Solving such a problem will inevitably open paths to more exciting inventions in the future such as for robotic navigation. However, the current solutions are far from perfect and there is potentially more that could be done. In this FYP we will investigate and attempt to replicate an existing study regarding scene text recognition. The goal is to theoretically understand and experience the practical side of implementing and training a deep learning model to tackle such a problem. We would undertake the hyperparameter tuning process in search of the optimal values for the batch size, learning rate, number of epochs and the learning optimizer. Optimal values found for batch size and learning rate coincides with the common rationale. The same goes for the number of epochs where the resulting trend suggests that as the number of epochs increases the model accuracy will start to plateau. However, as for the best optimizer was found to be Adam which was different from the original’s study optimizer of Adadelta. Adadelta in fact performed much worse producing ‘nan’ test and train error on many occasions. Future recommendation for this FYP includes experimenting with the CRNN model structure used in order to deeply understand the effects of the CNN and RNN layers used.
URI: http://hdl.handle.net/10356/76861
Schools: School of Computer Science and Engineering 
Rights: Nanyang Technological University
Fulltext Permission: restricted
Fulltext Availability: With Fulltext
Appears in Collections:SCSE Student Reports (FYP/IA/PA/PI)

Files in This Item:
File Description SizeFormat 
MuhammadAfiqBinOsman.pdf
  Restricted Access
1.6 MBAdobe PDFView/Open

Page view(s)

319
Updated on Jul 17, 2024

Download(s) 50

95
Updated on Jul 17, 2024

Google ScholarTM

Check

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.