Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/147958
Title: Mobile application on a scene text spotting
Authors: Nguyen Doan Hoang Lam
Keywords: Engineering::Computer science and engineering::Software::Software engineering
Issue Date: 2021
Publisher: Nanyang Technological University
Source: Nguyen Doan Hoang Lam (2021). Mobile application on a scene text spotting. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/147958
Project: SCSE20-0091
Abstract: Scene text spotting serves as an important concept in many practical applications. In particular, the applications of text spotting may include but not limited to reducing human labor of manual text extraction tasks, retrieving information from images for image context analysis, or automatically identifying human identity by reading their identification card. With many important applications of text spotting, many attempts on implementing software applications that adopted different text spotting methods have been made. While these existing applications have eased many human activities that require text extraction from natural scenes, they also experienced some limitations. Some applications still require too many manual actions from users and cannot spot text in real-time while others may suffer from accuracy and performance issues due to the obsolete text spotting methods. To resolve those issues, this project proposed the implementation of a mobile application that adopted a well-known text spotting approach known as the Adaptive Bezier Curve Network. The performance of this approach, which has been evaluated on TotalText and CTW1500 dataset, proved to achieve a state-of-the-art accuracy while having considerably high inference speed compared to the other existing state-of-the-art methods. In addition to adopting this approach, the project has successfully built an application and a text spotting server using socket programming method as well as our own defined image streaming protocol. Finally, the experiments conducted to measure the performance of the application shows that it is capable of real-time text spotting with up to eight frames per seconds on average while retaining the state-of-the-art text spotting accuracy.
URI: https://hdl.handle.net/10356/147958
Schools: School of Computer Science and Engineering 
Fulltext Permission: restricted
Fulltext Availability: With Fulltext
Appears in Collections:SCSE Student Reports (FYP/IA/PA/PI)

Files in This Item:
File Description SizeFormat 
FYPReport.pdf
  Restricted Access
29.46 MBAdobe PDFView/Open

Page view(s)

332
Updated on Mar 24, 2025

Download(s) 50

39
Updated on Mar 24, 2025

Google ScholarTM

Check

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.