Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/181681
Title: The old newspaper project
Authors: Li, JiaGeng
Keywords: Computer and Information Science
Issue Date: 2024
Publisher: Nanyang Technological University
Source: Li, J. (2024). The old newspaper project. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/181681
Abstract: This project builds an AI-based newspaper image recognition app that uses Computer Vision, Optical Character Recognition (OCR) and Natural Language Processing (NLP) models to recognize, classify and extract information from images of newspapers. The project follows an iterative process: it begins with an investigation of Tesseract OCR and YOLOv8 in a pre-alpha version to extract text and analyze layout. Having run into limitations with these tools, we shifted to more advanced implementations: PaddleOCR together with LayoutParser for analyzing newspaper layouts and parsing documents, and GPT-3 models for writing detailed outlines of the extracted content. In this report, we describe the project context, the technology choices and how we collected data, followed by the challenges we faced while building the application and how we solved them to improve the application's functionality.
URI: https://hdl.handle.net/10356/181681
Schools: School of Electrical and Electronic Engineering 
Fulltext Permission: restricted
Fulltext Availability: With Fulltext
Appears in Collections: EEE Student Reports (FYP/IA/PA/PI)

Files in This Item:
File: FYPReport_LiJiaGeng.pdf (Restricted Access)
Size: 884.2 kB
Format: Adobe PDF

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.