Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/136597
Title: A novel pipeline for table extraction using deep learning
Authors: Lee, Seng Cheong
Keywords: Engineering::Computer science and engineering
Issue Date: 2019
Publisher: Nanyang Technological University
Abstract: Table extraction refers to the detection and extraction of tables from documents and images while preserving their structural layout and content. With the ever-growing volume of digital files and content, there is an increasing demand for the automated extraction of tables for consumption in a programmatic format, as well as in support of advanced applications such as information retrieval and natural language processing. This project proposes an automated pipeline for table extraction using convolutional neural networks (CNN). The pipeline consists of a table detection module, which detects the presence of tables and extract the table regions using an object detection CNN model, and a table structure recognition module, which extracts table cells and their contents before reconstructing the table structure. To enhance performance of the table detection module, modifications were implemented into the table detection model and evaluated against their non-modified versions. The report will first review existing literature for table detection and table structure recognition. Next, the report introduces the datasets utilized for training, as well as data augmentation methods, the architectures utilized in the evaluation of single-stage approaches and experiments on modifications carried out to improve performance. The evaluation metrics and results will then be presented and discussed. Several experiments carried out in this project were discovered to show promising results over their non-modified counterparts. Additionally, the pipeline was successfully demonstrated to perform table extraction, thus demonstrating the viability of the overall process.
URI: https://hdl.handle.net/10356/136597
Schools: School of Computer Science and Engineering 
Fulltext Permission: restricted
Fulltext Availability: With Fulltext
Appears in Collections:SCSE Student Reports (FYP/IA/PA/PI)

Files in This Item:
File Description SizeFormat 
Lee Seng Cheong FYP Report.pdf
  Restricted Access
2.76 MBAdobe PDFView/Open
Lee Seng Cheong FYP Poster.pdf
  Restricted Access
1.24 MBAdobe PDFView/Open
alma991016441109405146.html
  Restricted Access
138 BHTMLView/Open

Page view(s) 50

685
Updated on May 5, 2025

Download(s) 50

131
Updated on May 5, 2025

Google ScholarTM

Check

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.