Please use this identifier to cite or link to this item:
https://hdl.handle.net/10356/136597
Title: | A novel pipeline for table extraction using deep learning | Authors: | Lee, Seng Cheong | Keywords: | Engineering::Computer science and engineering | Issue Date: | 2019 | Publisher: | Nanyang Technological University | Abstract: | Table extraction refers to the detection and extraction of tables from documents and images while preserving their structural layout and content. With the ever-growing volume of digital files and content, there is an increasing demand for the automated extraction of tables for consumption in a programmatic format, as well as in support of advanced applications such as information retrieval and natural language processing. This project proposes an automated pipeline for table extraction using convolutional neural networks (CNN). The pipeline consists of a table detection module, which detects the presence of tables and extract the table regions using an object detection CNN model, and a table structure recognition module, which extracts table cells and their contents before reconstructing the table structure. To enhance performance of the table detection module, modifications were implemented into the table detection model and evaluated against their non-modified versions. The report will first review existing literature for table detection and table structure recognition. Next, the report introduces the datasets utilized for training, as well as data augmentation methods, the architectures utilized in the evaluation of single-stage approaches and experiments on modifications carried out to improve performance. The evaluation metrics and results will then be presented and discussed. Several experiments carried out in this project were discovered to show promising results over their non-modified counterparts. Additionally, the pipeline was successfully demonstrated to perform table extraction, thus demonstrating the viability of the overall process. | URI: | https://hdl.handle.net/10356/136597 | Schools: | School of Computer Science and Engineering | Fulltext Permission: | restricted | Fulltext Availability: | With Fulltext |
Appears in Collections: | SCSE Student Reports (FYP/IA/PA/PI) |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Lee Seng Cheong FYP Report.pdf Restricted Access | 2.76 MB | Adobe PDF | View/Open | |
Lee Seng Cheong FYP Poster.pdf Restricted Access | 1.24 MB | Adobe PDF | View/Open | |
alma991016441109405146.html Restricted Access | 138 B | HTML | View/Open |
Page view(s) 50
612
Updated on Mar 28, 2024
Download(s) 50
131
Updated on Mar 28, 2024
Google ScholarTM
Check
Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.