Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/156955
Full metadata record
DC FieldValueLanguage
dc.contributor.authorHoang, Nghia Tuyenen_US
dc.date.accessioned2022-05-04T03:33:54Z-
dc.date.available2022-05-04T03:33:54Z-
dc.date.issued2022-
dc.identifier.citationHoang, N. T. (2022). Open domain question answering system. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/156955en_US
dc.identifier.urihttps://hdl.handle.net/10356/156955-
dc.description.abstractDeep learning methods have drawn tremendous attention from both the research community and the industrial practitioners thanks to their undeniable power in learning feature representation in higher dimensions without manual, handcrafting features. An application of deep learning that arises naturally is question answering, in which a question answering system must answer questions posed by humans. One of its sub-fields, opendomain question answering, attempts to answer questions about nearly anything, without being given relevant reference texts. Despite its impactful applications in search engines, chatbots and factual correction, research work in open-domain question answering is relatively under-explored due to its complex and large-scale nature. In this work, we aim to advance the progress of recent open-domain question answering systems by developing various mathematical-driven methods. More specifically, in the first part of this thesis, we introduce the widely adopted two-stage paradigm in opendomain question answering and perform comprehensive error analysis on state-of-the-art models. Based on this, we are then able to formulate and develop methods aiming specifically at overcoming these weaknesses in the second part of the thesis. These approaches range from simple methods such as parameter sharing and data augmentation to more sophisticated methods such as designing new objective functions or pseudo data synthesis and semi-supervised learning. Finally, we unify these developed methods into a single framework that outperforms state-of-the-art models by a significant margin on common benchmarking datasets. The code to reproduce our experiments is released at https://github.com/hnt4499/DPR.en_US
dc.language.isoenen_US
dc.publisherNanyang Technological Universityen_US
dc.subjectEngineering::Computer science and engineering::Information systems::Information storage and retrievalen_US
dc.subjectScience::Mathematicsen_US
dc.titleOpen domain question answering systemen_US
dc.typeFinal Year Project (FYP)en_US
dc.contributor.supervisorJoty Shafiq Rayhanen_US
dc.contributor.supervisorPun Chi Sengen_US
dc.contributor.schoolSchool of Physical and Mathematical Sciencesen_US
dc.description.degreeBachelor of Science in Mathematical Sciencesen_US
dc.contributor.supervisoremailcspun@ntu.edu.sg, srjoty@ntu.edu.sgen_US
item.grantfulltextrestricted-
item.fulltextWith Fulltext-
Appears in Collections:SPMS Student Reports (FYP/IA/PA/PI)
Files in This Item:
File Description SizeFormat 
Hoang_Nghia_Tuyen_U1840140F_FYP_Final.pdf
  Restricted Access
1.29 MBAdobe PDFView/Open

Page view(s)

42
Updated on Jul 2, 2022

Download(s)

9
Updated on Jul 2, 2022

Google ScholarTM

Check

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.