Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/184445
Title: Knowledge enhanced deep learning under small data
Authors: Ma, Jiarui
Keywords: Computer and Information Science
Engineering
Issue Date: 2025
Publisher: Nanyang Technological University
Source: Ma, J. (2025). Knowledge enhanced deep learning under small data. Master's thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/184445
Abstract: Deep learning has become a dominant method for a wide range of natural language processing (NLP) tasks. At present, the majority of deep learning techniques rely entirely on data-driven strategies, typically demanding substantial datasets to develop models that perform effectively. In some applications, however, the data available is very limited. How to train a well-performing model under small data is a challenging yet important issue in NLP. This dissertation investigates enhancing deep learning under small data by integrating domain and linguistic knowledge. Two approaches are proposed: the SetFit model for few-shot text classification, achieving 88.42% accuracy and 88.37% F1 score on the SST-2 dataset with 100 samples, and the K-BERT model, which enhances BERT with knowledge graphs (e.g., CN-DBpedia with 5.17 million triplets, MedicalKG with 13,864 triplets). Tested on tasks like sentiment classification, NER, Q&A, and domain-specific datasets (e.g., Medicine_NER), K-BERT shows significant performance gains over BERT. Key achievements include reduced data dependency, improved interpretability through semantic constraints, and a shift to a "knowledge + data co-driven" NLP paradigm, offering practical solutions for low-resource fields and new insights for future research.
URI: https://hdl.handle.net/10356/184445
Schools: School of Electrical and Electronic Engineering 
Fulltext Permission: restricted
Fulltext Availability: With Fulltext
Appears in Collections:EEE Theses

Files in This Item:
File Description SizeFormat 
Ma Jiarui-Dissertation.pdf
  Restricted Access
830.83 kBAdobe PDFView/Open

Page view(s)

28
Updated on May 7, 2025

Download(s)

1
Updated on May 7, 2025

Google ScholarTM

Check

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.