Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/138643
Full metadata record
DC FieldValueLanguage
dc.contributor.authorChen, Zhiweien_US
dc.date.accessioned2020-05-11T06:18:13Z-
dc.date.available2020-05-11T06:18:13Z-
dc.date.issued2020-
dc.identifier.urihttps://hdl.handle.net/10356/138643-
dc.description.abstractMany text classification tasks face the challenge of lack of sufficient la- belled data. Co-training algorithm is a candidate solution, which learns from both labeled and unlabelled data for better classification accuracy. However, two sufficient and redundant views of an instance are often not available to fully facilitate co-training in the past. With the recent develop- ment of deep learning, we now have both traditional TFIDF representation and deep representation for documents. In this paper, we conduct exper- iments to evaluate the effectiveness of co-training with different combina- tions of document representations (e.g., TFIDF, Doc2vec, ELMo, BERT) and classifiers (e.g., SVM, Random Forest, XGBoost, MLP, and CNN) on two benchmark datasets (20 Newsgroup and Ohsumed). Our results show that co-training with TFIDF and deep contextualised representation offers improvement to classification accuracy.en_US
dc.language.isoenen_US
dc.publisherNanyang Technological Universityen_US
dc.subjectEngineering::Computer science and engineering::Computing methodologies::Document and text processingen_US
dc.titleTFIDF meets deep document representation : a re-visit of co-training for text classificationen_US
dc.typeFinal Year Project (FYP)en_US
dc.contributor.supervisorSun Aixinen_US
dc.contributor.schoolSchool of Computer Science and Engineeringen_US
dc.description.degreeBachelor of Engineering (Computer Science)en_US
dc.contributor.supervisoremailAXSun@ntu.edu.sgen_US
item.grantfulltextrestricted-
item.fulltextWith Fulltext-
Appears in Collections:SCSE Student Reports (FYP/IA/PA/PI)
Files in This Item:
File Description SizeFormat 
FYP_final_report_v1.pdf
  Restricted Access
1.7 MBAdobe PDFView/Open

Page view(s)

391
Updated on Apr 29, 2025

Download(s)

26
Updated on Apr 29, 2025

Google ScholarTM

Check

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.