Please use this identifier to cite or link to this item:
Title: Automatic document categorization
Authors: Zhou, Anna
Keywords: DRNTU::Engineering
Issue Date: 2016
Abstract: With the increasing popularity of social media network in the recent years, the concerns have been raised for the exposure of cyber bullying. The harmful information brings huge negative impact on the mental health of people who are exposed to them, especially teenagers. Therefore, it is essential to find an effective way of cyber bullying detection. In this paper, we proposed two different models for the text representation and feature extraction. Introduction to the topic and some related work were presented firstly for a better understanding of the topic. Then the concept of the two text representation models Embedding Enhanced Bag-of-Words model and Bullying-Word-Filter model were introduced. In the experiment part, we applied these two models with some manually labeled tweets and did the testing. The performances of prediction scores were illustrated. In the second part, with the classifiers trained in the first part, a case study concentrating on the cyber bullying cases in Singapore was done. It wasshown in the paper that our proposed models outperformed many existing models and worked efficiently in cyber bullying detection. In the future, more works are supposed to be finished.
Rights: Nanyang Technological University
Fulltext Permission: restricted
Fulltext Availability: With Fulltext
Appears in Collections:EEE Student Reports (FYP/IA/PA/PI)

Files in This Item:
File Description SizeFormat 
FYP report.pdf
  Restricted Access
4.16 MBAdobe PDFView/Open

Page view(s) 50

checked on Oct 23, 2020

Download(s) 50

checked on Oct 23, 2020

Google ScholarTM


Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.