Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/138594
Title: Classification on big data set using data analytics techniques
Authors: Chung, Ka Wai
Keywords: Engineering::Electrical and electronic engineering::Computer hardware, software and systems
Issue Date: 2020
Publisher: Nanyang Technological University
Project: A3039-191
Abstract: The advancement of big data allows data analytics to grow with the increase in the amount of information that can be processed. As information is more readily available, programs can be created to extract, analyse and classify online social media messages and comments. Existing word dictionaries are based on old literature text and documents and are unable to pick up slang used by users of the internet, as well as languages that are an amalgamation of different dialects and languages such as Singlish. The project aims to create a classification model based on a localised dataset of an online message board to be able to categorise comments whether they are positive, negative or neutral in sentiment. A total of 3 concepts of classification were explored and 5 different models were generated to obtain an accuracy ranging from 57%-64%. A voting classifier consisting of the combination of all 5 models resulted in a higher accuracy of 65.5%. A chatbot was also programmed and interaction with the classification models to evaluate the sentiment of user input. This project can be utilised in social data analytics and metrics to gauge feedback of online comments for news and updates.
URI: https://hdl.handle.net/10356/138594
Schools: School of Electrical and Electronic Engineering 
Fulltext Permission: restricted
Fulltext Availability: With Fulltext
Appears in Collections:EEE Student Reports (FYP/IA/PA/PI)

Files in This Item:
File Description SizeFormat 
[A3039-191] Classification on Big Data Set using Data Analytics Techniques.pdf
  Restricted Access
1.64 MBAdobe PDFView/Open

Page view(s)

387
Updated on Mar 26, 2025

Download(s) 50

27
Updated on Mar 26, 2025

Google ScholarTM

Check

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.