Please use this identifier to cite or link to this item:
Title: Sentiment analysis on the web
Authors: Chit, Lin Su.
Keywords: DRNTU::Engineering::Computer science and engineering::Information systems::Information storage and retrieval
DRNTU::Engineering::Computer science and engineering::Information systems::Information systems applications
DRNTU::Engineering::Computer science and engineering::Computing methodologies::Document and text processing
DRNTU::Engineering::Computer science and engineering::Software::Software engineering
Issue Date: 2013
Abstract: In the Information Age, the wide range of Web usage has been increasing due to the advancement in hardware and software technology. As a result of that, the Web becomes the valuable source of massive amount of data contents. Nowadays, large volumes of data are created by Internet users. Among the different kinds of data available on the Web, considerable amount of data comes from social media. This is the place where users express themselves freely in the context of various topics. Therefore, sentiment data has gained increasing attention from both business and consumer to discovery valuable knowledge from these kinds of data. However, in order to accomplish analyzing the sentiment data, step by step processes have to be executed. In this project, software application was developed in order to support all step by step processes involved in sentiment analysis on the Web. Software application was separated into different software components to assist in data collection, data preparation, sentiment analysis, and data visualization processes. Literature studies were done for a better understanding of these processes. Software design methodology was created with the use of Unified Modeling Language (UML) before the actual implementation was performed using Java object oriented programing language in NetBeans Integrated Development Environment (IDE). Software testing was done for each process by using the real world online review data from Amazon web site. Web crawler and parser processed the real world data, and data pre-processor and text processor performed data transformation. Different kinds of sentiment classification techniques such as Naïve Bayes, Sequential Minimal Optimization and k-Nearest Neighbor learning were applied in sentiment analysis on the Web and results were visualized for end users. Classification accuracy results were observed and compared in which SMO performed better than Naïve Bayes and kNN in different scenarios. One of the research works of domain adaption were analyzed and perform experimentations for future direction of sentiment analysis.
Rights: Nanyang Technological University
Fulltext Permission: restricted
Fulltext Availability: With Fulltext
Appears in Collections:SCSE Student Reports (FYP/IA/PA/PI)

Files in This Item:
File Description SizeFormat 
Final Year Project Report.pdf
  Restricted Access
2.73 MBAdobe PDFView/Open

Google ScholarTM


Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.