Please use this identifier to cite or link to this item:
https://hdl.handle.net/10356/147554
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Khan, Mohammad Sadique | en_US |
dc.date.accessioned | 2021-04-12T13:10:58Z | - |
dc.date.available | 2021-04-12T13:10:58Z | - |
dc.date.issued | 2020 | - |
dc.identifier.citation | Khan, M. S. (2020). Sentiment analysis of context word embeddings. Master's thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/147554 | en_US |
dc.identifier.uri | https://hdl.handle.net/10356/147554 | - |
dc.description.abstract | With every technological advancement, the role of machines in our lives are getting augmented and now, more than ever, there is a need to communicate with machines naturally. Natural communication includes more than just recognition of pre-defined commands by the machines, it includes but is not limited to the understanding of contextual and sentiment information of the conversation. There is a separate branch in Computer sciences and AI which deals with the interaction between computer and human called natural language processing (NLP). NLP encompasses many aspects of communication between machine and human such as speech recognition, syntactic analysis, and lexical semantics. But this study is limited to a part of NLP which is concerned with sentiment extraction and classification of text based on its sentiment. Classification of text is done into binary or ternary classes based on its sentiment. IMDB, SST and SemEval databases provide pre-labelled sentences specially curated for sentiment analysis tasks. The mentioned datasets are used for training and testing the classification models developed in this thesis using deep learning architecture. Embedding algorithm such as BERT framework is used for extracting word embedding. CNN and Fully connected deep neural network architecture are used to develop the classification models for classifying text in binary and ternary sentiment labels. The different classification models are compared against each other on different metrics such as Macro-F1, accuracy, precision, and recall. BERT embedding with CNN classifier is found to perform better on all datasets compared to all other classifier models discussed in this thesis. | en_US |
dc.language.iso | en | en_US |
dc.publisher | Nanyang Technological University | en_US |
dc.subject | Engineering::Electrical and electronic engineering::Computer hardware, software and systems | en_US |
dc.title | Sentiment analysis of context word embeddings | en_US |
dc.type | Thesis-Master by Coursework | en_US |
dc.contributor.supervisor | Ponnuthurai Nagaratnam Suganthan | en_US |
dc.contributor.school | School of Electrical and Electronic Engineering | en_US |
dc.description.degree | Master of Science (Computer Control and Automation) | en_US |
dc.contributor.supervisoremail | EPNSugan@ntu.edu.sg | en_US |
item.fulltext | With Fulltext | - |
item.grantfulltext | restricted | - |
Appears in Collections: | EEE Theses |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Sadique(G1902143G) Dissertation.pdf Restricted Access | Final Dissertation | 4.56 MB | Adobe PDF | View/Open |
Page view(s)
247
Updated on Mar 28, 2024
Download(s)
5
Updated on Mar 28, 2024
Google ScholarTM
Check
Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.