Title: Time series & sentiment analysis of top cryptocurrencies
Authors: Yeoh, Chester Fu Soon
Keywords: Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
Issue Date: 2021
Publisher: Nanyang Technological University
Source: Yeoh, C. F. S. (2021). Time series & sentiment analysis of top cryptocurrencies. Final Year Project (FYP), Nanyang Technological University, Singapore.
Abstract: Bitcoin, the world’s first cryptocurrency, was created in 2009 by Satoshi Nakamoto. Frustrated by the need for a central financial institution to oversee online payments between parties, Nakamoto designed Bitcoin around a blockchain framework that would be “powered by users with no central authority or middlemen”. Close to a decade later, Bitcoin has seen a meteoric rise in popularity, exceeding 300,000 transactions per day (Appendix A). It has also blossomed into a booming market worth over USD 1 trillion (as of 9 March 2021) and ushered in a new era of cryptocurrencies. The benefits of cryptocurrencies, such as accessibility and user autonomy (i.e., the lack of an intermediary authority) (Reiff, 2021), are well documented. However, most cryptocurrencies suffer from a key drawback: high price fluctuations. Bitcoin, for example, increased its value by 1900% in 2017 before plunging below USD 8,000 per coin, down from its high of USD 19,000, in the January/February 2018 crash. Since cryptocurrency price research is still a relatively unexplored area and equilibrium market prices are poorly understood (Conrad, 2018), this project conducts a sentiment analysis to explore possible drivers, such as social media and news reports, that lead to these price fluctuations. Additionally, non-sentiment factors such as Public Interest, Community Data and Developer Data are analysed for their respective price impacts. Under the sentiment analysis portion of this FYP, Reddit and Twitter content, together with articles from 30,000 news sources and blogs, is mined and analysed using a Valence Aware Dictionary and sEntiment Reasoner (VADER) model to facilitate a correlation analysis against the prices of top cryptocurrencies such as Bitcoin (BTC) and Ethereum (ETH). Non-sentiment factors are also extracted as inputs for building a Linear Regression model as well as a Random Forest Classifier.
The FYP results indicate that price is only weakly correlated (|r| < 0.1) with news and social media sentiment for both cryptocurrencies. On the other hand, non-sentiment factors were strongly correlated with price across the board (|r| > 0.7), and specific factors such as the number of Twitter followers of a cryptocurrency’s Twitter account (e.g., @Bitcoin) and the number of Reddit accounts following a cryptocurrency’s subreddit (e.g., r/Bitcoin) appeared to be strong price drivers under the high-accuracy regression and classification models.
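The correlation thresholds quoted above can be illustrated with a minimal sketch. It is not the project's code: the function names `pearson_r` and `strength`, the "moderate" label for values between the two thresholds, and the sample series are all assumptions made for illustration. Only the cut-offs |r| < 0.1 and |r| > 0.7 come from the abstract.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant series")
    return cov / math.sqrt(var_x * var_y)


def strength(r, weak=0.1, strong=0.7):
    """Label |r| using the abstract's cut-offs.

    The 'moderate' label for the range in between is an assumption.
    """
    magnitude = abs(r)
    if magnitude < weak:
        return "weak"
    if magnitude > strong:
        return "strong"
    return "moderate"


if __name__ == "__main__":
    # Hypothetical daily values, invented purely for illustration.
    prices = [100.0, 104.0, 103.0, 110.0, 115.0, 113.0]
    followers = [1000, 1040, 1050, 1100, 1160, 1150]
    sentiment = [0.10, -0.20, 0.30, 0.00, -0.10, 0.25]  # e.g. compound scores

    for name, series in (("followers", followers), ("sentiment", sentiment)):
        r = pearson_r(series, prices)
        print(f"{name}: r = {r:.3f} ({strength(r)})")
```

In a real pipeline, the sentiment series would be a per-day aggregate of the scores produced by the sentiment model, aligned by date with the price series before the correlation is computed.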
Fulltext Permission: restricted
Fulltext Availability: With Fulltext
Appears in Collections: SCSE Student Reports (FYP/IA/PA/PI)

Files in This Item:
File: Restricted Access
Size: 2.94 MB
Format: Adobe PDF


Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.