Please use this identifier to cite or link to this item:
Title: Semantic text analytic services on the cloud
Authors: Ng, Wei Kok.
Keywords: DRNTU::Engineering::Electrical and electronic engineering::Computer hardware, software and systems
DRNTU::Engineering::Computer science and engineering::Computing methodologies::Document and text processing
Issue Date: 2013
Abstract: Text Analytics has applications in many areas. However, when Big Data is involved, there is a need to implement Text Analytic Services on the Cloud, due to the limitations of individual machines. Nonetheless, performing computation on large volumes of data is difficult. Even if an implementation works on 10 machines, to scale up to 100s or even 1000s of machines would require major changes to the implementation. The Hadoop framework ensures reliability and availability using unreliable commodity hardware. Hadoop also has a linear scaling, even when scaling up by orders of magnitude, giving Hadoop an advantage over other forms of distributed computing. By implementing GATE, a widely used Text Analytic tool, on the Hadoop framework using MapReduce, this project aims to enable Text Analysis on Big Data, with the linear scaling provided by the Hadoop framework. Performance analysis of the implementation shows that there is indeed a linear scaling when processing with increasing number of machines. As GATE can be used for a multitude of Text Analytic purposes, this implementation will allow the analysis of Big Data in many areas of interest.
Schools: School of Electrical and Electronic Engineering 
Organisations: A*STAR Institute for Infocomm Research
Rights: Nanyang Technological University
Fulltext Permission: restricted
Fulltext Availability: With Fulltext
Appears in Collections:EEE Student Reports (FYP/IA/PA/PI)

Files in This Item:
File Description SizeFormat 
  Restricted Access
FYP Report931.87 kBAdobe PDFView/Open

Page view(s)

Updated on Jun 18, 2024


Updated on Jun 18, 2024

Google ScholarTM


Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.