Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/44614
Title: Text mining with minimal human supervision
Authors: Ling, Hong Yao.
Keywords: DRNTU::Engineering::Computer science and engineering
Issue Date: 2011
Abstract: In a National Basketball Association match, head coaches often have to make good and timely decisions in the best interest of his team. To make good decisions, it is important that coaches know and recognize his players’ past performances and records.This report explores the various aspects in the creation and implementation of the system. The main objective of this project is to develop a system, NBA Automated Extraction System (NAXS), which uses the text mining technology and it follows the manual approach of identifying data patterns. The system automatically crawls the NBA Web site to search for games as specified by the user and extract useful information from these games. The proposed system was evaluated for the precision of the extraction procedure though various tests. These tests comprise of data taken from 3 months and include the extraction of i) The number of games as well as ii) The extraction of individual player’s statistics. All the tests performed well with each having a percentage of above 99%. The average accuracy of the extraction procedure of NAXS based on the data taken from 3 months is 99.80%. In conclusion, NAXS proved to be almost as efficient as counting the data manually and the automation process is also much faster as compared to the manually counting process.
URI: http://hdl.handle.net/10356/44614
Rights: Nanyang Technological University
Fulltext Permission: restricted
Fulltext Availability: With Fulltext
Appears in Collections:SCSE Student Reports (FYP/IA/PA/PI)

Files in This Item:
File Description SizeFormat 
SCE10-0188.pdf
  Restricted Access
1.17 MBAdobe PDFView/Open

Page view(s) 50

280
Updated on Dec 1, 2020

Download(s) 50

20
Updated on Dec 1, 2020

Google ScholarTM

Check

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.