Title: Text mining for building proteins interaction networks
Authors: Chen, Lucan.
Keywords: DRNTU::Engineering::Computer science and engineering::Computer applications::Life and medical sciences
DRNTU::Engineering::Computer science and engineering::Computing methodologies::Document and text processing
Issue Date: 2012
Abstract: Text mining is an important method for extracting information from biomedical literature. Previous text mining algorithms are not specific to the biological domain. Our purpose is to find the algorithm that is most suitable for the biological domain. Existing algorithms were modified and improved, and a new algorithm was also designed. The existing text mining algorithms were investigated and implemented first. The Pattern Matching Algorithm is a commonly used, straightforward algorithm, but results from our implementation show that its performance is not good enough. A new algorithm, the Terms Association Algorithm, was therefore designed and implemented. Its results show that it is suitable for English biomedical literature and that it performs better than the existing text mining algorithm, especially in the biological domain. Once the best text mining algorithm for the biological domain had been determined, it was integrated with OSEE, KEGG pathways and IntAct to construct gene regulation networks and protein-protein interaction networks. The overall performance of the constructed networks was investigated. Our new text mining algorithm has shown [...] in constructing biological networks.
Rights: Nanyang Technological University
Fulltext Permission: restricted
Fulltext Availability: With Fulltext
Appears in Collections: SCSE Student Reports (FYP/IA/PA/PI)

Files in This Item:
Description: Restricted Access
Size: 999.6 kB
Format: Adobe PDF

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.