Please use this identifier to cite or link to this item:
https://hdl.handle.net/10356/19088
Title: | Automatic information extraction and text mining in medical abstracts. |
Authors: | Wang, Wei. |
Keywords: | DRNTU::Library and information science |
Issue Date: | 2009 |
Abstract: | This study is in the area of information extraction (IE), which seeks to extract pieces of related information from unstructured text to populate a database or an ontology. Most IE systems employ a pattern-matching technique to identify the information to be extracted. Patterns are learnt from a large annotated training set, which requires substantial human effort. This study investigates a semi-supervised learning approach to learn IE patterns. The approach uses a small number of seed patterns to automatically generate a training set and learns IE patterns from the training set by an Apriori algorithm. The study is carried out in the context of extracting information related to potential treatments of colon cancer from medical abstracts. It focuses on extracting 3 kinds of semantic relations: • Treatment relation: the disease and its potential medical treatment • Dosage relation: the treatment and its dose • Effect type relation: the treatment and its effect type. The objectives of this study are to develop a method for automatic construction of IE patterns using semi-supervised learning, to develop an IE system for extracting disease-treatment information from medical abstracts, and to develop an ontology for representing disease-treatment information found in medical abstracts. |
URI: | http://hdl.handle.net/10356/19088 |
Schools: | Wee Kim Wee School of Communication and Information |
Rights: | Nanyang Technological University |
Fulltext Permission: | restricted |
Fulltext Availability: | With Fulltext |
Appears in Collections: | WKWSCI Theses |
Files in This Item:
File | Description | Size | Format |
---|---|---|---|
sciw060001.pdf (Restricted Access) | | 911.39 kB | Adobe PDF |
Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.