|
Title:
|
Discourse parsing of sociology dissertation abstracts using decision tree induction.
|
|
Author:
|
Ou, Shiyan.; Khoo, Christopher Soo Guan.; Heng, Hui Ying.; Goh, Dion Hoe Lian.
|
|
Copyright year:
|
2003 |
|
Abstract:
|
In this study, we investigated the use of decision tree induction to
parse the macro-level discourse structure of sociology dissertation abstracts.
We treated discourse parsing as a sentence categorization task. The attributes
used in constructing the decision tree models were stemmed words that
occurred in at least 35 sentences (out of 3694 sentences in 300 sample
abstracts). Sentence location information was also used. The model obtained
an accuracy rate of 71.3% when applied to a test sample of 100 abstracts.
Another model that made use of information regarding the presence of 31
indicator words in neighboring sentences was also developed. Although this
model did not obtain better results, a comparison of the two models suggests
that an improvement in the classification of sentences in problem statement
and research method section is possible by combining the models. |
|
Subject:
|
DRNTU::Library and information science. |
|
Type:
|
Conference Paper |
|
Conference name:
|
Proceedings of the 13th Annual ASIST SIG CR Workshop |
|
School:
|
Wee Kim Wee School of Communication and Information |
|
Rights:
|
© 2003 Proceedings of the 13th Annual ASIST SIG CR Workshop. |
|
Version:
|
Accepted version |