Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/69477
Title: On domain knowledge organization and extraction in software engineering
Authors: Ye, Deheng
Keywords: DRNTU::Engineering::Computer science and engineering
Issue Date: 2017
Source: Ye, D. (2017). On domain knowledge organization and extraction in software engineering. Doctoral thesis, Nanyang Technological University, Singapore.
Abstract: Developers' social information seeking on the Web is unable to benefit from the recent significant advances of semantics-oriented applications, such as knowledge graph and direct answers. This is largely because existing approaches to analyzing software engineering social content, such as the discussions on Stack Overflow, 1) treat software-specific entities in the same way as other textual content, and 2) fall short to consider the semantic linkages between software knowledge. In this thesis, we perform a pioneering study towards the long-term goal of enabling domain-specific knowledge graph and semantic search in software engineering. Using the developer-generated content on Stack Overflow, we formulate a series of research problems that are the key steps for achieving this goal. These include: 1) we investigate the online knowledge connection in software engineering by analyzing the knowledge network formed by Stack Overflow users' URL sharing activities. Through this study, we obtain an overall understanding of the domain knowledge organization, correlation and evolution, which inspires further research on extracting and linking software engineering knowledge. 2) we propose semi-supervised methods for extracting software-specific named entities, such as API mentions, from informal natural language text. 3) we develop automated techniques to link semantically linkable knowledge at document-level, and to link a recognized API mention to its fully qualified form as appeared in the API documentation at entity-level. We investigate the development and enhancement of NLP and IR techniques for the design challenges of these research problems brought by the socio-technical nature of software engineering social content. Extensive experiments show the effectiveness of our proposed approaches for analyzing and solving these problems.
URI: http://hdl.handle.net/10356/69477
DOI: 10.32657/10356/69477
Fulltext Permission: open
Fulltext Availability: With Fulltext
Appears in Collections:SCSE Theses

Files in This Item:
File Description SizeFormat 
mainOneSide.pdfPhD thesis of Deheng YE2.71 MBAdobe PDFThumbnail
View/Open

Page view(s)

216
Updated on Oct 19, 2021

Download(s) 50

128
Updated on Oct 19, 2021

Google ScholarTM

Check

Altmetric


Plumx

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.