Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/152782
Title: Mining likely analogical APIs across third-party libraries via large-scale unsupervised API semantics embedding
Authors: Chen, Chunyang
Xing, Zhenchang
Liu, Yang
Xiong, Kent Long Xiong
Keywords: Engineering::Computer science and engineering
Issue Date: 2021
Source: Chen, C., Xing, Z., Liu, Y. & Xiong, K. L. X. (2021). Mining likely analogical APIs across third-party libraries via large-scale unsupervised API semantics embedding. IEEE Transactions On Software Engineering, 47(3), 432-447. https://dx.doi.org/10.1109/TSE.2019.2896123
Journal: IEEE Transactions on Software Engineering
Abstract: Establishing API mappings between third-party libraries is a prerequisite step for library migration tasks. Manually establishing API mappings is tedious due to the large number of APIs to be examined. Having an automatic technique to create a database of likely API mappings can significantly ease the task. Unfortunately, existing techniques either adopt supervised learning mechanism that requires already-ported or functionality similar applications across major programming languages or platforms, which are difficult to come by for an arbitrary pair of third-party libraries, or cannot deal with lexical gap in the API descriptions of different libraries. To overcome these limitations, we present an unsupervised deep learning based approach to embed both API usage semantics and API description (name and document) semantics into vector space for inferring likely analogical API mappings between libraries. Based on deep learning models trained using tens of millions of API call sequences, method names and comments of 2.8 millions of methods from 135,127 GitHub projects, our approach significantly outperforms other deep learning or traditional information retrieval (IR) methods for inferring likely analogical APIs. We implement a proof-of-concept website (https://similarapi.appspot.com) which can recommend analogical APIs for 583,501 APIs of 111 pairs of analogical Java libraries with diverse functionalities. This scale of third-party analogical-API database has never been achieved before.
URI: https://hdl.handle.net/10356/152782
ISSN: 0098-5589
DOI: 10.1109/TSE.2019.2896123
Rights: © 2019 IEEE. All rights reserved.
Fulltext Permission: none
Fulltext Availability: No Fulltext
Appears in Collections:SCSE Journal Articles

Page view(s)

37
Updated on Nov 30, 2021

Google ScholarTM

Check

Altmetric


Plumx

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.