Mining analogical libraries in Q&A discussions - Incorporating relational and categorical knowledge into word embedding

Chunyang Chen, Sa Gao, Zhenchang Xing

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

67 Citations (Scopus)

Abstract

Third-party libraries are an integral part of many software projects. Developers often need to find analogical libraries that provide features comparable to those of the libraries they already know. Existing methods for finding analogical libraries rely on community-curated lists of libraries, blogs, or Q&A posts, which often contain overwhelming or out-of-date information. In this paper, we present a new approach to recommending analogical libraries based on a knowledge base of analogical libraries mined from the tags of millions of Stack Overflow questions. The novelty of our approach is to solve analogical-library questions by combining a state-of-the-art word embedding technique with domain-specific relational and categorical knowledge mined from Stack Overflow. We implement our approach in a proof-of-concept web application (https://graphofknowledge.appspot.com/similartech). The evaluation results show that our approach can make accurate recommendations of analogical libraries (Precision@1=0.81 and Precision@5=0.67). Google Analytics data on the website traffic provide initial evidence of the potential usefulness of our web application for software developers.
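The combination described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the embedding vectors, the `CATEGORY` and `LANGUAGE_OF` tables, and the function `analogical_libraries` are all hypothetical and hand-made for illustration, standing in for embeddings and knowledge that the paper mines from Stack Overflow tags. The sketch ranks tags by cosine similarity to the query library, then keeps only those that the categorical knowledge marks as libraries and that the relational knowledge links to the requested language.

```python
import math

# Toy tag embeddings with made-up numbers (assumption: real vectors
# would be learned from Stack Overflow tag data).
EMBEDDINGS = {
    "junit":    [0.90, 0.10, 0.80],
    "nunit":    [0.85, 0.15, 0.75],
    "unittest": [0.80, 0.20, 0.90],
    "java":     [0.10, 0.90, 0.20],
    "testing":  [0.70, 0.30, 0.85],
}

# Hypothetical categorical knowledge: what kind of thing each tag names.
CATEGORY = {
    "junit": "library",
    "nunit": "library",
    "unittest": "library",
    "java": "language",
    "testing": "concept",
}

# Hypothetical relational knowledge: the language each library is used with.
LANGUAGE_OF = {"junit": "java", "nunit": "c#", "unittest": "python"}


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def analogical_libraries(query, target_language, k=5):
    """Return up to k library tags for target_language, most similar first."""
    query_vec = EMBEDDINGS[query]
    candidates = [
        tag
        for tag in EMBEDDINGS
        if tag != query
        and CATEGORY.get(tag) == "library"              # categorical filter
        and LANGUAGE_OF.get(tag) == target_language     # relational filter
    ]
    ranked = sorted(
        candidates,
        key=lambda tag: cosine(query_vec, EMBEDDINGS[tag]),
        reverse=True,
    )
    return ranked[:k]


if __name__ == "__main__":
    print(analogical_libraries("junit", "python"))
    print(analogical_libraries("junit", "c#"))
```

In this toy setup the tag "testing" is close to "junit" in the embedding space, yet it never appears in the results because the categorical filter excludes non-library tags. That is the kind of noise the added knowledge is meant to remove under these assumptions.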

Original language: English
Title of host publication: 2016 IEEE 23rd International Conference on Software Analysis, Evolution, and Reengineering, SANER 2016
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 338-348
Number of pages: 11
ISBN (Electronic): 9781509018550
Publication status: Published - 20 May 2016
Externally published: Yes
Event: 23rd IEEE International Conference on Software Analysis, Evolution, and Reengineering, SANER 2016 - Suita, Osaka, Japan
Duration: 14 Mar 2016 to 18 Mar 2016

Publication series

Name: 2016 IEEE 23rd International Conference on Software Analysis, Evolution, and Reengineering, SANER 2016
Volume: 1

Conference

Conference: 23rd IEEE International Conference on Software Analysis, Evolution, and Reengineering, SANER 2016
Country/Territory: Japan
City: Suita, Osaka
Period: 14/03/16 to 18/03/16
