Understanding systematic and collaborative code changes by mining evolutionary trajectory patterns

Qingtao Jiang, Xin Peng*, Hai Wang, Zhenchang Xing, Wenyun Zhao

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

9 Citations (Scopus)

Abstract

The life cycle of a large-scale software system can undergo many releases. Each release often involves hundreds or thousands of revisions committed by many developers over time. Many code changes are made in a systematic and collaborative way. However, such systematic and collaborative code changes are often undocumented and hidden in the evolution history of a software system. It is desirable to recover commonalities and associations among dispersed code changes in the evolutionary trajectory of a software system. In this paper, we present Summarizing Evolutionary Trajectory by Grouping and Aggregation (SETGA), an approach to summarizing historical commit records as trajectory patterns by grouping and aggregating relevant code changes committed over time. The SETGA extracts change operations from a series of commit records from version control systems. It then groups extracted change operations by their common properties from different dimensions such as change operation types, developers, and change locations. After that, SETGA aggregates relevant change operation groups by mining various associations among them. We implement SETGA and conduct an empirical study with 3 open-source systems. We investigate underlying evolution rules and problems that can be revealed by the identified patterns and analyze the evolution of trajectory patterns in different periods. The results show that SETGA can identify various types of trajectory patterns that are useful for software evolution management and quality assurance.

Original languageEnglish
Article numbere1840
JournalJournal of software: Evolution and Process
Volume29
Issue number3
DOIs
Publication statusPublished - 1 Mar 2017
Externally publishedYes

Fingerprint

Dive into the research topics of 'Understanding systematic and collaborative code changes by mining evolutionary trajectory patterns'. Together they form a unique fingerprint.

Cite this