Use and Misuse of Continuous Integration Features: An Empirical Study of Projects that (mis)use Travis CI

Authors - Keheliya Gallaba, Shane McIntosh
Venue - IEEE Transactions on Software Engineering, Vol. 46, No. 1, pp. 33–50, 2020

Related Tags - TSE 2020 continuous integration anti-patterns software evolution

Abstract - Continuous Integration (CI) is a popular practice where software systems are automatically compiled and tested as changes appear in the version control system of a project. Like other software artifacts, CI specifications require maintenance effort. Although there are several service providers like Travis CI offering various CI features, it is unclear which features are being (mis)used. In this paper, we present a study of feature use and misuse in 9,312 open source systems that use Travis CI. Analysis of the features that are adopted by projects reveals that explicit deployment code is rare—48.16% of the studied Travis CI specification code is instead associated with configuring job processing nodes. To analyze feature misuse, we propose Hansel—an anti-pattern detection tool for Travis CI specifications. We define four anti-patterns and Hansel detects anti-patterns in the Travis CI specifications of 894 projects in the corpus (9.60%), and achieves a recall of 82.76% in a sample of 100 projects. Furthermore, we propose Gretel—an anti-pattern removal tool for Travis CI specifications, which can remove 69.60% of the most frequently occurring anti-pattern automatically. Using Gretel, we have produced 36 accepted pull requests that remove Travis CI anti-patterns automatically.

Preprint - PDF

Bibtex

@article{gallaba2020tse,
  Author = {Keheliya Gallaba and Shane McIntosh},
  Title = {{Use and Misuse of Continuous Integration Features: An Empirical Study of Projects that (mis)use Travis CI}},
  Year = {2020},
  Journal = {IEEE Transactions on Software Engineering},
  Volume = {46},
  Number = {1},
  Pages = {33–50}
}