Skip to Main content Skip to Navigation
Conference papers

Untangling Fine-Grained Code Changes

Martín Dias 1 Alberto Bacchelli 2 Georgios Gousios 3 Damien Cassou 1 Stéphane Ducasse 1 
1 RMOD - Analyses and Languages Constructs for Object-Oriented Application Evolution
Inria Lille - Nord Europe, CRIStAL - Centre de Recherche en Informatique, Signal et Automatique de Lille - UMR 9189
SERG - Software Engineering Research Group [Delft]
3 Digital Security Group [Nijmegen]
ICIS - Institute for Computing and Information Sciences [Nijmegen]
Abstract : After working for some time, developers commit their code changes to a version control system. When doing so, they often bundle unrelated changes (e.g., bug fix and refactoring) in a single commit, thus creating a so-called tangled commit. Sharing tangled commits is problematic because it makes review, reversion, and integration of these commits harder and historical analyses of the project less reliable. Researchers have worked at untangling existing commits, i.e., finding which part of a commit relates to which task. In this paper, we contribute to this line of work in two ways: (1) A publicly available dataset of untangled code changes, created with the help of two developers who accurately split their code changes into self contained tasks over a period of four months; (2) a novel approach, EpiceaUntangler, to help developers share untangled commits (aka. atomic commits) by using fine-grained code change information. EpiceaUntangler is based and tested on the publicly available dataset, and further evaluated by deploying it to 7 developers, who used it for 2 weeks. We recorded a median success rate of 91% and average one of 75%, in automatically creating clusters of untangled fine-grained code changes.
Complete list of metadata
Contributor : Lse Lse Connect in order to contact the contributor
Submitted on : Friday, February 20, 2015 - 10:47:09 AM
Last modification on : Wednesday, March 23, 2022 - 3:51:21 PM
Long-term archiving on: : Thursday, May 28, 2015 - 11:55:48 AM


Files produced by the author(s)




  • HAL Id : hal-01116225, version 1
  • ARXIV : 1502.06757


Martín Dias, Alberto Bacchelli, Georgios Gousios, Damien Cassou, Stéphane Ducasse. Untangling Fine-Grained Code Changes. SANER: International Conference on Software Analysis, Evolution, and Reengineering, Mar 2015, Montréal, Canada. ⟨hal-01116225⟩



Record views


Files downloads