Alleviating Patch Overfitting with Automatic Test Generation: A Study of Feasibility and Effectiveness for the Nopol Repair System

Abstract: Among the many kinds of program repair techniques, one widely studied family is test suite based repair. However, test suites are in essence input-output specifications and are therefore typically inadequate for completely specifying the expected behavior of the program under repair. Consequently, the patches generated by test suite based repair techniques can simply overfit to the test suite used and fail to generalize to other tests. We analyze the overfitting problem in program repair in depth and give a classification of it. This classification will help the community better understand the overfitting problem and design techniques to defeat it. We further propose and evaluate an approach called UnsatGuided, which aims to alleviate the overfitting problem for synthesis-based repair techniques by means of automatic test case generation. The approach uses additional automatically generated tests to strengthen the repair constraint used by synthesis-based repair techniques. We analyze the effectiveness of UnsatGuided in two ways: 1) analytically, with respect to alleviating two different kinds of overfitting issues; 2) empirically, based on an experiment over the 224 bugs of the Defects4J repository. The main result is that automatic test generation is effective in alleviating one kind of overfitting issue, regression introduction, but, due to the oracle problem, has minimal positive impact on alleviating the other kind, incomplete fixing.
Document type:
Journal article
Empirical Software Engineering, Springer Verlag, In press, 〈10.1007/s10664-018-9619-4〉

Cited literature: 62 references

https://hal.inria.fr/hal-01774223
Contributor: Zhongxing Yu
Submitted on: Monday, April 23, 2018 - 14:44:56
Last modified on: Tuesday, July 3, 2018 - 11:23:26

File

alleviating_Overfitting.pdf
Files produced by the author(s)


Citation

Zhongxing Yu, Matias Martinez, Benjamin Danglot, Thomas Durieux, Martin Monperrus. Alleviating Patch Overfitting with Automatic Test Generation: A Study of Feasibility and Effectiveness for the Nopol Repair System. Empirical Software Engineering, Springer Verlag, In press, 〈10.1007/s10664-018-9619-4〉. 〈hal-01774223〉

Metrics

Record views

170

File downloads

136