CC2Vec: Distributed Representations of Code Changes - Archive ouverte HAL Access content directly
Conference Papers Year : 2020

CC2Vec: Distributed Representations of Code Changes

(1) , (1) , (1) , (2)
1
2

Abstract

Existing work on software patches often use features specific to a single task. These works often rely on manually identified features, and human effort is required to identify these features for each task. In this work, we propose CC2Vec, a neural network model that learns a representation of code changes guided by their accompanying log messages, which represent the semantic intent of the code changes. CC2Vec models the hierarchical structure of a code change with the help of the attention mechanism and uses multiple comparison functions to identify the differences between the removed and added code. To evaluate if CC2Vec can produce a distributed representation of code changes that is general and useful for multiple tasks on software patches, we use the vectors produced by CC2Vec for three tasks: log message generation, bug fixing patch identification, and just-in-time defect prediction. In all tasks, the models using CC2Vec outperform the state-of-the-art techniques.
Fichier principal
Vignette du fichier
main.pdf (1.19 Mo) Télécharger le fichier

Dates and versions

hal-03030530 , version 1 (30-11-2020)

Identifiers

Cite

Thong Hoang, Hong Jin Kang, David Lo, Julia Lawall. CC2Vec: Distributed Representations of Code Changes. ICSE 2020 - 42nd International Conference on Software Engineering, Jun 2020, Seoul / Virtual, South Korea. pp.518-529, ⟨10.1145/3377811.3380361⟩. ⟨hal-03030530⟩
86 View
508 Download

Altmetric

Share

Gmail Facebook Twitter LinkedIn More