A ML-LLM pairing for better code comment classification

Hanna Abi Akl

Communication Dans Un Congrès Année : 2023

A ML-LLM pairing for better code comment classification

(1, 2)

1
2

Hanna Abi Akl

Fonction : Auteur

Data ScienceTech Institute

Web-Instrumented Man-Machine Interactions, Communities and Semantics

Résumé

The "Information Retrieval in Software Engineering (IRSE)" at FIRE 2023 shared task introduces code comment classification, a challenging task that pairs a code snippet with a comment that should be evaluated as either useful or not useful to the understanding of the relevant code. We answer the code comment classification shared task challenge by providing a two-fold evaluation: from an algorithmic perspective, we compare the performance of classical machine learning systems and complement our evaluations from a data-driven perspective by generating additional data with the help of large language model (LLM) prompting to measure the potential increase in performance. Our best model, which took second place in the shared task, is a Neural Network with a Macro-F1 score of 88.401% on the provided seed data and a 1.5% overall increase in performance on the data generated by the LLM.

Mots clés

Natural Language Processing Machine Learning Information Retrieval Large Language Models Code Comprehension Comment Quality

Domaines

Intelligence artificielle [cs.AI] Génie logiciel [cs.SE]

Fichier principal

FIRE_IRSE_2023.pdf (810.82 Ko)

Origine : Fichiers produits par l'(les) auteur(s)
Licence : CC BY - Paternité

Hanna ABI AKL : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-04311401

Soumis le : mardi 28 novembre 2023-10:56:30

Dernière modification le : lundi 26 février 2024-11:22:08

Dates et versions

hal-04311401 , version 1 (28-11-2023)

Licence

Paternité

Identifiants

HAL Id : hal-04311401 , version 1
ARXIV : 2310.10275

Citer

Hanna Abi Akl. A ML-LLM pairing for better code comment classification. FIRE (Forum for Information Retrieval Evaluation) 2023, Prasenjit Majumder; Kripabandhu Ghosh; Thomas Mandl; Debasis Ganguly; Parth Gupta; Bhaskar Mitra; Srijoni Majumdar; Jyoti D Pawar; Pabitra Mitra; Parth Mehta, Dec 2023, Goa, India. ⟨hal-04311401⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA I3S WIMMICS INRIA2 UNIV-COTEDAZUR

164 Consultations

39 Téléchargements

A ML-LLM pairing for better code comment classification

Résumé

Mots clés

Domaines

Dates et versions

Licence

Identifiants

Citer

Exporter

Collections

Altmetric

Partager