INSPECT: Intrinsic and Systematic Probing Evaluation for Code Transformers

Anjan Karmakar; Romain Robbes

doi:10.48550/arXiv.2312.05092

Journal Article, IEEE Transactions on Software Engineering, Year: 2024

Anjan Karmakar

Role: Author

Free University of Bozen-Bolzano

Romain Robbes

Role: Author

Centre National de la Recherche Scientifique

Laboratoire Bordelais de Recherche en Informatique

Université de Bordeaux

Institut Polytechnique de Bordeaux

Abstract

Pre-trained models of source code have recently been applied successfully to a wide variety of Software Engineering tasks; they have also seen some practical adoption, e.g., for code completion. Yet we still know very little about what these pre-trained models learn about source code. In this article, we use probing (simple diagnostic tasks that do not further train the models) to discover to what extent pre-trained models learn about specific aspects of source code. We use an extensible framework to define 15 probing tasks that exercise surface, syntactic, structural, and semantic characteristics of source code. We probe 8 pre-trained source code models, as well as a natural language model (BERT) as our baseline. We find that models that incorporate some structural information (such as GraphCodeBERT) have a better representation of source code characteristics. Surprisingly, we find that for some probing tasks, BERT is competitive with the source code models, indicating that there are ample opportunities to improve source-code-specific pre-training on the respective code characteristics. We encourage other researchers to evaluate their models with our probing task suite, so that they may peer into the hidden layers of the models and identify what intrinsic code characteristics are encoded.
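To make the idea of probing concrete, the following is a minimal sketch and not the authors' framework. It assumes a hypothetical `toy_frozen_encoder` that stands in for a frozen pre-trained model (a real probe would read hidden-layer vectors from the model instead), a made-up "contains a loop" task, and a linear probe trained with the perceptron rule, which is a simple substitute chosen for illustration. The point it illustrates is that only the small probe is trained, while the encoder is never updated.

```python
from typing import List, Tuple

Vector = List[float]


def toy_frozen_encoder(snippet: str) -> Vector:
    """Hypothetical stand-in for a frozen pre-trained model (assumption).

    A real probe would take a hidden-layer vector from the model; here we
    compute three hand-made features so the sketch runs without any model.
    """
    tokens = snippet.replace("(", " ( ").replace(")", " ) ").split()
    loops = sum(tok in ("for", "while") for tok in tokens)
    parens = tokens.count("(")
    return [float(loops), parens / 10.0, len(tokens) / 20.0]


def train_linear_probe(
    vectors: List[Vector], labels: List[int], max_epochs: int = 100
) -> Tuple[Vector, float]:
    """Train a linear probe with the perceptron rule; the encoder is untouched."""
    weights = [0.0] * len(vectors[0])
    bias = 0.0
    for _ in range(max_epochs):
        mistakes = 0
        for vec, label in zip(vectors, labels):
            target = 1 if label == 1 else -1
            score = sum(w * x for w, x in zip(weights, vec)) + bias
            if target * score <= 0:  # misclassified: nudge the boundary
                weights = [w + target * x for w, x in zip(weights, vec)]
                bias += target
                mistakes += 1
        if mistakes == 0:  # every training example is classified correctly
            break
    return weights, bias


def predict(weights: Vector, bias: float, vec: Vector) -> int:
    score = sum(w * x for w, x in zip(weights, vec)) + bias
    return 1 if score > 0 else 0


def accuracy(weights: Vector, bias: float, data: List[Tuple[str, int]]) -> float:
    hits = sum(predict(weights, bias, toy_frozen_encoder(s)) == y for s, y in data)
    return hits / len(data)


# Illustrative probing task (made up for this sketch): does the snippet loop?
TRAIN = [
    ("for i in range(3): print(i)", 1),
    ("while n > 0: n -= 1", 1),
    ("x = compute(a, b)", 0),
    ("return max(a, b)", 0),
    ("for item in items: total += item", 1),
    ("if ready: start()", 0),
]
HELD_OUT = [
    ("while queue: queue.pop()", 1),
    ("y = len(name)", 0),
]

train_vectors = [toy_frozen_encoder(s) for s, _ in TRAIN]
train_labels = [y for _, y in TRAIN]
probe_weights, probe_bias = train_linear_probe(train_vectors, train_labels)

train_accuracy = accuracy(probe_weights, probe_bias, TRAIN)
held_out_accuracy = accuracy(probe_weights, probe_bias, HELD_OUT)
print(f"train accuracy: {train_accuracy:.2f}, held-out accuracy: {held_out_accuracy:.2f}")
```

In this sketch, high probe accuracy would suggest that the property is linearly recoverable from the frozen vectors; with a real model, the same loop could be repeated per hidden layer to compare where a characteristic is encoded.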

Keywords

Software Engineering (cs.SE); Machine Learning (cs.LG); FOS: Computer and information sciences; Machine Learning for Source Code; Probing; Benchmarking; Transformers; Pre-trained models

Domains

Computer Science [cs]

Main file

2312.05092v1.pdf (9.31 MB)

Origin: Files produced by the author(s)

Contributor: Romain Robbes

https://hal.science/hal-04797531

Submitted on: Friday, November 22, 2024, 10:41:31

Last modified on: Wednesday, November 27, 2024, 03:40:28

Dates and versions

hal-04797531 , version 1 (22-11-2024)

Identifiers

HAL Id: hal-04797531, version 1
DOI: 10.48550/arXiv.2312.05092

Cite

Anjan Karmakar, Romain Robbes. INSPECT: Intrinsic and Systematic Probing Evaluation for Code Transformers. IEEE Transactions on Software Engineering, 2024, 50 (2), ⟨10.48550/arXiv.2312.05092⟩. ⟨hal-04797531⟩

Collections

CNRS UNIV-BORDEAUX
