μBERT: Mutation Testing using Pre-Trained Language Models

Cited by: 6
Authors
Degiovanni, Renzo [1 ]
Papadakis, Mike [1 ]
Affiliations
[1] Univ Luxembourg, SnT, Luxembourg, Luxembourg
Keywords
DOI
10.1109/ICSTW55395.2022.00039
CLC Number
TP31 [Computer Software];
Subject Classification Number
081202; 0835
Abstract
We introduce μBERT, a mutation testing tool that uses a pre-trained language model (CodeBERT) to generate mutants. It masks a token in the expression given as input and asks CodeBERT to predict it; mutants are then generated by replacing the masked token with the predicted ones. We evaluate μBERT on 40 real faults from Defects4J and show that it detects 27 of the 40 faults, while the baseline (PiTest) detects 26 of them. We also show that μBERT can be twice as cost-effective as PiTest when the same number of mutants is analysed. Additionally, we evaluate the impact of μBERT's mutants when used by program assertion inference techniques, and show that they can help produce better specifications. Finally, we discuss the quality and naturalness of some interesting mutants produced by μBERT during our experimental evaluation.
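The following Python sketch illustrates the masking-and-prediction idea described in the abstract, using the HuggingFace transformers fill-mask pipeline; the checkpoint name (microsoft/codebert-base-mlm) and the helper function are assumptions for illustration, not μBERT's actual implementation.

# Minimal sketch (assumptions noted above), not muBERT's code: mask one token
# in an input expression and let CodeBERT propose replacements, each of which
# yields a candidate mutant.
from transformers import pipeline

# Assumed checkpoint: a CodeBERT model trained for masked-language modelling.
fill_mask = pipeline("fill-mask", model="microsoft/codebert-base-mlm")

def generate_mutants(expression: str, token: str, top_k: int = 5):
    """Mask the first occurrence of `token` and collect the model's predictions."""
    masked = expression.replace(token, fill_mask.tokenizer.mask_token, 1)
    candidates = fill_mask(masked, top_k=top_k)
    # Predictions identical to the original token are not mutants; drop them.
    return [c["sequence"] for c in candidates if c["token_str"].strip() != token]

# Example: mutating the relational operator of a Java guard expression.
print(generate_mutants("if (x > 0) { return x; }", ">"))

Each surviving prediction replaces the masked token in place, mirroring the abstract's description of generating mutants by substituting predicted tokens for masked ones.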
Pages: 160-169
Number of pages: 10