Sentence embedding and fine-tuning to automatically identify duplicate bugs

被引:0
|
作者
Isotani, Haruna [1 ]
Washizaki, Hironori [1 ]
Fukazawa, Yoshiaki [1 ]
Nomoto, Tsutomu [2 ]
Ouji, Saori [3 ]
Saito, Shinobu [3 ]
机构
[1] Waseda Univ, Dept Comp Sci & Engn, Tokyo, Japan
[2] NTT CORP, Software Innovat Ctr, Tokyo, Japan
[3] NTT CORP, Comp & Data Sci Labs, Tokyo, Japan
来源
关键词
bug reports; duplicate detection; BERT; sentence embedding; natural language processing; information retrieval;
D O I
10.3389/fcomp.2022.1032452
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Industrial software maintenance is critical but burdensome. Activities such as detecting duplicate bug reports are often performed manually. Herein an automated duplicate bug report detection system improves maintenance efficiency using vectorization of the contents and deep learning-based sentence embedding to calculate the similarity of the whole report from vectors of individual elements. Specifically, sentence embedding is realized using Sentence-BERT fine tuning. Additionally, its performance is experimentally compared to baseline methods to validate the proposed system. The proposed system detects duplicate bug reports more effectively than existing methods.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Duplicate Bug Report Detection by Using Sentence Embedding and Fine-tuning
    Isotani, Haruna
    Washizaki, Hironori
    Fukazawa, Yoshiaki
    Nomoto, Tsutomu
    Ouji, Saori
    Saito, Shinobu
    2021 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE AND EVOLUTION (ICSME 2021), 2021, : 535 - 544
  • [2] Beyond Fine-tuning: Few-Sample Sentence Embedding Transfer
    Garg, Siddhant
    Sharma, Rohit Kumar
    Liang, Yingyu
    1ST CONFERENCE OF THE ASIA-PACIFIC CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 10TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (AACL-IJCNLP 2020), 2020, : 460 - 469
  • [3] Fine-tuning biomaterial degradation by embedding hydrolytic enzymes
    Ganesh, Manoj
    Gross, Richard
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2010, 239
  • [4] Fine-Tuning the Expression of Duplicate Genes by Translational Regulation in Arabidopsis and Maize
    Wang, Sishuo
    Chen, Youhua
    FRONTIERS IN PLANT SCIENCE, 2019, 10
  • [5] Quantum device fine-tuning using unsupervised embedding learning
    van Esbroeck, N. M.
    Lennon, D. T.
    Moon, H.
    Nguyen, V
    Vigneau, F.
    Camenzind, L. C.
    Yu, L.
    Zumbuehl, D. M.
    Briggs, G. A. D.
    Sejdinovic, D.
    Ares, N.
    NEW JOURNAL OF PHYSICS, 2020, 22 (09):
  • [6] Embedding Hallucination for Few-Shot Language Fine-tuning
    Jian, Yiren
    Gao, Chongyang
    Vosoughi, Soroush
    NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 5522 - 5530
  • [7] Efficient Unsupervised Sentence Compression by Fine-tuning Transformers with Reinforcement Learning
    Ghalandari, Demian Gholipour
    Hokamp, Chris
    Ifrim, Georgiana
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 1267 - 1280
  • [8] Fine-tuning
    不详
    AVIATION WEEK & SPACE TECHNOLOGY, 2001, 155 (02): : 21 - 21
  • [9] Fine-tuning
    Rachel Smallridge
    Nature Reviews Molecular Cell Biology, 2004, 5 (2) : 79 - 79
  • [10] Fine-Tuning
    Manson, Neil A.
    TPM-THE PHILOSOPHERS MAGAZINE, 2019, (86): : 99 - 105