Tree-based Mining of Fine-grained Code Changes to Detect Unknown Change Patterns

被引:2
|
作者
Higo, Yoshiki [1 ]
Matsumoto, Junnosuke [1 ]
Kusumoto, Shinji [1 ]
机构
[1] Osaka Univ, Grad Sch Informat Sci & Technol, Suita, Osaka, Japan
关键词
Mining code change pattern; Repository mining; Edit script; Code change pattern; EFFICIENT ALGORITHM;
D O I
10.1109/APSEC53868.2021.00014
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In software development, source code is repeatedly changed due to various reasons. Similar code changes are called change patterns. Identifying change patterns is useful to support software development in a variety of ways. For example, change patterns can be used to collect ingredients for code completion or automated program repair. Many research studies have proposed various techniques that detect change patterns. For example, Negara et al. proposed a technique that derives change patterns from the edit scripts. Negara's technique can detect fine-grained change patterns, but we consider that there is room to improve their technique. We found that Negara's technique occasionally generates change patterns from structurally-different changes, and we also uncovered that the reason why such change patterns are generated is that their technique performs text comparisons in matching changes. In this study, we propose a new change mining technique to detect change patterns only from structurally-identical changes by taking into account the structure of the abstract syntax trees. We implemented the proposed technique as a tool, TC2P, and we compared it with Negara's technique. As a result, we confirmed that TC2P was not only able to detect change patterns more adequately than the prior technique but also to detect change patterns that were not detected by the prior technique.
引用
收藏
页码:61 / 71
页数:11
相关论文
共 50 条
  • [41] Aspect-Based Sentiment Analysis as Fine-Grained Opinion Mining
    Diaz, Gerardo Ocampo
    Zhang, Xuanming
    Ng, Vincent
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 6804 - 6811
  • [42] TMTCPT: The Tree Method based on the Taxonomic Categorization and the Phylogenetic Tree for fine-grained categorization
    Bameri, Fateme
    Pourreza, Hamid-Reza
    Taherinia, Amir-Hossein
    Aliabadian, Mansour
    Mortezapour, Hamid-Reza
    Abdilzadeh, Raziyeh
    BIOSYSTEMS, 2020, 195
  • [43] An Empirical Study on the Characteristics of Python']Python Fine-Grained Source Code Change Types
    Lin, Wei
    Chen, Zhifei
    Ma, Wanwangying
    Chen, Lin
    Xu, Lei
    Xu, Baowen
    32ND IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE AND EVOLUTION (ICSME 2016), 2016, : 189 - 200
  • [44] Adopting data interpretation on mining fine-grained near-repeat patterns in crimes
    Wang, Ke
    Cai, Zhiping
    Zhu, Peidong
    Cui, Pengshuai
    Zhu, Haoyang
    Li, Yangyang
    JOURNAL OF FORENSIC AND LEGAL MEDICINE, 2018, 55 : 76 - 86
  • [45] Method based on contrastive learning for fine-grained unknown malicious traffic classification
    Wang Y.
    Guo Y.
    Chen Q.
    Fang C.
    Lin R.
    Tongxin Xuebao/Journal on Communications, 2022, 43 (10): : 12 - 25
  • [46] Fine-Grained ICD Code Assignment Using Ontology-Based Classification
    Carberry, Joshua
    Xu, Haiping
    2022 IEEE 23RD INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION FOR DATA SCIENCE (IRI 2022), 2022, : 228 - 233
  • [47] Deep Learning Model for Fine-Grained Aspect-Based Opinion Mining
    Abas, Ahmed R.
    El-Henawy, Ibrahim
    Mohamed, Hossam
    Abdellatif, Amr
    IEEE ACCESS, 2020, 8 : 128845 - 128855
  • [48] CNN-Based Sequence Labeling for Fine-Grained Opinion Mining of Microblogs
    Cheng, Jiajun
    Li, Pei
    Zhang, Xin
    Ding, Zhaoyun
    Wang, Hui
    TRENDS AND APPLICATIONS IN KNOWLEDGE DISCOVERY AND DATA MINING, 2017, 2017, 10526 : 94 - 103
  • [49] Declarative Visitors to Ease Fine-grained Source Code Mining with Full History on Billions of AST Nodes
    Dyer, Robert
    Rajan, Hridesh
    Nguyen, Tien N.
    ACM SIGPLAN NOTICES, 2014, 49 (03) : 23 - 32
  • [50] Mining fine-grained sequential travel patterns from social geo-tagged photos
    Thanh-Hieu Bui
    Park, Seong-Bae
    FRONTIERS OF COMPUTER SCIENCE, 2018, 12 (06) : 1255 - 1257