Automated change-prone class prediction on unlabeled dataset using unsupervised method

被引:18
|
作者
Yan, Meng [2 ]
Zhang, Xiaohong [1 ,2 ]
Liu, Chao [2 ]
Xu, Ling [2 ]
Yang, Mengning [2 ]
Yang, Dan [2 ]
机构
[1] Minist Educ, Key Lab Dependable Serv Comp Cyber Phys Soc, Chongqing 400044, Peoples R China
[2] Chongqing Univ, Sch Software Engn, Chongqing 401331, Peoples R China
基金
中国国家自然科学基金;
关键词
Software maintenance; Change-prone prediction; Unlabeled dataset; Unsupervised prediction; OBJECT-ORIENTED SOFTWARE; OPEN-SOURCE PRODUCTS; MAINTAINABILITY; METRICS; CODE; PROJECT; MODELS; SUITE;
D O I
10.1016/j.infsof.2017.07.003
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Context: Software change-prone class prediction can enhance software decision making activities during software maintenance (e.g., resource allocating). Researchers have proposed many change-prone class prediction approaches and most are effective on labeled datasets (projects with historical labeled data). These approaches usually build a supervised model by learning from historical labeled data. However, a major challenge is that this typical change-prone prediction setting cannot be used for unlabeled datasets (e.g., new projects or projects with limited historical data). Although the cross-project prediction is a solution on unlabeled dataset, it needs the prior labeled data from other projects and how to select the appropriate training project is a difficult task. Objective: We aim to build a change-prone class prediction model on unlabeled datasets without the need of prior labeled data. Method: We propose to tackle this task by adopting a state-of-art unsupervised method, namely CLAMI. In addition, we propose a novel unsupervised approach CLAMI+ by extending CLAMI. The key idea is to enable change-prone class prediction on unlabeled dataset by learning from itself. Results: The experiments among 14 open source projects show that the unsupervised methods achieve comparable results to the typical supervised within-project and cross-project prediction baselines in average and the proposed CLAMI+ slightly improves the CLAMI method in average. Conclusion: Our method discovers that it is effective for building change-prone class prediction model by using unsupervised method. It is convenient for practical usage in industry, since it does not need prior labeled data. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:1 / 16
页数:16
相关论文
共 50 条
  • [21] PREDICTION OF CHANGES IN VEGETATION DISTRIBUTION UNDER CLIMATE CHANGE SCENARIOS USING MODIS DATASET
    Hirayama, Hidetake
    Tomita, Mizuki
    Hara, Keitarou
    XXIII ISPRS CONGRESS, COMMISSION VIII, 2016, 41 (B8): : 883 - 887
  • [22] Prediction of prone-to-supine tumor displacement in the breast using patient position change: investigation with prone MRI and supine CT
    Hiroko Satake
    Satoko Ishigaki
    Mariko Kitano
    Shinji Naganawa
    Breast Cancer, 2016, 23 : 149 - 158
  • [23] Prediction of prone-to-supine tumor displacement in the breast using patient position change: investigation with prone MRI and supine CT
    Satake, Hiroko
    Ishigaki, Satoko
    Kitano, Mariko
    Naganawa, Shinji
    BREAST CANCER, 2016, 23 (01) : 149 - 158
  • [24] PREDICTION OF PERIVENTRICULAR LEUKOMALACIA OCCURRENCE IN NEONATES USING A NOVEL UNSUPERVISED LEARNING METHOD
    Bender, Dieter
    Jalali, Ali
    Nataraj, C.
    7TH ANNUAL DYNAMIC SYSTEMS AND CONTROL CONFERENCE, 2014, VOL 2, 2014,
  • [25] Unsupervised Change Point Detection Using a Weight Graph Method for Process Monitoring
    An, Ruqiao
    Yang, Chunjie
    Pan, Yijun
    INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 2019, 58 (04) : 1624 - 1634
  • [26] A SEMI-AUTOMATED STRUCTURAL CLASS DEPENDENT METHOD FOR THE PREDICTION OF PROTEIN SECONDARY STRUCTURES
    Popov, I.
    Vassilev, D.
    BIOTECHNOLOGY & BIOTECHNOLOGICAL EQUIPMENT, 2010, 24 (03) : 2044 - 2048
  • [27] SGBBA: An Efficient Method for Prediction System in Machine Learning using Imbalance Dataset
    Islam, Saiful
    Sara, Umme
    Kawsar, Abu
    Rahman, Anichur
    Kundu, Dipanjali
    Dipta, Diganta Das
    Karim, A. N. M. Rezaul
    Hasan, Mahedi
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (03) : 430 - 441
  • [28] UNSUPERVISED MULTI-CLASS CHANGE DETECTION IN BITEMPORAL MULTISPECTRAL IMAGES USING BAND EXPANSION
    Liu, Sicong
    Du, Qian
    Bruzzone, Lorenzo
    Samat, Alim
    Tong, Xiaohua
    IGARSS 2018 - 2018 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2018, : 1910 - 1913
  • [29] Numerical prediction of phase change using the enthalpy method
    Choi, WR
    Ryou, HS
    Choi, YK
    Ryu, SY
    HEAT TRANSFER 1998, VOL 7: GENERAL PAPERS, 1998, : 127 - 131
  • [30] Prediction of change prone classes using evolution-based and object-oriented metrics
    Malhotra, Ruchika
    Khanna, Megha
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2018, 34 (03) : 1755 - 1766