A component recommender for bug reports using Discriminative Probability Latent Semantic Analysis

Cited by: 24
Authors
Yan, Meng [2]
Zhang, Xiaohong [1, 2, 3]
Yang, Dan [2]
Xu, Ling [2]
Kymer, Jeffrey D. [2]
Affiliations
[1] State Key Lab Coal Mine Disaster Dynam & Control, Chongqing 400044, Peoples R China
[2] Chongqing Univ, Sch Software Engn, Chongqing 401331, Peoples R China
[3] Key Lab Dependable Serv Comp Cyber Phys Soc, Minist Educ, Chongqing 400044, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Bug reports; Discriminative topic model; Component recommendation; Bug triage
DOI
10.1016/j.infsof.2016.01.005
Chinese Library Classification
TP [Automation technology, computer technology]
Subject classification code
0812
Abstract
Context: The component field in a bug report provides important location information that developers need when fixing bugs. Research has shown that assigning a bug report to the wrong component often causes problems and delays in bug fixes. A topic modeling technique, Latent Dirichlet Allocation (LDA), has previously been used to build a component recommender for bug reports.
Objective: We seek a better way to use topic modeling in building a component recommender.
Method: This paper presents a component recommender, DPLSA-JS, that combines the proposed Discriminative Probability Latent Semantic Analysis (DPLSA) model with Jensen-Shannon divergence. The DPLSA model provides a novel method for initializing the word distributions of the topics: in the training step, it uses previously assigned bug reports from the same component. This results in a correlation between the learned topics and the components.
Results: We evaluate the proposed approach on five open source projects: Mylyn, Gcc, Platform, Bugzilla and Firefox. The results show that, on average, the proposed approach outperforms the LDA-KL method by 30.08%, 19.60% and 14.13% for recall@1, recall@3 and recall@5, and outperforms the LDA-SVM method by 31.56%, 17.80% and 8.78% for recall@1, recall@3 and recall@5, respectively.
Conclusion: We find that using comments in the DPLSA-JS recommender does not always improve performance. Vocabulary size does matter in DPLSA-JS: each project needs to set its vocabulary size adaptively through experimentation. In addition, the correspondence between the learned topics and the components in DPLSA increases the discriminative power of the topics, which is useful for the recommendation task. © 2016 Elsevier B.V. All rights reserved.
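The ranking and evaluation steps named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: it assumes that each component and each new bug report has already been reduced to a topic distribution (the DPLSA training step is not shown), that components are ranked by ascending Jensen-Shannon divergence from the report, and that recall@k is the fraction of reports whose true component appears in the top k recommendations. The function names, component names and numbers are all hypothetical.

```python
import math


def kl_divergence(p, q):
    """Kullback-Leibler divergence KL(p || q) in bits; terms with p_i == 0 contribute 0."""
    return sum(pi * math.log2(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def js_divergence(p, q):
    """Jensen-Shannon divergence: symmetric, and bounded in [0, 1] with log base 2."""
    m = [(pi + qi) / 2 for pi, qi in zip(p, q)]
    return 0.5 * kl_divergence(p, m) + 0.5 * kl_divergence(q, m)


def recommend(report_topics, component_topics, k):
    """Return the k component names whose topic distributions are closest to the report."""
    ranked = sorted(
        component_topics,
        key=lambda name: js_divergence(report_topics, component_topics[name]),
    )
    return ranked[:k]


def recall_at_k(recommendations, true_components):
    """Fraction of reports whose true component is among the recommended ones."""
    hits = sum(
        1 for recs, truth in zip(recommendations, true_components) if truth in recs
    )
    return hits / len(true_components)


# Hypothetical topic distributions over three topics (assumed values, not from the paper).
component_topics = {
    "UI": [0.7, 0.2, 0.1],
    "Core": [0.1, 0.8, 0.1],
    "Tasks": [0.2, 0.2, 0.6],
}
new_report = [0.6, 0.3, 0.1]

top1 = recommend(new_report, component_topics, k=1)
print(top1)
print(recall_at_k([top1], ["UI"]))
```

Using base-2 logarithms keeps the divergence between 0 (identical distributions) and 1 (disjoint support), which makes scores comparable across reports. The mixture distribution m is nonzero wherever either input is nonzero, so no smoothing is needed here, unlike a plain KL-based ranking.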
Pages: 37-51
Number of pages: 15