Predicting Programming Behavior in OSS Communities: A Case Study of NLP-based Approach

Cited by: 0
Authors
Huo, Manyan [1]
Yu, Yue [1]
Li, Zhixing [1]
Chang, Junsheng [1]
Affiliations
[1] Natl Univ Def Technol, Coll Comp, Key Lab Parallel & Distributed Comp Lab, Changsha, Peoples R China
Funding
National Key Research and Development Program of China
Keywords
Behavior Prediction; OSS Developer; word2vec; Mining Software Repositories;
DOI
10.1109/ICAICE51518.2020.00091
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Predicting developers' programming behaviors is an effective way to improve their development efficiency and to optimize the organization of project modules and files. However, little research has investigated this direction. To address this knowledge gap, we propose an NLP-based approach to predict programming behaviors in OSS (Open Source Software) communities. The proposed approach i) embeds the historical programming behavior data of a project into a multi-dimensional vector space to capture latent regularities in the data, ii) forms an eigenvector matrix reflecting the semantic relationships in the development behavior data, and iii) predicts the next programming behavior of a specific developer based on the eigenvector matrix. Our experiments on five OSS projects show that the prediction accuracy of the proposed approach can reach up to about 50%, indicating that it can capture regularities in development behavior data and effectively predict the programming behavior of developers. Our work can provide valuable assistance for developers' programming and for project maintenance in practice.
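The embed-then-predict idea in the abstract can be illustrated with a small sketch. This is not the authors' implementation: it replaces word2vec with plain co-occurrence count vectors (a much cruder embedding), it does not build the eigenvector matrix the abstract mentions, and the session data, file names, and the functions build_vectors, cosine, and predict_next are all hypothetical. It assumes that a "programming behavior" can be modeled as a file a developer touched, and that each developer session is an ordered list of such files.

```python
from collections import Counter, defaultdict
from math import sqrt


def build_vectors(sessions, window=2):
    """Map each item to a sparse co-occurrence vector.

    Assumption: this is a simple stand-in for a word2vec embedding,
    counting which items appear within `window` positions of each other.
    """
    vectors = defaultdict(Counter)
    for session in sessions:
        for i, item in enumerate(session):
            lo = max(0, i - window)
            hi = min(len(session), i + window + 1)
            for j in range(lo, hi):
                if j != i:
                    vectors[item][session[j]] += 1
    return vectors


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[k] * b[k] for k in a if k in b)
    norm_a = sqrt(sum(v * v for v in a.values()))
    norm_b = sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def predict_next(vectors, last_item, top_k=1):
    """Rank candidate next items by similarity to the last observed item."""
    if last_item not in vectors:
        return []
    scores = [
        (cosine(vectors[last_item], vec), other)
        for other, vec in vectors.items()
        if other != last_item
    ]
    # Highest similarity first; ties are broken alphabetically.
    scores.sort(key=lambda pair: (-pair[0], pair[1]))
    return [other for _, other in scores[:top_k]]


# Hypothetical toy history: each inner list is one developer session.
sessions = [
    ["parser.py", "lexer.py", "test_parser.py"],
    ["parser.py", "lexer.py", "docs.md"],
    ["ui.js", "style.css", "index.html"],
]
vectors = build_vectors(sessions)
print(predict_next(vectors, "ui.js", top_k=2))
```

In this toy setup, candidates are ranked purely by how similar their usage contexts are to the last touched file; a real word2vec model would learn dense vectors from far more history and would likely rank differently.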
Pages: 430-439
Number of pages: 10