Predicting OSS Development Success: A Data Mining Approach

Cited by: 1
Authors
Raja, Uzma [1]
Tretter, Marietta J. [2]
Affiliations
[1] Univ Alabama, Management Informat Syst, Tuscaloosa, AL 35487 USA
[2] Texas A&M Univ, Dept Informat & Operat Management, College Stn, TX 77843 USA
Keywords
Data Mining; Data Models; Decision Trees; Logistic Regression; Neural Networks; Open Source Software; Software Development;
DOI
10.4018/jismd.2011100102
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Open Source Software (OSS) has reached new levels of sophistication and acceptance by users and commercial software vendors. This research creates, tests, and validates a model for predicting successful development of OSS projects. Widely available archival data was used for OSS projects from SourceForge.net. The data is analyzed with multiple Data Mining techniques. Initially three competing models are created using Logistic Regression, Decision Trees and Neural Networks. These models are compared for precision and are refined in several phases. Text Mining is used to create new variables that improve the predictive power of the models. The final model is chosen based on best fit to separate training and validation data sets and the ability to explain the relationship among variables. Model robustness is determined by testing it on a new dataset extracted from the SourceForge (SF) repository. The results indicate that end-user involvement, project age, functionality, usage, project management techniques, project type and team communication methods have a significant impact on the development of OSS projects.
Pages: 27-48
Page count: 22
Related papers
  • [1] A Data mining Technique for Analyzing and Predicting the success of Movie
    Meenakshi, K.
    Maragatham, G.
    Agarwal, Neha
    Ghosh, Ishitha
    [J]. PROCEEDINGS OF THE 10TH NATIONAL CONFERENCE ON MATHEMATICAL TECHNIQUES AND ITS APPLICATIONS (NCMTA 18), 2018, 1000
  • [2] Data Mining Approach for Making Prediction of Students Success
    Osmanbegovic, Edin
    Agic, Haris
    Suljic, Mirza
    [J]. PROCEEDINGS OF THE 2013 INTERNATIONAL CONFERENCE ON ADVANCED ICT AND EDUCATION, 2013, 33 : 722 - 726
  • [3] Predicting retail business success using urban social data mining
    Papadimitriou, Georgios
    Komninos, Andreas
    Garofalakis, John
    [J]. JOURNAL OF AMBIENT INTELLIGENCE AND SMART ENVIRONMENTS, 2020, 12 (03) : 263 - 277
  • [4] Predicting Resurgery in Intensive Care - A data Mining Approach
    Peixoto, Ricardo
    Ribeiro, Lisete
    Portela, Filipe
    Santos, Manuel Filipe
    Rua, Fernando
    [J]. 8TH INTERNATIONAL CONFERENCE ON EMERGING UBIQUITOUS SYSTEMS AND PERVASIVE NETWORKS (EUSPN 2017) / 7TH INTERNATIONAL CONFERENCE ON CURRENT AND FUTURE TRENDS OF INFORMATION AND COMMUNICATION TECHNOLOGIES IN HEALTHCARE (ICTH-2017) / AFFILIATED WORKSHOPS, 2017, 113 : 577 - 584
  • [5] Predicting Colorectal Cancer Survival: A Data Mining Approach
    Hosseini, Nooshin
    [J]. GASTROENTEROLOGY, 2014, 146 (05) : S688 - S689
  • [6] An Approach for Predicting Employee Churn by Using Data Mining
    Yigit, Ibrahim Onuralp
    Shourabizadeh, Hamed
    [J]. 2017 INTERNATIONAL ARTIFICIAL INTELLIGENCE AND DATA PROCESSING SYMPOSIUM (IDAP), 2017
  • [7] Predicting arterial breakdown probability: A data mining approach
    Iqbal, Md Shahadat
    Hadi, Mohammed
    Xiao, Yan
    [J]. JOURNAL OF INTELLIGENT TRANSPORTATION SYSTEMS, 2017, 21 (03) : 190 - 201
  • [8] Application of Data Mining Techniques for Predicting Student Success in English Exit Exam
    Puarungroj, Wichai
    Boonsirisumpun, Narong
    Pongpatrakant, Pathapong
    Phromkhot, Suchada
    [J]. PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON UBIQUITOUS INFORMATION MANAGEMENT AND COMMUNICATION (IMCOM 2018), 2018
  • [9] ALTERNATIVE APPROACH TO USING BIOGRAPHICAL DATA PREDICTING JOB SUCCESS
    MATTESON, MT
    [J]. JOURNAL OF OCCUPATIONAL PSYCHOLOGY, 1978, 51 (02) : 155 - 162
  • [10] A supervised data mining approach for predicting comment card ratings
    Tanrisevdi, Abdullah
    Ozturk, Gozde
    Ozturk, Ahmet Cumhur
    [J]. INTERNATIONAL JOURNAL OF CONTEMPORARY HOSPITALITY MANAGEMENT, 2022, 34 (05) : 1823 - 1853