Automatic Classification of Software Artifacts in Open-Source Applications

被引:19
|
作者
Ma, Yuzhan [1 ]
Fakhoury, Sarah [1 ]
Christensen, Michael [1 ]
Arnaoudova, Venera [1 ]
Zogaan, Waleed [2 ]
Mirakhorli, Mehdi [2 ]
机构
[1] Washington State Univ, Sch Elect Engn & Comp Sci, Pullman, WA 99164 USA
[2] Rochester Inst Technol, Dept Software Engn, Rochester, NY 14623 USA
关键词
Open-source software; machine learning; software artifacts; TRACEABILITY;
D O I
10.1145/3196398.3196446
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the increasing popularity of open-source software development, there is a tremendous growth of software artifacts that provide insight into how people build software. Researchers are always looking for large-scale and representative software artifacts to produce systematic and unbiased validation of novel and existing techniques. For example, in the domain of software requirements traceability, researchers often use software applications with multiple types of artifacts, such as requirements, system elements, verifications, or tasks to develop and evaluate their traceability analysis techniques. However, the manual identification of rich software artifacts is very labor-intensive. In this work, we first conduct a large-scale study to identify which types of software artifacts are produced by a wide variety of open-source projects at different levels of granularity. Then we propose an automated approach based on Machine Learning techniques to identify various types of software artifacts. Through a set of experiments, we report and compare the performance of these algorithms when applied to software artifacts.
引用
收藏
页码:414 / 425
页数:12
相关论文
共 50 条
  • [1] GRAPHOS - open-source software for photogrammetric applications
    Gonzalez-Aguilera, D.
    Lopez-Fernandez, L.
    Rodriguez-Gonzalvez, P.
    Hernandez-Lopez, D.
    Guerrero, D.
    Remondino, F.
    Menna, F.
    Nocerino, E.
    Toschi, I.
    Ballabeni, A.
    Gaiani, M.
    [J]. PHOTOGRAMMETRIC RECORD, 2018, 33 (161): : 11 - 29
  • [2] Treat societally impactful scientific insights as open-source software artifacts
    Liem, Cynthia C. S.
    Demetriou, Andrew M.
    [J]. 2023 IEEE/ACM 45TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING-SOFTWARE ENGINEERING IN SOCIETY, ICSE-SEIS, 2023, : 150 - 156
  • [3] An open-source software tool for compact modeling applications
    Lemaitre, L
    McAndrew, C
    [J]. IEEE CIRCUITS & DEVICES, 2004, 20 (02): : 6 - +
  • [4] Open-source software - Introduction
    Sabbah, D
    Frye, D
    [J]. IBM SYSTEMS JOURNAL, 2005, 44 (02)
  • [5] Open-source bioinformatics software
    Vlagioiu, Constantin
    Vuta, Vlad
    Barbuceanu, Florica
    Predoi, Gabriel
    Tudor, Nicolae
    [J]. JOURNAL OF BIOTECHNOLOGY, 2017, 256 : S53 - S53
  • [6] Open-source software for repositories
    Vasilyeva, Natalya V.
    [J]. NAUCHNYE I TEKHNICHESKIE BIBLIOTEKI-SCIENTIFIC AND TECHNICAL LIBRARIES, 2023, (03): : 102 - 119
  • [7] Robust open-source software
    Neumann, PG
    [J]. COMMUNICATIONS OF THE ACM, 1999, 42 (02) : 128 - 128
  • [8] OPEN-SOURCE SOFTWARE IN ROBOTICS
    Timoftei, Sanda
    Brad, Emilia
    Sarb, Anca
    Stan, Ovidiu
    [J]. ACTA TECHNICA NAPOCENSIS SERIES-APPLIED MATHEMATICS MECHANICS AND ENGINEERING, 2018, 61 (03): : 519 - 526
  • [9] Optimizing the use of open-source software applications in drug discovery
    Geldenhuys, WJ
    Gaasch, KE
    Watson, M
    Allen, DD
    Van der Schyf, CJ
    [J]. DRUG DISCOVERY TODAY, 2006, 11 (3-4) : 127 - 132
  • [10] Open-source software platform for medical image segmentation applications
    Namias, R.
    D'Amato, J. P.
    del Fresno, M.
    [J]. 13TH INTERNATIONAL CONFERENCE ON MEDICAL INFORMATION PROCESSING AND ANALYSIS, 2017, 10572