Large-Scale Dataset of Local Java']Java Software Build Results

被引:7
|
作者
Sulir, Matus [1 ]
Bacikova, Michaela [1 ]
Madeja, Matej [1 ]
Chodarev, Sergej [1 ]
Juhar, Jan [1 ]
机构
[1] Tech Univ Kosice, Fac Elect Engn & Informat, Dept Comp & Informat, Letna 9, Kosice 04200, Slovakia
关键词
build tool; program compilation; failure; Ant; Maven; Gradle;
D O I
10.3390/data5030086
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
When a person decides to inspect or modify a third-party software project, the first necessary step is its successful compilation from source code using a build system. However, such attempts often end in failure. In this data descriptor paper, we provide a dataset of build results of open source Java software systems. We tried to automatically build a large number of Java projects from GitHub using their Maven, Gradle, and Ant build scripts in a Docker container simulating a standard programmer's environment. The dataset consists of the output of two executions: 7264 build logs from a study executed in 2016 and 7233 logs from the 2020 execution. In addition to the logs, we collected exit codes, file counts, and various project metadata. The proportion of failed builds in our dataset is 38% in the 2016 execution and 59% in the 2020 execution. The published data can be helpful for multiple purposes, such as correlation analysis of factors affecting build success, build failure prediction, and research in the area of build breakage repair.
引用
收藏
页码:1 / 11
页数:11
相关论文
共 50 条
  • [1] SourcererJBF: A Java']Java Build Framework For Large-Scale Compilation
    Misu, Md Rakib Hossain
    Achar, Rohan
    Lopes, Cristina V.
    [J]. ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, 2024, 33 (03)
  • [2] Large-scale image deblurring in Java']Java
    Wendykier, Piotr
    Nagy, James G.
    [J]. COMPUTATIONAL SCIENCE - ICCS 2008, PT 1, 2008, 5101 : 721 - 730
  • [3] Large-scale characterization of Java']Java streams
    Rosales, Eduardo
    Basso, Matteo
    Rosa, Andrea
    Binder, Walter
    [J]. SOFTWARE-PRACTICE & EXPERIENCE, 2023, 53 (09): : 1763 - 1792
  • [4] Java']Java for large-scale scientific computations?
    Krall, A
    Tomsich, P
    [J]. LARGE-SCALE SCIENTIFIC COMPUTING, 2001, 2179 : 228 - 235
  • [5] Java']Java communications for large-scale parallel computing
    Getov, V
    Philippsen, M
    [J]. LARGE-SCALE SCIENTIFIC COMPUTING, 2001, 2179 : 33 - 45
  • [6] Bugs.jar: A Large-scale, Diverse Dataset of Real-world Java']Java Bugs
    Saha, Ripon K.
    Lyu, Yingjun
    Lam, Wing
    Yoshida, Hiroaki
    Prasad, Mukul R.
    [J]. 2018 IEEE/ACM 15TH INTERNATIONAL CONFERENCE ON MINING SOFTWARE REPOSITORIES (MSR), 2018, : 10 - 13
  • [7] Automatic repair of real bugs in java']java: a large-scale experiment on the defects4j dataset
    Martinez, Matias
    Durieux, Thomas
    Sommerard, Romain
    Xuan, Jifeng
    Monperrus, Martin
    [J]. EMPIRICAL SOFTWARE ENGINEERING, 2017, 22 (04) : 1936 - 1964
  • [8] APINetworks Java']Java. A Java']Java approach to the efficient treatment of large-scale complex networks
    Munoz-Caro, Camelia
    Nino, Alfonso
    Reyes, Sebastian
    Castillo, Miriam
    [J]. COMPUTER PHYSICS COMMUNICATIONS, 2016, 207 : 549 - 552
  • [9] Large-scale parallel geophysical algorithms in Java']Java: a feasibility study
    Jacob, M
    Philippsen, M
    Karrenbach, M
    [J]. CONCURRENCY-PRACTICE AND EXPERIENCE, 1998, 10 (11-13): : 1143 - 1153
  • [10] A large-scale study on the usage of Java']Java's concurrent programming constructs
    Pinto, Gustavo
    Torres, Weslley
    Fernandes, Benito
    Castor, Fernando
    Barros, Roberto S. M.
    [J]. JOURNAL OF SYSTEMS AND SOFTWARE, 2015, 106 : 59 - 81