A Large-Scale Empirical Study of Open Source License Usage: Practices and Challenges

被引:0
|
作者
Wu, Jiaqi [1 ]
Bao, Lingfeng [1 ]
Yang, Xiaohu [1 ]
Xia, Xin [2 ]
Hu, Xing [1 ]
机构
[1] Zhejiang Univ, State Key Lab Blockchain & Data Secur, Hangzhou, Zhejiang, Peoples R China
[2] Huawei, Beijing, Peoples R China
基金
美国国家科学基金会;
关键词
OSS Licenses; Empirical Study; Package Management Platform;
D O I
10.1145/3643991.3644900
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The popularity of open source software (OSS) has led to a significant increase in the number of available licenses, each with their own set of terms and conditions. This proliferation of licenses has made it increasingly challenging for developers to select an appropriate license for their projects and to ensure that they are complying with the terms of those licenses. As a result, there is a need for empirical studies to identify current practices and challenges in license usage, both to help developers make informed decisions about license selection and to ensure that OSS is being used and distributed in a legal and ethical manner. Moreover, the development of new licenses might be required to better meet the needs of the open source community and address emerging legal issues. In this paper, we conduct a large-scale empirical study of license usage across five package management platforms, i.e., Maven, NPM, PyPI, RubyGems, and Cargo. Our objective is to examine the current trends and potential issues in license usage of the OSS community. In total, we analyze the licenses of 33,710,877 packages across the selected five platforms. We statistically analyze licenses in package management platforms from multiple perspectives, e.g., license usage, license incompatibility, license updates, and license evolution. Moreover, we conduct a comparative study of various aspects of core packages and common packages in these platforms. Our results reveal irregularities in license names and license incompatibilities that require attention. We observe both similarities and differences in license usage across the five platforms, with Cargo being the most standardized among them. Finally, we discuss some implications for actions based on our findings.
引用
收藏
页码:595 / 606
页数:12
相关论文
共 50 条
  • [1] License usage and changes: a large-scale study on gitHub
    Vendome, Christopher
    Bavota, Gabriele
    Di Penta, Massimiliano
    Linares-Vasquez, Mario
    German, Daniel
    Poshyvanyk, Denys
    [J]. EMPIRICAL SOFTWARE ENGINEERING, 2017, 22 (03) : 1537 - 1577
  • [2] License usage and changes: a large-scale study on gitHub
    Christopher Vendome
    Gabriele Bavota
    Massimiliano Di Penta
    Mario Linares-Vásquez
    Daniel German
    Denys Poshyvanyk
    [J]. Empirical Software Engineering, 2017, 22 : 1537 - 1577
  • [3] A Large-scale Dataset of (Open Source) License Text Variants
    Zacchiroli, Stefano
    [J]. 2022 MINING SOFTWARE REPOSITORIES CONFERENCE (MSR 2022), 2022, : 757 - 761
  • [4] A Method to Detect License Inconsistencies in Large-Scale Open Source Projects
    Wu, Yuhao
    Manabe, Yuki
    Kanda, Tetsuya
    German, Daniel M.
    Inoue, Katsuro
    [J]. 12TH WORKING CONFERENCE ON MINING SOFTWARE REPOSITORIES (MSR 2015), 2015, : 324 - 333
  • [5] A Large-Scale Study of MPI Usage in Open-Source HPC Applications
    Laguna, Ignacio
    Marshall, Ryan
    Mohror, Kathryn
    Ruefenacht, Martin
    Skjellum, Anthony
    Sultana, Nawrin
    [J]. PROCEEDINGS OF SC19: THE INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2019,
  • [6] A Large Scale Study of License Usage on GitHub
    Vendome, Christopher
    [J]. 2015 IEEE/ACM 37TH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, VOL 2, 2015, : 772 - 774
  • [7] License Usage and Changes: A Large-Scale Study of Java']Java Projects on GitHub
    Vendome, Christopher
    Linares-Vasquez, Mario
    Bavota, Gabriele
    Di Penta, Massimiliano
    German, Daniel
    Poshyvanyk, Denys
    [J]. 2015 IEEE 23RD INTERNATIONAL CONFERENCE ON PROGRAM COMPREHENSION ICPC 2015, 2015, : 218 - 228
  • [8] Test Automation in Open-Source Android Apps: A Large-Scale Empirical Study
    Lin, Jun-Wei
    Salehnamadi, Navid
    Malek, Sam
    [J]. 2020 35TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING (ASE 2020), 2020, : 1078 - 1089
  • [9] An Empirical Study of License Violations in Open Source Projects
    Mathur, Arunesh
    Choudhary, Harshal
    Vashist, Priyank
    Thies, William
    Thilagam, Santhi
    [J]. PROCEEDINGS OF THE 2012 IEEE 35TH SOFTWARE ENGINEERING WORKSHOP (SEW 2012), 2012, : 168 - 176
  • [10] A Large-Scale Empirical Study of Real-Life Performance Issues in Open Source Projects
    Zhao, Yutong
    Xiao, Lu
    Bondi, Andre B.
    Chen, Bihuan
    Liu, Yang
    [J]. IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2023, 49 (02) : 924 - 946