A Method to Detect License Inconsistencies in Large-Scale Open Source Projects

被引:17
|
作者
Wu, Yuhao [1 ]
Manabe, Yuki [2 ]
Kanda, Tetsuya [1 ]
German, Daniel M. [3 ]
Inoue, Katsuro [1 ]
机构
[1] Osaka Univ, Grad Sch Informat Sci & Technol, Suita, Osaka 565, Japan
[2] Kumamoto Univ, Grad Sch Sci & Technol, Kumamoto 860, Japan
[3] Univ Victoria, Dept Comp Sci, Victoria, BC V8W 2Y2, Canada
关键词
D O I
10.1109/MSR.2015.37
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The reuse of free and open source software (FOSS) components is becoming more and more popular. They usually contain one or more software licenses describing the requirements and conditions which should be followed when been reused. Licenses are usually written in the header of source code files as program comments. Removing or modifying the license header by re-distributors will result in the inconsistency of license with its ancestor, and may potentially cause license infringement. But to the best of our knowledge, no research has been devoted to investigate such kind of license infringements nor license inconsistencies. In this paper, we describe and categorize different types of license inconsistencies and propose a feasible method to detect them. Then we apply this method to Debian 7.5 and present the license inconsistencies found in it. With a manual analysis, we summarized various reasons behind these license inconsistencies, some of which imply license infringement and require the attention from the developers. This analysis also exposes the difficulty to discover license infringements, highlighting the usefulness of finding and maintaining source code provenance.
引用
收藏
页码:324 / 333
页数:10
相关论文
共 50 条
  • [1] A Large-scale Dataset of (Open Source) License Text Variants
    Zacchiroli, Stefano
    [J]. 2022 MINING SOFTWARE REPOSITORIES CONFERENCE (MSR 2022), 2022, : 757 - 761
  • [2] Open Source License Inconsistencies on GitHub
    Wolter, Thomas
    Barcomb, Ann
    Riehle, Dirk
    Harutyunyan, Nikolay
    [J]. ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, 2023, 32 (05)
  • [3] Software evolution in open source projects - a large-scale investigation
    Koch, Stefan
    [J]. JOURNAL OF SOFTWARE MAINTENANCE AND EVOLUTION-RESEARCH AND PRACTICE, 2007, 19 (06): : 361 - 382
  • [4] A Large-Scale Empirical Study of Open Source License Usage: Practices and Challenges
    Wu, Jiaqi
    Bao, Lingfeng
    Yang, Xiaohu
    Xia, Xin
    Hu, Xing
    [J]. 2024 IEEE/ACM 21ST INTERNATIONAL CONFERENCE ON MINING SOFTWARE REPOSITORIES, MSR, 2024, : 595 - 606
  • [5] Analysis of license inconsistency in large collections of open source projects
    Wu, Yuhao
    Manabe, Yuki
    Kanda, Tetsuya
    German, Daniel M.
    Inoue, Katsuro
    [J]. EMPIRICAL SOFTWARE ENGINEERING, 2017, 22 (03) : 1194 - 1222
  • [6] Analysis of license inconsistency in large collections of open source projects
    Yuhao Wu
    Yuki Manabe
    Tetsuya Kanda
    Daniel M. German
    Katsuro Inoue
    [J]. Empirical Software Engineering, 2017, 22 : 1194 - 1222
  • [7] Evaluating Maintainability Prejudices with a Large-Scale Study of Open-Source Projects
    Roehm, Tobias
    Veihelmann, Daniel
    Wagner, Stefan
    Juergens, Elmar
    [J]. SOFTWARE QUALITY: THE COMPLEXITY AND CHALLENGES OF SOFTWARE ENGINEERING AND SOFTWARE QUALITY IN THE CLOUD, 2019, 338 : 151 - 171
  • [8] VisRepo: A Visual Retrieval Tool for Large-Scale Open-Source Projects
    Yue, Xiaoqi
    Liu, Chao
    Zhang, Neng
    Hu, Haibo
    Zhang, Xiaohong
    [J]. PROCEEDINGS OF THE 15TH ASIA-PACIFIC SYMPOSIUM ON INTERNETWARE, INTERNETWARE 2024, 2024, : 499 - 502
  • [9] A large-scale empirical exploration on refactoring activities in open source software projects
    Vassallo, Carmine
    Grano, Giovanni
    Palomba, Fabio
    Gall, Harald C.
    Bacchelli, Alberto
    [J]. SCIENCE OF COMPUTER PROGRAMMING, 2019, 180 : 1 - 15
  • [10] Code Coverage and Postrelease Defects: A Large-Scale Study on Open Source Projects
    Kochhar, Pavneet Singh
    Lo, David
    Lawall, Julia
    Nagappan, Nachiappan
    [J]. IEEE TRANSACTIONS ON RELIABILITY, 2017, 66 (04) : 1213 - 1228