ON THE DISTRIBUTION OF SOURCE CODE FILE SIZES

Cited by: 0
Authors
Herraiz, Israel [1]
German, Daniel M. [2]
Hassan, Ahmed E. [3]
Affiliations
[1] Tech Univ Madrid, Madrid, Spain
[2] Univ Victoria, Victoria, BC, Canada
[3] Queens Univ, Kingston, ON, Canada
Keywords
Mining software repositories; Software size estimation; Open source
DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Source code size is an estimator of software effort. Size is also often used to calibrate models and equations that estimate the cost of software. The distribution of source code file sizes has been reported in the literature to be lognormal. In this paper, we measure the size of a large collection of software (the Debian GNU/Linux distribution, version 5.0.2) and find that its source code file sizes follow a double Pareto distribution. This means that large files occur more often than the lognormal distribution predicts, and therefore the previously proposed models underestimate the cost of software.
Pages: 5-14
Page count: 10
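
The abstract's central claim, that a power-law tail places more probability on very large files than a lognormal does, can be illustrated with a minimal sketch. All parameters below (`MU`, `SIGMA`, `X_MIN`, `ALPHA`) are hypothetical values chosen for illustration and are not taken from the paper. The plain Pareto survival function stands in for the upper tail only; it is a simplification, not the double Pareto model the authors fit.

```python
import math


def lognormal_sf(x, mu, sigma):
    """P(X > x) for a lognormal with log-mean mu and log-std sigma."""
    z = (math.log(x) - mu) / (sigma * math.sqrt(2))
    return 0.5 * math.erfc(z)


def pareto_sf(x, x_min, alpha):
    """P(X > x) for a Pareto (power-law) tail with scale x_min and exponent alpha."""
    if x < x_min:
        return 1.0
    return (x_min / x) ** alpha


# Hypothetical parameters, for illustration only (not from the paper).
MU, SIGMA = math.log(100), 1.0   # lognormal with a median of 100 lines
X_MIN, ALPHA = 100.0, 1.5        # power-law tail starting at 100 lines

for size in (1_000, 10_000, 100_000):
    ln = lognormal_sf(size, MU, SIGMA)
    pl = pareto_sf(size, X_MIN, ALPHA)
    print(f"{size:>7} lines: lognormal {ln:.2e}, power law {pl:.2e}, ratio {pl / ln:.1f}")
```

Under these assumed parameters, the ratio of the two tail probabilities grows as the file size increases, which is the sense in which a lognormal-based model would undercount large files.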