Repositories with public data about software development

被引:1
|
作者
Gonzalez-Barahona, Jesus M. [1 ]
Izquierdo-Cortazar, Daniel [1 ]
Squire, Megan [2 ]
机构
[1] Universidad Rey Juan Carlos, Spain
[2] Elon University, United States
基金
美国国家科学基金会;
关键词
Codes (symbols) - Open source software - Software design;
D O I
10.4018/jossp.2010040101
中图分类号
学科分类号
摘要
Empirical research on software development based on data obtained from project repositories and code forges is increasingly gaining attention in the software engineering research community. The studies in this area typically start by retrieving or monitoring some subset of data found in the repository or forge, and this data is later analyzed to find interesting patterns. However, retrieving information from these locations can be a challenging task. Meta-repositories providing public information about software development are useful tools that can simplify and streamline the research process. Public data repositories that collect and clean the data from other project repositories or code forges can help ensure that research studies are based on good quality data. This paper provides some insight as to how these meta-repositories (sometimes called a repository of repositories, RoR) of data about open source projects should be used to help researchers. This paper describes in detail two of the most widely used collections of data about software development: FLOSSmole and FLOSSMetrics. © 2010, IGI Global.
引用
收藏
页码:1 / 13
相关论文
共 50 条
  • [1] The promise of public software engineering data repositories
    Cukic, B
    [J]. IEEE SOFTWARE, 2005, 22 (06) : 20 - 22
  • [2] Library adoption in public software repositories
    Krohn, Rachel
    Weninger, Tim
    [J]. JOURNAL OF BIG DATA, 2019, 6 (01)
  • [3] Library adoption in public software repositories
    Rachel Krohn
    Tim Weninger
    [J]. Journal of Big Data, 6
  • [4] 4th International Workshop on Public Data about Software Development
    Gonzalez-Barahona, Jesus M.
    Squire, Megan
    Izquierdo-Cortazar, Daniel
    [J]. OPEN SOURCE ECOSYSTEMS-DIVERSE COMMUNITIES INTERACTING, 2009, 299 : 351 - +
  • [5] On Mining Data across Software Repositories
    Anbalagan, Prasanth
    Vouk, Mladen
    [J]. 2009 6TH IEEE INTERNATIONAL WORKING CONFERENCE ON MINING SOFTWARE REPOSITORIES, 2009, : 171 - 174
  • [6] WoPDaSD 2010: 5th Workshop on Public Data about Software Development
    Gonzalez-Barahona, Jesus M.
    Squire, Megan
    Izquierdo-Cortazar, Daniel
    [J]. OPEN SOURCE SOFTWARE: NEW HORIZONS, 2010, 319 : 421 - +
  • [7] Submission of microarray data to public repositories
    Ball, CA
    Brazma, A
    Causton, H
    Chervitz, S
    Edgar, R
    Hingamp, P
    Matese, JC
    Parkinson, H
    Quackenbush, J
    Ringwald, M
    Sansone, SA
    Sherlock, G
    Spellman, P
    Stoeckert, C
    Tateno, Y
    Taylor, R
    White, J
    Winegarden, N
    [J]. PLOS BIOLOGY, 2004, 2 (09) : 1276 - 1277
  • [8] Combining Software Interrelationship Data across Heterogeneous Software Repositories
    Ilo, Nikola
    Grabner, Johann
    Artner, Thomas
    Bernhart, Mario
    Grechenig, Thomas
    [J]. 2015 31ST INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE AND EVOLUTION (ICSME) PROCEEDINGS, 2015, : 571 - 575
  • [9] Visual data mining and analysis of software repositories
    Voinea, Lucian
    Telea, Alexandru
    [J]. COMPUTERS & GRAPHICS-UK, 2007, 31 (03): : 410 - 428
  • [10] A method and software framework for enriching private biomedical sources with data from public online repositories
    Anguita, Alberto
    Garcia-Remesal, Miguel
    Graf, Norbert
    Maojo, Victor
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2016, 60 : 177 - 186