Using Language-Based Search in Mining Large Software Repositories

Cited by: 1
Authors
Abu Bakar, Normi Sham Awang [1]
Affiliation
[1] Int Islamic Univ Malaysia, Kuala Lumpur 53100, Malaysia
Keywords
Data retrieval; Software repository; Language-based search; Automation; Software quality
DOI
10.1016/j.sbspro.2011.10.594
Chinese Library Classification number
H0 [Linguistics];
Discipline classification code
030303 ; 0501 ; 050102 ;
Abstract
The language component plays an important role in data and information retrieval. In software engineering, data retrieval is often hindered by the difficulty of obtaining data from commercial software. The emergence of open source repositories has contributed greatly to the collection of software data. This paper describes a data retrieval method for mining software data from SourceForge, a large open source software repository. To automate data retrieval from the repository, a parser based on a pattern matching algorithm was written in the Python programming language. The retrieved data were later used to estimate the quality of the open source software. (C) 2011 Published by Elsevier Ltd. Selection and/or peer-review under responsibility of PACLING Organizing Committee.
Pages: 160-168
Number of pages: 9
Related papers
50 in total
  • [31] Cohort Studies for Mining Software Repositories
    Saarimaki, Nyyti
    Vegas, Sira
    Lenarduzzi, Valentina
    Taibi, Davide
    Robredo, Mikel
    2024 IEEE/ACM 21ST INTERNATIONAL CONFERENCE ON MINING SOFTWARE REPOSITORIES, MSR, 2024, : 569 - 570
  • [32] The Road Ahead for Mining Software Repositories
    Hassan, Ahmed E.
    2008 FRONTIERS OF SOFTWARE MAINTENANCE, 2008, : 48 - 57
  • [33] Mining software repositories for traceability links
    Kagdi, Huzefa
    Maletic, Jonathan I.
    Sharif, Bonita
    ICPC 2007: 15TH IEEE INTERNATIONAL CONFERENCE ON PROGRAM COMPREHENSION, PROCEEDINGS, 2007, : 145 - +
  • [34] Guest editorial: mining software repositories
    Martin Pinzger
    Sunghun Kim
    Empirical Software Engineering, 2016, 21 : 2033 - 2034
  • [35] On Mining Data across Software Repositories
    Anbalagan, Prasanth
    Vouk, Mladen
    2009 6TH IEEE INTERNATIONAL WORKING CONFERENCE ON MINING SOFTWARE REPOSITORIES, 2009, : 171 - 174
  • [36] Guest Editorial: Mining software repositories
    Romain Robbes
    Yasutaka Kamei
    Martin Pinzger
    Empirical Software Engineering, 2017, 22 : 1143 - 1145
  • [37] Mining Software Repositories - A Comparative Analysis
    Olatunji, Sunday O.
    Idrees, Syed U.
    Al-Ghamdi, Yasser S.
    Al-Ghamdi, Jarallah Saleh Ali
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2010, 10 (08): : 161 - 174
  • [38] Mining Software Repositories for Accurate Authorship
    Meng, Xiaozhu
    Miller, Barton P.
    Williams, William R.
    Bernat, Andrew R.
    2013 29TH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE (ICSM), 2013, : 250 - 259
  • [39] A process to mining issues of Software Repositories
    Bautista, Ana Maria
    San Feliu, Tomas
    2015 10TH IBERIAN CONFERENCE ON INFORMATION SYSTEMS AND TECHNOLOGIES (CISTI), 2015,
  • [40] Guest Editorial: Mining software repositories
    Robbes, Romain
    Kamei, Yasutaka
    Pinzger, Martin
    EMPIRICAL SOFTWARE ENGINEERING, 2017, 22 (03) : 1143 - 1145