Using Language-Based Search in Mining Large Software Repositories

Cited by: 1
Authors
Abu Bakar, Normi Sham Awang [1]
Affiliation
[1] Int Islamic Univ Malaysia, Kuala Lumpur 53100, Malaysia
Keywords
Data retrieval; Software repository; Language-based search; Automation; Software quality
DOI
10.1016/j.sbspro.2011.10.594
Chinese Library Classification number
H0 [Linguistics]
Discipline classification codes
030303; 0501; 050102
Abstract
Language plays an important role in data and information retrieval. Data retrieval in software engineering is often hindered by the difficulty of obtaining data from commercial software. The emergence of open source repositories has contributed tremendously to the collection of software data. This paper describes a data retrieval method for mining software data from a vast open source software repository, SourceForge. To automate data retrieval from the repository, a parser based on a pattern matching algorithm was written in the Python programming language. The retrieved data were later used to estimate the quality of the open source software. © 2011 Published by Elsevier Ltd. Selection and/or peer-review under responsibility of PACLING Organizing Committee.
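The abstract gives no details of the parser itself, so the following is only a minimal sketch of what a pattern-matching parser in Python might look like. The HTML fragment, the field names (`name`, `downloads`, `open_bugs`), the regular expressions, and the helper functions are all hypothetical and are not taken from the paper or from SourceForge's actual page markup.

```python
import re

# Hypothetical HTML fragment; the real SourceForge markup is not described in the abstract.
SAMPLE_PAGE = """
<h1 class="project-name">Example Project</h1>
<span class="downloads">1,234</span>
<span class="bugs-open">7</span>
"""

# Hypothetical field patterns: one regular expression per value to extract.
FIELD_PATTERNS = {
    "name": re.compile(r'<h1 class="project-name">(.*?)</h1>'),
    "downloads": re.compile(r'<span class="downloads">([\d,]+)</span>'),
    "open_bugs": re.compile(r'<span class="bugs-open">(\d+)</span>'),
}


def parse_project_page(html):
    """Return a dict of extracted fields; a field with no match maps to None."""
    record = {}
    for field, pattern in FIELD_PATTERNS.items():
        match = pattern.search(html)
        record[field] = match.group(1) if match else None
    return record


def to_int(text):
    """Convert a digit string such as '1,234' to an int, or return None if absent."""
    return int(text.replace(",", "")) if text is not None else None


record = parse_project_page(SAMPLE_PAGE)
record["downloads"] = to_int(record["downloads"])
record["open_bugs"] = to_int(record["open_bugs"])
print(record)
```

Keeping one pattern per field means a change in the page layout only requires updating the affected expression, and a missing field yields `None` rather than an exception, which matters when crawling many project pages unattended.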
Pages: 160-168
Page count: 9
Related papers
50 in total
  • [1] Language-Based Access to Large Sensor Repositories
    Baumann, Peter
    SENSORS, 2009, 9 (04): 2926-2943
  • [2] Language-Based Software Testing
    Steinhofel, Dominic
    Zeller, Andreas
    COMMUNICATIONS OF THE ACM, 2024, 67 (04): 77-81
  • [3] Language-based software engineering
    Gupta, Gopal
    SCIENCE OF COMPUTER PROGRAMMING, 2015, 97: 37-40
  • [4] Using Pig as a data preparation language for large-scale mining software repositories studies: An experience report
    Shang, Weiyi
    Adams, Bram
    Hassan, Ahmed E.
    JOURNAL OF SYSTEMS AND SOFTWARE, 2012, 85 (10): 2195-2204
  • [5] Language-based approaches to software measurement
    Cogan, BI
    Hunter, RB
    PROCEEDINGS OF THE 3RD INTERNATIONAL SOFTWARE METRICS SYMPOSIUM, 1996: 3-9
  • [6] Software Process Simulation based on Mining Software Repositories
    Honsel, Verena
    Honsel, Daniel
    Grabowski, Jens
    2014 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOP (ICDMW), 2014: 828-831
  • [7] Mining software repositories
    1600, Japan Society for Software Science and Technology (30):
  • [8] Unsupervised software repositories mining and its application to code search
    Hu, Gang
    Peng, Min
    Zhang, Yihan
    Xie, Qianqian
    Gao, Wang
    Yuan, Mengting
    SOFTWARE-PRACTICE & EXPERIENCE, 2020, 50 (03): 299-322
  • [9] Mining Software Repositories Using Topic Models
    Thomas, Stephen W.
    2011 33RD INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE), 2011: 1138-1139
  • [10] Synthesis of Benchmarks for the C Programming Language by Mining Software Repositories
    Guimaraes, Breno C. F.
    Magalhaes, Jose Wesley de S.
    da Silva, Anderson Faustino
    Pereira, Fernando M. Q.
    XXIII BRAZILIAN SYMPOSIUM ON PROGRAMMING LANGUAGES, 2019: 62-69