Using Language-Based Search in Mining Large Software Repositories

Cited by: 1
Author
Abu Bakar, Normi Sham Awang [1]
Affiliation
[1] Int Islamic Univ Malaysia, Kuala Lumpur 53100, Malaysia
Keywords
Data retrieval; Software repository; Language-based search; Automation; Software quality
DOI
10.1016/j.sbspro.2011.10.594
Chinese Library Classification number
H0 [Linguistics]
Discipline classification codes
030303; 0501; 050102
Abstract
The language component plays an important role in data and information retrieval. Data retrieval in software engineering is often hindered by the difficulty of obtaining data from commercial software. The emergence of open source repositories has contributed tremendously to the collection of software data. This paper highlights a data retrieval method for mining software data from a vast open source software repository, SourceForge. To automate data retrieval from the repository, a parser was written in the Python programming language, based on a pattern matching algorithm. The retrieved data were later used to estimate the quality of the open source software. (C) 2011 Published by Elsevier Ltd. Selection and/or peer-review under responsibility of PACLING Organizing Committee.
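The abstract does not give the parser's patterns or the page markup it processed, so the following is only a minimal sketch of what a pattern-matching parser in Python might look like. The HTML fragment, the field names, and the function `parse_project_page` are all hypothetical and are not taken from the paper or from SourceForge.

```python
import re

# Hypothetical HTML fragment; the real SourceForge markup is not
# described in the abstract.
SAMPLE_PAGE = """
<div class="project">
  <h1 class="name">ExampleProject</h1>
  <span class="downloads">12,345</span>
  <span class="open-bugs">17</span>
</div>
"""

# Hypothetical field patterns: one regular expression per metric,
# each with a single capture group holding the value.
FIELD_PATTERNS = {
    "name": re.compile(r'<h1 class="name">([^<]+)</h1>'),
    "downloads": re.compile(r'<span class="downloads">([\d,]+)</span>'),
    "open_bugs": re.compile(r'<span class="open-bugs">(\d+)</span>'),
}

NUMERIC_FIELDS = ("downloads", "open_bugs")


def parse_project_page(html):
    """Return a dict of extracted fields; fields not found map to None."""
    record = {}
    for field, pattern in FIELD_PATTERNS.items():
        match = pattern.search(html)
        record[field] = match.group(1).strip() if match else None
    # Convert numeric fields to int, dropping thousands separators.
    for field in NUMERIC_FIELDS:
        if record[field] is not None:
            record[field] = int(record[field].replace(",", ""))
    return record


if __name__ == "__main__":
    print(parse_project_page(SAMPLE_PAGE))
```

Keeping the patterns in a single table means a new metric can be added without touching the extraction loop, which is one plausible way to keep such a parser maintainable when page markup changes.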
Pages: 160-168
Number of pages: 9