Using Language-Based Search in Mining Large Software Repositories

被引:1
|
作者
Abu Bakar, Normi Sham Awang [1 ]
机构
[1] Int Islamic Univ Malaysia, Kuala Lumpur 53100, Malaysia
关键词
Data retrieval; Software repository; Language - based search; Automation; Software quality;
D O I
10.1016/j.sbspro.2011.10.594
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
Language component plays an important role in data/information retrieval. Data retrieval in software engineering is often hindered by the difficulty of getting data from commercial software. The emergence of the open source repositories has contributed tremendously in the collection of software data. This paper highlights the data retrieval method for mining software from a vast open source software repository, SourceForge. For the purpose of automating the data retrieval from the repository, a parser was written using the Python programming language, and based on the pattern matching algorithm. The retrieved data were later used to estimate the quality of the open source software. (C) 2011 Published by Elsevier Ltd. Selection and/or peer-review under responsibility of PACLING Organizing Committee.
引用
收藏
页码:160 / 168
页数:9
相关论文
共 50 条
  • [21] Hybrid Attention Network for Language-Based Person Search
    Li, Yang
    Xu, Huahu
    Xiao, Junsheng
    SENSORS, 2020, 20 (18) : 1 - 23
  • [22] Natural language processing in mining unstructured data from software repositories: a review
    Gupta, Som
    Gupta, S. K.
    SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2019, 44 (12):
  • [23] Natural language processing in mining unstructured data from software repositories: a review
    Som Gupta
    S K Gupta
    Sādhanā, 2019, 44
  • [24] Using Alloy to Support Feature-Based DSL Construction for Mining Software Repositories
    Huang, Changyun
    Kamei, Yasutaka
    Yamashita, Kazuhiro
    Ubayashi, Naoyasu
    PROCEEDINGS OF THE 17TH INTERNATIONAL SOFTWARE PRODUCT LINE CONFERENCE CO-LOCATED WORKSHOPS (SPLC'13 WORKSHOPS), 2013, : 86 - 89
  • [25] Emerging topics in mining software repositories: Machine learning in software repositories and datasets
    Güemes-Peña D.
    López-Nozal C.
    Marticorena-Sánchez R.
    Maudes-Raedo J.
    Progress in Artificial Intelligence, 2018, 7 (3) : 237 - 247
  • [26] MAC: Mining Activity Concepts for Language-based Temporal Localization
    Ge, Runzhou
    Gao, Jiyang
    Chen, Kan
    Nevatia, Ram
    2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019, : 245 - 253
  • [27] Mining the Urdu Language-Based Web Content for Opinion Extraction
    Syed, Afraz Z.
    Martinez-Enriquez, A. M.
    Nazir, Akhzar
    Aslam, Muhammad
    Basit, Rida Hijab
    PATTERN RECOGNITION (MCPR 2017), 2017, 10267 : 244 - 253
  • [28] VIGHUB: a Technology Forecasting Tool based on Mining Software Repositories
    Giovanny Hidalgo-Suarez, Carlos
    Andres Bucheli-Guerrero, Victor
    Armando Ordonez-Eraso, Hugo
    INGE CUC, 2022, 18 (01)
  • [29] Construction of ontology-based software repositories by text mining
    Wu, Yan
    Siy, Harvey
    Zand, Mansour
    Winter, Victor
    COMPUTATIONAL SCIENCE - ICCS 2007, PT 3, PROCEEDINGS, 2007, 4489 : 790 - +
  • [30] Guest editorial: mining software repositories
    Pinzger, Martin
    Kim, Sunghun
    EMPIRICAL SOFTWARE ENGINEERING, 2016, 21 (05) : 2033 - 2034