VisRepo: A Visual Retrieval Tool for Large-Scale Open-Source Projects

被引:0
|
作者
Yue, Xiaoqi [1 ]
Liu, Chao [1 ]
Zhang, Neng [2 ]
Hu, Haibo [1 ]
Zhang, Xiaohong [1 ]
机构
[1] Chongqing Univ, Chongqing, Peoples R China
[2] Sun Yat Sen Univ, Zhuhai, Peoples R China
基金
中国博士后科学基金;
关键词
Open-Source Project Retrieval; Visualization; Software Data Mining;
D O I
10.1145/3671016.3671409
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
To improve software development productivity, developers frequently search for projects on open-source communities such as GitHub. However, it is challenging for users to quickly find suitable projects from numerous results due to the overload of project information. Although many tools have been proposed to rank the relevancy of searched results, manually inspecting them one by one is irreplaceable and time-consuming. To fill this gap, we propose a visual retrieval tool named VisRepo for open-source software projects. Firstly, it mines software project data from four perspectives including topic, technology, usability, and comprehensibility, and connects projects based on the same owners/contributors and similar topics. Then, visualization technique is employed to present complex software data intuitively. VisRepo provides users an interactive retrieval paradigm of Search-Explore-Check-Recommend with in-depth insights and better exploration experience. We evaluate VisRepo on 7w+ open-source JavaScript projects. Experimental results showed that VisRepo outperforms GitHub search engine in terms of time consumption and accuracy, meanwhile enabling a more interactive and useful user experience. Demo Source Code: https://github.com/YUEchn/visrepo Demo Video: https://youtu.be/-fqL8ngSmwQ
引用
收藏
页码:499 / 502
页数:4
相关论文
共 50 条
  • [41] Empowering OCL research: a large-scale corpus of open-source data from GitHub
    Mengerink, Josh G. M.
    Noten, Jeroen
    Serebrenik, Alexander
    EMPIRICAL SOFTWARE ENGINEERING, 2019, 24 (03) : 1574 - 1609
  • [42] Leveraging Human Oversight and Intervention in Large-Scale Parallel Processing of Open-source Data
    Casini, Enrico
    Suri, Niranjan
    Bradshaw, Jeffrey M.
    NEXT-GENERATION ANALYST III, 2015, 9499
  • [43] Developing Goobi - An Open-Source Workflow Tracking Tool for Digitization Projects
    Hankiewicz, Steffen
    ARCHIVING 2016: FINAL PROGRAM AND PROCEEDINGS, 2016, : 131 - 134
  • [44] An Open Source Tool to Compare Simulators on Large-Scale Cases - Application to Dynawo
    Luis Marin, Jose
    Gaitan, Vicenc
    Oms, Guiu
    Chiaramello, Marco
    Cossart, Quentin
    Guironnet, Adrien
    2022 OPEN SOURCE MODELLING AND SIMULATION OF ENERGY SYSTEM (OSMSES), 2022,
  • [45] Verification in the light and large: Large-scale verification for fast-moving open source C projects
    Breuer, Peter T.
    Pickin, Simon
    31ST IEEE SOFTWARE ENGINEERING WORKSHOP, PROCEEDINGS, 2007, : 246 - 255
  • [46] FOUNTAIN: A JAVA open-source package to assist large sequencing projects
    Jean-Marie Buerstedde
    Florian Prill
    BMC Bioinformatics, 2
  • [47] Identifying and characterizing change-prone classes in two large-scale open-source products
    Koru, A. Guenes
    Liu, Hongfang
    JOURNAL OF SYSTEMS AND SOFTWARE, 2007, 80 (01) : 63 - 73
  • [48] Interpreting Large-Scale Attacks Against Open-Source Medical Systems Using eXplainable AI
    Lu, Wei
    COMPLEX, INTELLIGENT AND SOFTWARE INTENSIVE SYSTEMS, CISIS-2024, 2024, 87 : 60 - 71
  • [49] Comparison of Numerical Methods and Open-Source Libraries for Eigenvalue Analysis of Large-Scale Power Systems
    Tzounas, Georgios
    Dassios, Ioannis
    Liu, Muyang
    Milano, Federico
    APPLIED SCIENCES-BASEL, 2020, 10 (21): : 1 - 27
  • [50] Open source tools for large-scale neuroscience
    Freeman, Jeremy
    CURRENT OPINION IN NEUROBIOLOGY, 2015, 32 : 156 - 163