Multilevel Readability Interpretation Against Software Properties: A Data-Centric Approach

被引:1
|
作者
Karanikiotis, Thomas [1 ]
Papamichail, Michail D. [1 ]
Symeonidis, Andreas L. [1 ]
机构
[1] Aristotle Univ Thessaloniki, Elect & Comp Engn Dept, Informat Proc Lab, Intelligent Syst & Software Engn Labgrp, Thessaloniki, Greece
来源
关键词
Developer-perceived readability; Readability interpretation; Size-based clustering; Support vector regression; SUPPORT;
D O I
10.1007/978-3-030-83007-6_10
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Given the wide adoption of the agile software development paradigm, where efficient collaboration as well as effective maintenance are of utmost importance, the need to produce readable source code is evident. To that end, several research efforts aspire to assess the extent to which a software component is readable. Several metrics and evaluation criteria have been proposed; however, they are mostly empirical or rely on experts who are responsible for determining the ground truth and/or set custom thresholds, leading to results that are context-dependent and subjective. In this work, we employ a large set of static analysis metrics along with various coding violations towards interpreting readability as perceived by developers. Unlike already existing approaches, we refrain from using experts and we provide a fully automated and extendible methodology built upon data residing in online code hosting facilities. We perform static analysis at two levels (method and class) and construct a benchmark dataset that includes more than one million methods and classes covering diverse development scenarios. After performing clustering based on source code size, we employ Support Vector Regression in order to interpret the extent to which a software component is readable against the source code properties: cohesion, inheritance, complexity, coupling, and documentation. The evaluation of our methodology indicates that our models effectively interpret readability as perceived by developers against the above mentioned source code properties.
引用
收藏
页码:203 / 226
页数:24
相关论文
共 50 条
  • [31] A data-centric approach for ethical and trustworthy AI in journalism
    Dierickx, Laurence
    Opdahl, Andreas Lothe
    Khan, Sohail Ahmed
    Linden, Carl-Gustav
    Guerrero Rojas, Diana Carolina
    ETHICS AND INFORMATION TECHNOLOGY, 2024, 26 (04)
  • [32] A data-centric approach to understanding the pricing of financial options
    J. Healy
    M. Dixon
    B. Read
    F.F. Cai
    The European Physical Journal B - Condensed Matter and Complex Systems, 2002, 27 : 219 - 227
  • [33] Understanding the Indian Labour Market: A Data-Centric Approach
    Shabana, K. M.
    Gracious, Tony
    Subramonian, Hrishikesh
    PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON DATA SCIENCE & ENGINEERING (ICDSE), 2016, : 26 - 31
  • [34] A participatory data-centric approach to AI Ethics by Design
    Gerdes, Anne
    APPLIED ARTIFICIAL INTELLIGENCE, 2022, 36 (01)
  • [35] A data-centric approach to high-level synthesis
    Tarafdar, S
    Leeser, M
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2000, 19 (11) : 1251 - 1267
  • [36] Data-Centric Optimization Approach for Small, Imbalanced Datasets
    Tanov, Vladislav
    JOURNAL OF INFORMATION AND ORGANIZATIONAL SCIENCES, 2023, 47 (01) : 167 - 177
  • [37] Reliability evaluation of individual predictions: a data-centric approach
    Shahbazi, Nima
    Asudeh, Abolfazl
    VLDB JOURNAL, 2024, 33 (04): : 1203 - 1230
  • [38] A data-centric approach for scalable state machine replication
    Chockler, G
    Malkhi, D
    Dolev, D
    FUTURE DIRECTIONS IN DISTRIBUTED COMPUTING: RESEARCH AND POSITION PAPERS, 2003, 2584 : 159 - 163
  • [39] Identification of the Barriers to Data-Centric Approach in the Construction Industry
    Karji, Ali
    Messner, John
    Leicht, Robert
    McComb, Christopher
    CONSTRUCTION RESEARCH CONGRESS 2022: PROJECT MANAGEMENT AND DELIVERY, CONTRACTS, AND DESIGN AND MATERIALS, 2022, : 1002 - 1011
  • [40] Dynamic Load Balancing in Cloud A Data-Centric Approach
    Dasoriya, Rayan
    Kotadiya, Purvi
    Arya, Garima
    Nayak, Priyanshu
    Mistry, Kamal
    2017 INTERNATIONAL CONFERENCE ON NETWORKS & ADVANCES IN COMPUTATIONAL TECHNOLOGIES (NETACT), 2017, : 162 - 166