Multilevel Readability Interpretation Against Software Properties: A Data-Centric Approach

被引:1
|
作者
Karanikiotis, Thomas [1 ]
Papamichail, Michail D. [1 ]
Symeonidis, Andreas L. [1 ]
机构
[1] Aristotle Univ Thessaloniki, Elect & Comp Engn Dept, Informat Proc Lab, Intelligent Syst & Software Engn Labgrp, Thessaloniki, Greece
来源
关键词
Developer-perceived readability; Readability interpretation; Size-based clustering; Support vector regression; SUPPORT;
D O I
10.1007/978-3-030-83007-6_10
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Given the wide adoption of the agile software development paradigm, where efficient collaboration as well as effective maintenance are of utmost importance, the need to produce readable source code is evident. To that end, several research efforts aspire to assess the extent to which a software component is readable. Several metrics and evaluation criteria have been proposed; however, they are mostly empirical or rely on experts who are responsible for determining the ground truth and/or set custom thresholds, leading to results that are context-dependent and subjective. In this work, we employ a large set of static analysis metrics along with various coding violations towards interpreting readability as perceived by developers. Unlike already existing approaches, we refrain from using experts and we provide a fully automated and extendible methodology built upon data residing in online code hosting facilities. We perform static analysis at two levels (method and class) and construct a benchmark dataset that includes more than one million methods and classes covering diverse development scenarios. After performing clustering based on source code size, we employ Support Vector Regression in order to interpret the extent to which a software component is readable against the source code properties: cohesion, inheritance, complexity, coupling, and documentation. The evaluation of our methodology indicates that our models effectively interpret readability as perceived by developers against the above mentioned source code properties.
引用
收藏
页码:203 / 226
页数:24
相关论文
共 50 条
  • [41] Data-centric strategies for deep-learning accelerated salt interpretation
    Gala, Apurva
    Devarakota, Pandu
    DATA-CENTRIC ENGINEERING, 2025, 6
  • [42] Redactor: A Data-Centric and Individualized Defense Against Inference Attacks
    Heo, Geon
    Whang, Steven Euijong
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 12, 2023, : 14874 - 14882
  • [43] Analyzing Data-Centric Properties for Graph Contrastive Learning
    Trivedi, Puja
    Lubana, Ekdeep Singh
    Heimann, Mark
    Koutra, Danai
    Thiagarajan, Jayaraman J.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [44] Cooperative approach for data-centric and node-centric misbehavior detection in VANET
    Sultana, Rukhsar
    Grover, Jyoti
    Tripathi, Meenakshi
    VEHICULAR COMMUNICATIONS, 2024, 50
  • [45] A Data-driven Methodology towards Interpreting Readability against Software Properties
    Karanikiotis, Thomas
    Papamichail, Michail D.
    Gonidelis, Ioannis
    Karatza, Dimitra
    Symeonidis, Andreas L.
    ICSOFT: PROCEEDINGS OF THE 15TH INTERNATIONAL CONFERENCE ON SOFTWARE TECHNOLOGIES, 2020, : 61 - 72
  • [46] Materials data science using CRADLE: A distributed, data-centric approach
    Ciardi, Thomas G.
    Nihar, Arafath
    Chawla, Rounak
    Akanbi, Olatunde
    Tripathi, Pawan K.
    Wu, Yinghui
    Chaudhary, Vipin
    French, Roger H.
    MRS COMMUNICATIONS, 2024, 14 (04) : 601 - 611
  • [47] Have data, will travel: A data-centric approach to enterprise systems development
    Zumbado, J
    Iller, W
    Naecker, PA
    CONFERENCE XXII - GEOSPATIAL INFORMATION & TECHNOLOGY ASSOCIATION, PROCEEDINGS, 1999, : 121 - 131
  • [48] A Data Mesh Approach for Enabling Data-Centric Applications at the Tactical Edge
    Dahdal, Simon
    Poltronieri, Filippo
    Tortonesi, Mauro
    Stefanelli, Cesare
    Suri, Niranjan
    2023 INTERNATIONAL CONFERENCE ON MILITARY COMMUNICATIONS AND INFORMATION SYSTEMS, ICMCIS, 2023,
  • [49] A Data-Centric Approach to Quality Estimation of Role Mining Results
    Dong, Lijun
    Wu, Kui
    Tang, Guoming
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2016, 11 (12) : 2678 - 2692
  • [50] A Data-Centric Approach to Understanding the 2020 US Presidential Election
    Srinivasan, Satish Mahadevan
    Paat, Yok-Fong
    BIG DATA AND COGNITIVE COMPUTING, 2024, 8 (09)