Measuring the quality of diff algorithms: a formalization

Cited by: 6
Authors
Barabucci, Gioele [2]
Ciancarini, Paolo [1]
Di Iorio, Angelo [1]
Vitali, Fabio [1]
Affiliations
[1] Univ Bologna, I-40126 Bologna, Italy
[2] Univ Cologne, Cologne, Germany
Funding
European Union Seventh Framework Programme
Keywords
Diff algorithms; Output quality; Metrics; Delta model; XML diff; XML documents; Evolution
DOI
10.1016/j.csi.2015.12.005
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
The automatic detection of differences between documents is a very common task in several domains. This paper introduces a formal way to compare diff algorithms and to analyze the deltas they produce. There is no one-size-fits-all definition of delta quality, because quality depends strongly on the application domain and on the final use of the detected changes. Researchers have historically focused on minimality: reducing the size of the produced edit scripts and/or taming the computational complexity of the algorithms. More recently, they have given more weight to the human interpretability of deltas, designing tools that produce more readable, usable, and domain-oriented results. We propose a universal delta model and a set of metrics to characterize and compare deltas produced by different algorithms, in order to highlight which are most suitable for a given task and domain. © 2016 Elsevier B.V. All rights reserved.
Pages: 52-65
Number of pages: 14