Measuring the quality of diff algorithms: a formalization

Cited by: 6
Authors
Barabucci, Gioele [2]
Ciancarini, Paolo [1 ]
Di Iorio, Angelo [1 ]
Vitali, Fabio [1 ]
Affiliations
[1] Univ Bologna, I-40126 Bologna, Italy
[2] Univ Cologne, Cologne, Germany
Funding
European Union Seventh Framework Programme
Keywords
Diff algorithms; Output quality; Metrics; Delta model; XML diff; XML DOCUMENTS; EVOLUTION
DOI
10.1016/j.csi.2015.12.005
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
The automatic detection of differences between documents is a very common task in several domains. This paper introduces a formal way to compare diff algorithms and to analyze the deltas they produce. There is no one-size-fits-all definition of the quality of a delta, because it is strongly related to the application domain and the final use of the detected changes. Researchers have historically focused on minimality: reducing the size of the produced edit scripts and/or taming the computational complexity of the algorithms. Recently they have started giving more relevance to the human interpretability of the deltas, designing tools that produce more readable, usable and domain-oriented results. We propose a universal delta model and a set of metrics to characterize and effectively compare deltas produced by different algorithms, in order to highlight which are most suitable for a given task and domain. (C) 2016 Elsevier B.V. All rights reserved.
Pages: 52 - 65
Number of pages: 14
Related papers
50 in total
  • [31] Formalization and Automation of Quality Assurance Processes in Radiation Oncology
    Munbodh, R.
    Zaveri, H.
    Caminati, M.
    Bowles, J.
    MEDICAL PHYSICS, 2018, 45 (06) : E274 - E274
  • [32] Algorithms for processing measuring data in quality control of tempering of steel products by losses on remagnetization
    Morozova, V.M.
    Ponomarev, V.S.
    Remez, N.V.
    The Soviet journal of nondestructive testing, 1991, 26 (08) : 567 - 572
  • [33] On measuring the accuracy of SLAM algorithms
    Kuemmerle, Rainer
    Steder, Bastian
    Dornhege, Christian
    Ruhnke, Michael
    Grisetti, Giorgio
    Stachniss, Cyrill
    Kleiner, Alexander
    AUTONOMOUS ROBOTS, 2009, 27 (04) : 387 - 407
  • [34] Measuring texture classification algorithms
    Smith, G
    Burns, I
    PATTERN RECOGNITION LETTERS, 1997, 18 (14) : 1495 - 1501
  • [35] On measuring the accuracy of SLAM algorithms
    Rainer Kümmerle
    Bastian Steder
    Christian Dornhege
    Michael Ruhnke
    Giorgio Grisetti
    Cyrill Stachniss
    Alexander Kleiner
    Autonomous Robots, 2009, 27 : 387 - 407
  • [36] Measuring Racial Discrimination in Algorithms
    Arnold, David
    Dobbie, Will
    Hull, Peter
    AEA PAPERS AND PROCEEDINGS, 2021, 111 : 49 - 54
  • [37] Measuring concurrency of parallel algorithms
    Skrinarova, Jarmila
    Melichercik, Miroslav
    PROCEEDINGS OF THE 2008 1ST INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY, 2008 : 289 - 292
  • [38] Jug measuring: Algorithms and complexity
    Shieh, Min-Zheng
    Tsai, Shi-Chun
    THEORETICAL COMPUTER SCIENCE, 2008, 396 (1-3) : 50 - 62
  • [39] FORMALIZATION OF INTERRELATIONS BETWEEN OPERATORS AND DATA WITHIN THE FRAMEWORK OF AN EXTENDED ALGEBRA OF ALGORITHMS
    Akulovsky, V. G.
    CYBERNETICS AND SYSTEMS ANALYSIS, 2008, 44 (06) : 941 - 950
  • [40] Teaching the Formalization of Mathematical Theories and Algorithms via the Automatic Checking of Finite Models
    Schreiner, Wolfgang
    Brunhuemer, Alexander
    Fuerst, Christoph
    ELECTRONIC PROCEEDINGS IN THEORETICAL COMPUTER SCIENCE, 2018, (267) : 120 - 139