Measuring the quality of diff algorithms: a formalization

Cited by: 6
Authors
Barabucci, Gioele [2]
Ciancarini, Paolo [1]
Di Iorio, Angelo [1]
Vitali, Fabio [1]
Affiliations
[1] Univ Bologna, I-40126 Bologna, Italy
[2] Univ Cologne, Cologne, Germany
Funding
European Union Seventh Framework Programme (FP7);
Keywords
Diff algorithms; Output quality; Metrics; Delta model; XML diff; XML DOCUMENTS; EVOLUTION;
DOI
10.1016/j.csi.2015.12.005
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812 ;
Abstract
The automatic detection of differences between documents is a very common task in several domains. This paper introduces a formal way to compare diff algorithms and to analyze the deltas they produce. There is no one-size-fits-all definition of the quality of a delta, because quality is strongly related to the application domain and the final use of the detected changes. Researchers have historically focused on minimality: reducing the size of the produced edit scripts and/or taming the computational complexity of the algorithms. Recently they have started giving more weight to the human interpretability of deltas, designing tools that produce more readable, usable and domain-oriented results. We propose a universal delta model and a set of metrics to characterize and effectively compare deltas produced by different algorithms, in order to highlight which are the most suitable for a given task and domain. (C) 2016 Elsevier B.V. All rights reserved.
Pages: 52-65
Page count: 14