Measuring the quality of diff algorithms: a formalization

Cited by: 6
Authors
Barabucci, Gioele [2]
Ciancarini, Paolo [1]
Di Iorio, Angelo [1]
Vitali, Fabio [1]
Affiliations
[1] Univ Bologna, I-40126 Bologna, Italy
[2] Univ Cologne, Cologne, Germany
Funding
European Union Seventh Framework Programme (FP7)
Keywords
Diff algorithms; Output quality; Metrics; Delta model; XML diff; XML documents; Evolution
DOI
10.1016/j.csi.2015.12.005
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
The automatic detection of differences between documents is a very common task in several domains. This paper introduces a formal way to compare diff algorithms and to analyze the deltas they produce. There is no one-size-fits-all definition of the quality of a delta, because quality is strongly related to the application domain and the final use of the detected changes. Researchers have historically focused on minimality: reducing the size of the produced edit scripts and/or taming the computational complexity of the algorithms. Recently they have started giving more relevance to the human interpretability of the deltas, designing tools that produce more readable, usable and domain-oriented results. We propose a universal delta model and a set of metrics to effectively characterize and compare deltas produced by different algorithms, in order to highlight which ones are most suitable for a given task and domain. (C) 2016 Elsevier B.V. All rights reserved.
Pages: 52-65
Number of pages: 14