A new evaluation measure using compression dissimilarity on text summarization

被引:0
|
作者
Tong Wang
Ping Chen
Dan Simovici
机构
[1] University of Massachusetts Boston,Department of Computer Science
[2] University of Massachusetts Boston,Department of Computer Engineering
来源
Applied Intelligence | 2016年 / 45卷
关键词
Summarization evaluation; Compression;
D O I
暂无
中图分类号
学科分类号
摘要
Evaluation of automatic text summarization is a challenging task due to the difficulty of calculating similarity of two texts. In this paper, we define a new dissimilarity measure – compression dissimilarity to compute the dissimilarity between documents. Then we propose a new automatic evaluating method based on compression dissimilarity. The proposed method is a completely “black box” and does not need preprocessing steps. Experiments show that compression dissimilarity could clearly distinct automatic summaries from human summaries. Compression dissimilarity evaluating measure could evaluate an automatic summary by comparing with high-quality human summaries, or comparing with its original document. The evaluating results are highly correlated with human assessments, and the correlation between compression dissimilarity of summaries and compression dissimilarity of documents can serve as a meaningful measure to evaluate the consistency of an automatic text summarization system.
引用
收藏
页码:127 / 134
页数:7
相关论文
共 50 条
  • [21] Improving Graph Based Multidocument Text Summarization Using an Enhanced Sentence Similarity Measure
    Sarkar, Kamal
    Saraf, Khushbu
    Ghosh, Avishikta
    [J]. 2015 IEEE 2ND INTERNATIONAL CONFERENCE ON RECENT TRENDS IN INFORMATION SYSTEMS (RETIS), 2015, : 359 - 365
  • [22] Text summarization using topic-based vector space model and semantic measure
    Belwal, Ramesh Chandra
    Rai, Sawan
    Gupta, Atul
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2021, 58 (03)
  • [23] Cross-Language Text Summarization Using Sentence and Multi-Sentence Compression
    Pontes, Elvys Linhares
    Huet, Stephane
    Torres-Moreno, Juan-Manuel
    Linhares, Andrea Carneiro
    [J]. NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS (NLDB 2018), 2018, 10859 : 467 - 479
  • [24] Improving Compression Based Dissimilarity Measure for Music Score Analysis
    Takamoto, Ayaka
    Umemura, Mayu
    Yoshida, Mitsuo
    Umemura, Kyoji
    [J]. 2016 INTERNATIONAL CONFERENCE ON ADVANCED INFORMATICS - CONCEPTS, THEORY AND APPLICATION (ICAICTA), 2016,
  • [25] Experimental investigating the F-measure as similarity measure for automatic text summarization
    Alguliev, Rasim M.
    Aliguliyev, Ramiz M.
    [J]. APPLIED AND COMPUTATIONAL MATHEMATICS, 2007, 6 (02): : 278 - 287
  • [26] A NEW DISSIMILARITY MEASURE FOR CUT DETECTION USING BIPARTITE GRAPH MATCHING
    Ferzoli Guimaraes, Silvio Jamil
    Goncalves do Patrocinio, Zenilton Kleber, Jr.
    de Paula, Hugo Bastos
    da Silva, Henrique Batista
    [J]. INTERNATIONAL JOURNAL OF SEMANTIC COMPUTING, 2009, 3 (02) : 155 - 181
  • [27] Task-based evaluation of text summarization using relevance prediction
    Hobson, Stacy President
    Dorr, Bonnie J.
    Monz, Christof
    Schwartz, Richard
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2007, 43 (06) : 1482 - 1499
  • [28] Towards a Reliable Text Summarization Evaluation Metric Using Predictive Models
    Zhao, Bo
    Lui, Yui Man
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2022, 36 (10)
  • [29] A New Technique for Extrinsic Text Summarization
    Kindo, Nishita
    Bhuyan, Gananatha
    Padhy, Ronali
    [J]. COMPUTING AND NETWORK SUSTAINABILITY, 2019, 75
  • [30] An adaptation of a F-measure for automatic text summarization by extraction
    Boudia, Mohamed Amine
    Hamou, Reda Mohamed
    Amine, Abdelmalek
    Lokbani, Ahmed Chaouki
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2020, 23 (03): : 2389 - 2398