A new evaluation measure using compression dissimilarity on text summarization

被引:6
|
作者
Wang, Tong [1 ]
Chen, Ping [2 ]
Simovici, Dan [1 ]
机构
[1] Univ Massachusetts, Dept Comp Sci, Boston, MA 02125 USA
[2] Univ Massachusetts, Dept Comp Engn, Boston, MA 02125 USA
关键词
Summarization evaluation; Compression;
D O I
10.1007/s10489-015-0747-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Evaluation of automatic text summarization is a challenging task due to the difficulty of calculating similarity of two texts. In this paper, we define a new dissimilarity measure-compression dissimilarity to compute the dissimilarity between documents. Then we propose a new automatic evaluating method based on compression dissimilarity. The proposed method is a completely "black box" and does not need preprocessing steps. Experiments show that compression dissimilarity could clearly distinct automatic summaries from human summaries. Compression dissimilarity evaluating measure could evaluate an automatic summary by comparing with high-quality human summaries, or comparing with its original document. The evaluating results are highly correlated with human assessments, and the correlation between compression dissimilarity of summaries and compression dissimilarity of documents can serve as a meaningful measure to evaluate the consistency of an automatic text summarization system.
引用
收藏
页码:127 / 134
页数:8
相关论文
共 50 条
  • [1] A new evaluation measure using compression dissimilarity on text summarization
    Tong Wang
    Ping Chen
    Dan Simovici
    [J]. Applied Intelligence, 2016, 45 : 127 - 134
  • [2] Clustering workflow requirements using compression dissimilarity measure
    Wei, Li
    Handley, John
    Martin, Nathaniel
    Sun, Tong
    Keogh, Eamonn
    [J]. ICDM 2006: SIXTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, WORKSHOPS, 2006, : 50 - 54
  • [3] Text Classification Using Compression-Based Dissimilarity Measures
    Coutinho, David Pereira
    Figueiredo, Mario A. T.
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2015, 29 (05)
  • [4] SYMBOLIC CLUSTERING USING A NEW DISSIMILARITY MEASURE
    GOWDA, KC
    DIDAY, E
    [J]. PATTERN RECOGNITION, 1991, 24 (06) : 567 - 578
  • [5] Video Summarization Using Feature Dissimilarity
    Kim, Hyuncheol
    Yoon, Inhye
    Kim, TaeYong
    Paik, Joonki
    [J]. 2016 INTERNATIONAL CONFERENCE ON ELECTRONICS, INFORMATION, AND COMMUNICATIONS (ICEIC), 2016,
  • [6] Text summarization evaluation using semantic probability distributions
    Le, Anh
    Wu, Fred
    Vu, Lan
    Le, Thanh
    [J]. 2023 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE, CSCI 2023, 2023, : 207 - 212
  • [7] Neural Extractive Text Summarization with Syntactic Compression
    Xu, Jiacheng
    Durrett, Greg
    [J]. 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 3292 - 3303
  • [8] EVALUATION MEASURES FOR TEXT SUMMARIZATION
    Steinberger, Josef
    Jezek, Karel
    [J]. COMPUTING AND INFORMATICS, 2009, 28 (02) : 251 - 275
  • [9] new graph based text segmentation using Wikipedia for automatic text summarization
    Pourvali, Mohsen
    Abadeh, Mohammad Saniee
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2012, 3 (01) : 35 - 39
  • [10] Abstractive text summarization using deep learning with a new Turkish summarization benchmark dataset
    Ertam, Fatih
    Aydin, Galip
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (09):