Semantic Similarity Based Evaluation for Abstractive News Summarization

被引:0
|
作者
Fikri, Figen Beken [1 ]
Oflazer, Kemal [2 ]
Yanikoglu, Berrin [1 ]
机构
[1] Sabanci Univ, Dept Comp Sci & Engn, Istanbul, Turkey
[2] Carnegie Mellon Univ Qatar, Dept Comp Sci, Doha, Qatar
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
ROUGE is a widely used evaluation metric in text summarization. However, it is not suitable for the evaluation of abstractive summarization systems as it relies on lexical overlap between the gold standard and the generated summaries. This limitation becomes more apparent for agglutinative languages with very large vocabularies and high type/token ratios. In this paper, we present semantic similarity models for Turkish and apply them as evaluation metrics for an abstractive summarization task. To achieve this, we translated the English STSb dataset into Turkish and presented the first semantic textual similarity dataset for Turkish. We showed that our best similarity models have better alignment with average human judgments compared to ROUGE in both Pearson and Spearman correlations.
引用
收藏
页码:24 / 33
页数:10
相关论文
共 50 条
  • [31] Deep Reinforcement Learning with Distributional Semantic Rewards for Abstractive Summarization
    Li, Siyao
    Lei, Deren
    Qin, Pengda
    Wang, William Yang
    [J]. 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 6038 - 6044
  • [32] A Study on Ontology based Abstractive Summarization
    Mohan, Jishma M.
    Sunitha, C.
    Ganesh, Amal
    Jaya, A.
    [J]. FOURTH INTERNATIONAL CONFERENCE ON RECENT TRENDS IN COMPUTER SCIENCE & ENGINEERING (ICRTCSE 2016), 2016, 87 : 32 - 37
  • [33] Sentence Pair Embeddings Based Evaluation Metric for Abstractive and Extractive Summarization
    Akula, Ramya
    Garibay, Ivan
    [J]. LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 6009 - 6017
  • [34] Automatically Discarding Straplines to Improve Data Quality for Abstractive News Summarization
    Keleg, Amr
    Lindemann, Matthias
    Liu, Danyang
    Long, Wanqiu
    Webber, Bonnie L.
    [J]. PROCEEDINGS OF THE FIRST WORKSHOP ON EFFICIENT BENCHMARKING IN NLP (NLP POWER 2022), 2022, : 42 - 51
  • [35] A Finer-grain Universal Dialogue Semantic Structures based Model For Abstractive Dialogue Summarization
    Lei, Yuejie
    Zheng, Fujia
    Yan, Yuanmeng
    He, Keqing
    Xu, Weiran
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 1354 - 1364
  • [36] Leveraging Lead Bias for Zero-shot Abstractive News Summarization
    Zhu, Chenguang
    Yang, Ziyi
    Gmyr, Robert
    Zeng, Michael
    Huang, Xuedong
    [J]. SIGIR '21 - PROCEEDINGS OF THE 44TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2021, : 1462 - 1471
  • [37] Legal public opinion news abstractive summarization by incorporating topic information
    Yuxin Huang
    Zhengtao Yu
    Junjun Guo
    Zhiqiang Yu
    Yantuan Xian
    [J]. International Journal of Machine Learning and Cybernetics, 2020, 11 : 2039 - 2050
  • [38] Genetic Semantic Graph Approach for Multi-document Abstractive Summarization
    Khan, Atif
    Salim, Naomie
    Kumar, Yogan Jaya
    [J]. 2015 FIFTH INTERNATIONAL CONFERENCE ON DIGITAL INFORMATION PROCESSING AND COMMUNICATIONS (ICDIPC), 2015, : 173 - 181
  • [39] Legal public opinion news abstractive summarization by incorporating topic information
    Huang, Yuxin
    Yu, Zhengtao
    Guo, Junjun
    Yu, Zhiqiang
    Xian, Yantuan
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2020, 11 (09) : 2039 - 2050
  • [40] A Study of Abstractive Summarization Using Semantic Representations and Discourse Level Information
    Valderrama Vilca, Gregory Cesar
    Sobrevilla Cabezudo, Marco Antonio
    [J]. TEXT, SPEECH, AND DIALOGUE, TSD 2017, 2017, 10415 : 482 - 490