Benchmarking Answer Verification Methods for Question Answering-Based Summarization Evaluation Metrics

被引:0
|
作者
Deutsch, Daniel [1 ]
Roth, Dan [1 ]
机构
[1] Univ Penn, Dept Comp & Informat Sci, Philadelphia, PA 19104 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Question answering-based summarization evaluation metrics must automatically determine whether the QA model's prediction is correct or not, a task known as answer verification. In this work, we benchmark the lexical answer verification methods which have been used by current QA-based metrics as well as two more sophisticated text comparison methods, BERTScore and LERC. We find that LERC out-performs the other methods in some settings while remaining statistically indistinguishable from lexical overlap in others. However, our experiments reveal that improved verification performance does not necessarily translate to overall QA-based metric quality: In some scenarios, using a worse verification method - or using none at all - has comparable performance to using the best verification method, a result that we attribute to properties of the datasets.(1)
引用
收藏
页码:3759 / 3765
页数:7
相关论文
共 50 条
  • [31] Recallable Question Answering-Based Re-Ranking Considering Semantic Region for Cross-Modal Retrieval
    Yanagi, Rintaro
    Togo, Ren
    Ogawa, Takahiro
    Haseyama, Miki
    [J]. IEEE OPEN JOURNAL OF SIGNAL PROCESSING, 2023, 4 : 1 - 11
  • [32] CS Net: A Coarse-to-Fine-Grained Summarization Network for Community-Based Question Answering Summarization
    Fang, Yekun
    [J]. KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT II, KSEM 2024, 2024, 14885 : 407 - 423
  • [33] Accuracy evaluation of methods and techniques in Web-based question answering systems: a survey
    Asad Ali Shah
    Sri Devi Ravana
    Suraya Hamid
    Maizatul Akmar Ismail
    [J]. Knowledge and Information Systems, 2019, 58 : 611 - 650
  • [34] Accuracy evaluation of methods and techniques in Web-based question answering systems: a survey
    Shah, Asad Ali
    Ravana, Sri Devi
    Hamid, Suraya
    Ismail, Maizatul Akmar
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2019, 58 (03) : 611 - 650
  • [35] Transformer-based Sparse Encoder and Answer Decoder for Visual Question Answering
    Peng, Longkun
    An, Gaoyun
    Ruan, Qiuqi
    [J]. 2022 16TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP2022), VOL 1, 2022, : 120 - 123
  • [36] Dependency-Based Algorithms for Answer Validation Task in Russian Question Answering
    Solovyev, Alexander
    [J]. LANGUAGE PROCESSING AND KNOWLEDGE IN THE WEB, 2013, 8105 : 199 - 212
  • [37] A LF based answer indexing method for encyclopedia question-answering system
    Kim, HJ
    Wang, JH
    Lee, CK
    Lee, CH
    Jang, MG
    [J]. INFORMATION RETRIEVAL TECHNOLOGY, PROCEEDINGS, 2005, 3689 : 679 - 684
  • [38] A WWW-based question auto-answering system - Answer Web
    Shen, RM
    Li, XJ
    Xu, NZ
    Chen, W
    [J]. PROCEEDINGS OF ICCE'98, VOL 2 - GLOBAL EDUCATION ON THE NET, 1998, : 328 - 331
  • [39] Word/Phrase based Answer Type Classification for Bengali Question Answering System
    Islam, Md. Aminul
    Kabir, Md. Fasihul
    Abdullah-Al-Mamun, Khandaker
    Huda, Mohammad Nurul
    [J]. 2016 5TH INTERNATIONAL CONFERENCE ON INFORMATICS, ELECTRONICS AND VISION (ICIEV), 2016, : 445 - 448
  • [40] Answer-Based Entity Extraction and Alignment for Visual Text Question Answering
    Yu, Jun
    Jing, Mohan
    Liu, Weihao
    Luo, Tongxu
    Zhang, Bingyuan
    Lu, Keda
    Lei, Fangyu
    Sun, Jianqing
    Liang, Jiaen
    [J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 9487 - 9491