Benchmarking Answer Verification Methods for Question Answering-Based Summarization Evaluation Metrics

Cited by: 0

Authors:

Deutsch, Daniel [1]

Roth, Dan [1]

Affiliations:

[1] Univ Penn, Dept Comp & Informat Sci, Philadelphia, PA 19104 USA

Source:

FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022) | 2022

Keywords:

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Question answering-based summarization evaluation metrics must automatically determine whether the QA model's prediction is correct or not, a task known as answer verification. In this work, we benchmark the lexical answer verification methods which have been used by current QA-based metrics as well as two more sophisticated text comparison methods, BERTScore and LERC. We find that LERC outperforms the other methods in some settings while remaining statistically indistinguishable from lexical overlap in others. However, our experiments reveal that improved verification performance does not necessarily translate to overall QA-based metric quality: in some scenarios, using a worse verification method, or using none at all, has comparable performance to using the best verification method, a result that we attribute to properties of the datasets.
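As a rough illustration of what lexical answer verification can look like, the sketch below scores a QA model's predicted answer against a reference answer using exact match and token-level F1. The abstract does not name the specific lexical methods, so the choice of these two scores, the normalization steps, and the function names `normalize`, `exact_match`, and `token_f1` are all assumptions made for illustration, not the authors' implementation.

```python
import re
import string
from collections import Counter
from typing import List


def normalize(text: str) -> List[str]:
    """Lowercase, drop punctuation and English articles, split on whitespace.

    This normalization is an assumption for the sketch, not taken from the paper.
    """
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return text.split()


def exact_match(prediction: str, reference: str) -> float:
    """Return 1.0 if the normalized token sequences are identical, else 0.0."""
    return float(normalize(prediction) == normalize(reference))


def token_f1(prediction: str, reference: str) -> float:
    """Harmonic mean of token precision and recall between the two answers."""
    pred, ref = normalize(prediction), normalize(reference)
    if not pred or not ref:
        # Both empty counts as a match; exactly one empty counts as a miss.
        return float(pred == ref)
    # Multiset intersection counts each shared token at most min(count) times.
    overlap = sum((Counter(pred) & Counter(ref)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred)
    recall = overlap / len(ref)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    print(exact_match("The Eiffel Tower", "eiffel tower"))
    print(token_f1("Barack Obama", "Obama"))
```

Exact match gives no credit for partially correct answers, while token F1 gives partial credit for shared tokens. Learned comparison methods such as BERTScore and LERC would replace these functions with a model-based similarity score.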


Pages: 3759 - 3765

Page count: 7

