AMBICOREF: Evaluating Human and Model Sensitivity to Ambiguous Coreference

Cited by: 0

Authors:

Yuan, Yuewei [1]

Malaviya, Chaitanya [1]

Yatskar, Mark [1]

Affiliations:

[1] Univ Penn, Philadelphia, PA 19104 USA

Source:

17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023 | 2023


DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Given a sentence "Abby told Brittney that she upset Courtney", one would struggle to understand who "she" refers to, and ask for clarification. However, if the word "upset" were replaced with "hugged", "she" unambiguously refers to Abby. We study if modern co-reference resolution models are sensitive to such pronominal ambiguity. To this end, we construct AMBICOREF, a diagnostic corpus of minimal sentence pairs with ambiguous and unambiguous referents. Our examples generalize psycholinguistic studies of human perception of ambiguity around particular arrangements of verbs and their arguments. Analysis shows that (1) humans are less sure of referents in ambiguous AMBICOREF examples than unambiguous ones, and (2) most coreference models show little difference in output between ambiguous and unambiguous pairs. We release AMBICOREF as a diagnostic corpus for testing whether models treat ambiguity similarly to humans.


Pages: 1023-1030

Page count: 8
