AMBICOREF: Evaluating Human and Model Sensitivity to Ambiguous Coreference

被引:0
|
作者
Yuan, Yuewei [1 ]
Malaviya, Chaitanya [1 ]
Yatskar, Mark [1 ]
机构
[1] Univ Penn, Philadelphia, PA 19104 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Given a sentence "Abby told Brittney that she upset Courtney", one would struggle to understand who "she" refers to, and ask for clarification. However, if the word "upset" were replaced with "hugged", "she" unambiguously refers to Abby. We study if modern co-reference resolution models are sensitive to such pronominal ambiguity. To this end, we construct AMBICOREF, a diagnostic corpus of minimal sentence pairs with ambiguous and unambiguous referents. Our examples generalize psycholinguistic studies of human perception of ambiguity around particular arrangements of verbs and their arguments. Analysis shows that (1) humans are less sure of referents in ambiguous AmbiCoref examples than unambiguous ones, and (2) most coreference models show little difference in output between ambiguous and unambiguous pairs. We release AMBICOREF as a diagnostic corpus for testing whether models treat ambiguity similarly to humans.(1)
引用
收藏
页码:1023 / 1030
页数:8
相关论文
共 50 条
  • [1] COREFERENCE AND PARALLEL FUNCTION STRATEGY FOR THE AMBIGUOUS ANAPHORIC PRONOUNS
    RONDAL, JA
    LEYEN, N
    BREDART, S
    PEREE, F
    CAHIERS DE PSYCHOLOGIE COGNITIVE-CURRENT PSYCHOLOGY OF COGNITION, 1984, 4 (02): : 151 - 170
  • [2] Coreference Resolution in Ambiguous Pronouns Using BERT and SVM
    Mohan, Monisha
    Nair, Jyothisha J.
    PROCEEDINGS OF THE 2019 9TH INTERNATIONAL SYMPOSIUM ON EMBEDDED COMPUTING AND SYSTEM DESIGN (ISED 2019), 2019, : 68 - 72
  • [3] Evaluating Ambiguous Offerings
    Boulongne, Romain
    Durand, Rodolphe
    ORGANIZATION SCIENCE, 2021, 32 (02) : 257 - 272
  • [4] Evaluating the goodness of various equations to model the contrast sensitivity function of the human eye
    Leube, Alexander
    Schilling, Tim Tobias
    Ohlendorf, Arne
    Wahl, Siegfried
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2016, 57 (12)
  • [5] Evaluating and Improving the Coreference Capabilities of Machine Translation Models
    Yehudai, Asaf
    Cattan, Arie
    Abend, Omri
    Stanovsky, Gabriel
    17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 980 - 992
  • [6] A Probabilistic Annotation Model for Crowdsourcing Coreference
    Paun, Silviu
    Chamberlain, Jon
    Kruschwitz, Udo
    Yu, Juntao
    Poesio, Massimo
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 1926 - 1937
  • [7] Evaluating the state of the art in coreference resolution for electronic medical records
    Uzuner, Ozlem
    Bodnari, Andreea
    Shen, Shuying
    Forbush, Tyler
    Pestian, John
    South, Brett R.
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2012, 19 (05) : 786 - 791
  • [8] Evaluating hybrid versus data-driven coreference resolution
    Hendrickx, Iris
    Hoste, Veronique
    Daelemans, Walter
    ANAPHORA: ANALYSIS, ALGORITHMS AND APPLICATIONS, 2007, 4410 : 137 - +
  • [9] Evaluating Ambiguous Questions in Semantic Parsing
    Papicchio, Simone
    Papotti, Paolo
    Cagliero, Luca
    2024 IEEE 40TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOP, ICDEW, 2024, : 338 - 342
  • [10] Model-based annotation of coreference
    Aralikatte, Rahul
    Sogaard, Anders
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 74 - 79