AMBICOREF: Evaluating Human and Model Sensitivity to Ambiguous Coreference

被引:0
|
作者
Yuan, Yuewei [1 ]
Malaviya, Chaitanya [1 ]
Yatskar, Mark [1 ]
机构
[1] Univ Penn, Philadelphia, PA 19104 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Given a sentence "Abby told Brittney that she upset Courtney", one would struggle to understand who "she" refers to, and ask for clarification. However, if the word "upset" were replaced with "hugged", "she" unambiguously refers to Abby. We study if modern co-reference resolution models are sensitive to such pronominal ambiguity. To this end, we construct AMBICOREF, a diagnostic corpus of minimal sentence pairs with ambiguous and unambiguous referents. Our examples generalize psycholinguistic studies of human perception of ambiguity around particular arrangements of verbs and their arguments. Analysis shows that (1) humans are less sure of referents in ambiguous AmbiCoref examples than unambiguous ones, and (2) most coreference models show little difference in output between ambiguous and unambiguous pairs. We release AMBICOREF as a diagnostic corpus for testing whether models treat ambiguity similarly to humans.(1)
引用
收藏
页码:1023 / 1030
页数:8
相关论文
共 50 条
  • [41] Evaluating contrast sensitivity
    Kitaguchi, Saori
    MacDonald, Lindsay
    Westland, Stephen
    HUMAN VISION AND ELECTRONIC IMAGING XI, 2006, 6057
  • [42] Sensitivity analysis of the human body mechanical model
    Ciglaric, I
    Prebil, I
    ZEITSCHRIFT FUR ANGEWANDTE MATHEMATIK UND MECHANIK, 2000, 80 : S343 - S344
  • [43] SENSITIVITY FUNCTIONS OF A HUMAN HEAD MOVEMENT MODEL
    ZANGEMEISTER, WH
    ARLT, AC
    LEHMAN, S
    MEDICAL ENGINEERING & PHYSICS, 1994, 16 (02) : 163 - 170
  • [44] Towards a Mention-Pair Model for Coreference Resolution in Portuguese
    Rocha, Gil
    Cardoso, Henrique Lopes
    PROGRESS IN ARTIFICIAL INTELLIGENCE (EPIA 2017), 2017, 10423 : 855 - 867
  • [45] Evaluating the material parameters of the human cornea in a numerical model
    Srodka, Wieslaw
    ACTA OF BIOENGINEERING AND BIOMECHANICS, 2011, 13 (03) : 77 - 85
  • [46] A XENOGENEIC MODEL FOR EVALUATING HUMAN DEMINERALIZED BONE PREPARATIONS
    MARINAK, KW
    TOWLE, HJ
    MELLONIG, JT
    JOURNAL OF DENTAL RESEARCH, 1986, 65 : 295 - 295
  • [47] Evaluating the efficacy of a numerical model of a human anatomy joint
    Vairis, Achilles
    Petousis, Markos
    Vidakis, Nektarios
    Kandyla, Betina
    Chrisoulakis, Christos
    Tsainis, Andreas-Marios
    2013 PROCEEDINGS OF THE 24TH ANNUAL CONFERENCE ON EUROPEAN ASSOCIATION FOR EDUCATION IN ELECTRICAL AND INFORMATION ENGINEERING (EAEEIE), 2013, : 170 - 173
  • [48] Evaluating Shigella flexneri Pathogenesis in the Human Enteroid Model
    Ranganathan, Sridevi
    Doucet, Michele
    Grassel, Christen L.
    Delaine-Elias, BreOnna
    Zachos, Nicholas C.
    Barry, Eileen M.
    INFECTION AND IMMUNITY, 2019, 87 (04)
  • [49] Context sensitivity and invariance in perception of octave-ambiguous tones
    Repp, Bruno H.
    Thompson, Jacqueline M.
    PSYCHOLOGICAL RESEARCH-PSYCHOLOGISCHE FORSCHUNG, 2010, 74 (05): : 437 - 456
  • [50] Context sensitivity and invariance in perception of octave-ambiguous tones
    Bruno H. Repp
    Jacqueline M. Thompson
    Psychological Research, 2010, 74 : 437 - 456