AMBICOREF: Evaluating Human and Model Sensitivity to Ambiguous Coreference

Cited by: 0
Authors
Yuan, Yuewei [1 ]
Malaviya, Chaitanya [1 ]
Yatskar, Mark [1 ]
Affiliations
[1] Univ Penn, Philadelphia, PA 19104 USA
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Given a sentence such as "Abby told Brittney that she upset Courtney", one would struggle to understand who "she" refers to, and would ask for clarification. However, if the word "upset" were replaced with "hugged", "she" would unambiguously refer to Abby. We study whether modern coreference resolution models are sensitive to such pronominal ambiguity. To this end, we construct AMBICOREF, a diagnostic corpus of minimal sentence pairs with ambiguous and unambiguous referents. Our examples generalize psycholinguistic studies of human perception of ambiguity around particular arrangements of verbs and their arguments. Analysis shows that (1) humans are less sure of referents in ambiguous AMBICOREF examples than in unambiguous ones, and (2) most coreference models show little difference in output between ambiguous and unambiguous pairs. We release AMBICOREF as a diagnostic corpus for testing whether models treat ambiguity similarly to humans.
Pages: 1023-1030
Page count: 8