Realistic Evaluation Principles for Cross-document Coreference Resolution

被引:0
|
作者
Cattan, Arie [1 ]
Eirew, Alon [1 ,2 ]
Stanovsky, Gabriel [3 ]
Joshi, Mandar [4 ]
Dagan, Ido [1 ]
机构
[1] Bar Ilan Univ, Comp Sci Dept, Ramat Gan, Israel
[2] Intel Labs, Jerusalem, Israel
[3] Hebrew Univ Jerusalem, Jerusalem, Israel
[4] Univ Washington, Allen Sch Comp Sci & Engn, Seattle, WA 98195 USA
基金
以色列科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We point out that common evaluation practices for cross-document coreference resolution have been unrealistically permissive in their assumed settings, yielding inflated results. We propose addressing this issue via two evaluation methodology principles. First, as in other tasks, models should be evaluated on predicted mentions rather than on gold mentions. Doing this raises a subtle issue regarding singleton coreference clusters, which we address by decoupling the evaluation of mention detection from that of coreference linking. Second, we argue that models should not exploit the synthetic topic structure of the standard ECB+ dataset, forcing models to confront the lexical ambiguity challenge, as intended by the dataset creators. We demonstrate empirically the drastic impact of our more realistic evaluation principles on a competitive model, yielding a score which is 33 F1 lower compared to evaluating by prior lenient practices.(1)
引用
收藏
页码:143 / 151
页数:9
相关论文
共 50 条
  • [1] An Overview of Cross-Document Coreference Resolution
    Keshtkaran, Aliakbar
    Yuhaniz, Siti Sophiayati
    Ibrahim, Suhaimi
    [J]. 2017 FIRST INTERNATIONAL CONFERENCE ON COMPUTER AND DRONE APPLICATIONS (ICONDA), 2017, : 43 - 48
  • [2] The Challenges of Cross-Document Coreference Resolution in Email
    Li, Xue
    Magliacane, Sara
    Groth, Paul
    [J]. PROCEEDINGS OF THE 11TH KNOWLEDGE CAPTURE CONFERENCE (K-CAP '21), 2021, : 273 - 276
  • [3] XCoref: Cross-document Coreference Resolution in the Wild
    Zhukova, Anastasia
    Hamborg, Felix
    Donnay, Karsten
    Gipp, Bela
    [J]. INFORMATION FOR A BETTER WORLD: SHAPING THE GLOBAL FUTURE, PT I, 2022, 13192 : 272 - 291
  • [4] Cross-document Coreference Resolution over Predicted Mentions
    Cattan, Arie
    Eirew, Alon
    Stanovsky, Gabriel
    Joshi, Mandar
    Dagan, Ido
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 5100 - 5107
  • [5] Cross-document transliterated personal name coreference resolution
    Wang, HF
    [J]. FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, PT 2, PROCEEDINGS, 2005, 3614 : 11 - 20
  • [6] A methodology for cross-document coreference
    Bagga, A
    Biermann, AW
    [J]. PROCEEDINGS OF THE FIFTH JOINT CONFERENCE ON INFORMATION SCIENCES, VOLS 1 AND 2, 2000, : 207 - 210
  • [7] Cross-Document Coreference Resolution based on Automatic Text Summary
    Gao, Sanyuan
    Li, Si
    Xu, Weiran
    Guo, Jun
    [J]. THIRD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING: WKDD 2010, PROCEEDINGS, 2010, : 306 - 309
  • [8] Revisiting Joint Modeling of Cross-document Entity and Event Coreference Resolution
    Barhom, Shany
    Shwartz, Vered
    Eirew, Alon
    Bugert, Michael
    Reimers, Nils
    Dagan, Ido
    [J]. 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 4179 - 4189
  • [9] A Cross-Document Coreference Resolution Approach to Low-Resource Languages
    Theptakob, Nathanon
    Ayutthaya, Thititorn Seneewong Na
    Saetia, Chanatip
    Chalothorn, Tawunrat
    Buabthong, Pakpoom
    [J]. KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT II, KSEM 2023, 2023, 14118 : 422 - 431
  • [10] Contrastive Representation Learning for Cross-Document Coreference Resolution of Events and Entities
    Hsu, Benjamin
    Horwood, Graham
    [J]. NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 3644 - 3655