BenchIE: A Framework for Multi-Faceted Fact-Based Open Information Extraction Evaluation

Authors
Gashteovski, Kiril [1]
Yu, Mingying [1, 2]
Kotnis, Bhushan [1]
Lawrence, Carolin [1]
Niepert, Mathias [1, 4]
Glavas, Goran [2, 3]
Affiliations
[1] NEC Labs Europe GmbH, Heidelberg, Germany
[2] Univ Mannheim, Mannheim, Germany
[3] Ludwig Maximilians Univ Munchen, Munich, Germany
[4] Univ Stuttgart, Stuttgart, Germany
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Intrinsic evaluations of open information extraction (OIE) systems are carried out either manually, with human evaluators judging the correctness of extractions, or automatically, on standardized benchmarks. The latter, while much more cost-effective, is less reliable, primarily because the existing OIE benchmarks are incomplete: the ground truth extractions do not include all acceptable variants of the same fact, leading to unreliable assessment of the models' performance. Moreover, the existing OIE benchmarks are available for English only. In this work, we introduce BenchIE: a benchmark and evaluation framework for comprehensive evaluation of OIE systems for English, Chinese, and German. In contrast to existing OIE benchmarks, BenchIE is fact-based, i.e., it takes into account informational equivalence of extractions: our gold standard consists of fact synsets, clusters in which we exhaustively list all acceptable surface forms of the same fact. Moreover, having in mind common downstream applications for OIE, we make BenchIE multi-faceted; i.e., we create benchmark variants that focus on different facets of OIE evaluation, e.g., compactness or minimality of extractions. We benchmark several state-of-the-art OIE systems using BenchIE and demonstrate that these systems are significantly less effective than indicated by existing OIE benchmarks. We make BenchIE (data and evaluation code) publicly available.
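The idea of scoring against fact synsets can be illustrated with a minimal sketch. It is not the released BenchIE evaluation code: the function name `score`, the exact-match rule, the per-sentence scoring, and the example sentence and triples are all assumptions made for illustration. An extraction counts as correct if it matches any surface form in any synset, and a fact counts as recalled if at least one of its surface forms was extracted.

```python
from typing import List, Set, Tuple

Triple = Tuple[str, str, str]  # (subject, relation, object)


def score(extractions: List[Triple], synsets: List[Set[Triple]]) -> Tuple[float, float, float]:
    """Return (precision, recall, f1) for one sentence.

    Assumption: exact string match between an extraction and a surface form.
    """
    # An extraction is correct if it appears in any fact synset.
    correct = [e for e in extractions if any(e in s for s in synsets)]
    # A fact is covered if at least one of its surface forms was extracted.
    covered = [s for s in synsets if any(e in s for e in extractions)]
    precision = len(correct) / len(extractions) if extractions else 0.0
    recall = len(covered) / len(synsets) if synsets else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return precision, recall, f1


# Hypothetical gold standard: two facts, each with two acceptable surface forms.
gold = [
    {("Sen. Mitchell", "is", "confident"), ("Mitchell", "is", "confident")},
    {("Sen. Mitchell", "has", "sufficient votes"), ("he", "has", "sufficient votes")},
]
# Hypothetical system output: one acceptable variant and one wrong extraction.
system_output = [("Mitchell", "is", "confident"), ("Mitchell", "is", "votes")]
precision, recall, f1 = score(system_output, gold)
```

Because recall is counted over facts rather than over individual triples, a system is not penalized for producing only one of several equivalent surface forms, which is the incompleteness problem the abstract attributes to earlier benchmarks.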
Pages: 4472-4490
Page count: 19
    ON THE MOVE TO MEANINGFUL INTERNET SYSTEMS: OTM 2014 WORKSHOPS, 2014, 8842 : 663 - 666