The BETTER Cross-Language Information Retrieval Datasets

被引:3
|
作者
Soboroff, Ian [1 ]
机构
[1] NIST, Gaithersburg, MD 20899 USA
关键词
information retrieval; test collection; information extraction;
D O I
10.1145/3539618.3591910
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The IARPA BETTER (Better Extraction from Text Through Enhanced Retrieval) program held three evaluations of information retrieval (IR) and information extraction (IE). For both tasks, the only training data available was in English, but systems had to perform cross-language retrieval and extraction from Arabic, Farsi, Chinese, Russian, and Korean. Pooled assessment and information extraction annotation were used to create reusable IR test collections. These datasets are freely available to researchers working in cross-language retrieval, information extraction, or the conjunction of IR and IE. This paper describes the datasets, how they were constructed, and how they might be used by researchers.
引用
收藏
页码:3047 / 3053
页数:7
相关论文
共 50 条
  • [41] Cross-Language Retrieval with Wikipedia
    Schoenhofen, Peter
    Benczur, Andras
    Biro, Istvan
    Csalogany, Karoly
    ADVANCES IN MULTILINGUAL AND MULTIMODAL INFORMATION RETRIEVAL, 2008, 5152 : 72 - 79
  • [42] Categorization-driven cross-language retrieval of medical information
    Freitas, HR
    Ribeiro-Neto, B
    Vale, RF
    Laender, AHF
    Lima, LRS
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2006, 57 (04): : 501 - 510
  • [43] Billingual Formal Concept Analysis for Cross-Language Information Retrieval
    Ali, Chedi Bechikh
    Haddad, Hatem
    Slimani, Yahia
    2017 IEEE/ACS 14TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2017, : 922 - 928
  • [44] Support for interactive document selection in cross-language information retrieval
    Oard, DW
    Resnik, P
    INFORMATION PROCESSING & MANAGEMENT, 1999, 35 (03) : 363 - 379
  • [45] Research on Chinese-English cross-language information retrieval
    Zhang, Tao
    Zhang, Yue-Jie
    PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2008, : 2591 - +
  • [46] Dictionary-based techniques for cross-language information retrieval
    Levow, GA
    Oard, DW
    Resnik, P
    INFORMATION PROCESSING & MANAGEMENT, 2005, 41 (03) : 523 - 547
  • [47] Utilizing Images for Assisting Cross-language Information Retrieval on the Web
    Hayashi, Yoshihiko
    Bora, Savas Ali
    Nagata, Masaaki
    2009 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCES ON WEB INTELLIGENCE (WI) AND INTELLIGENT AGENT TECHNOLOGIES (IAT), VOL 3, 2009, : 100 - +
  • [48] Creating and exploiting a comparable corpus in cross-language information retrieval
    Talvensaari, Tuomas
    Laurikkala, Jorma
    Jarvelin, Kalervo
    Juhola, Martti
    Keskustalo, Heikki
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2007, 25 (01)
  • [49] Research on English-Chinese cross-language information retrieval
    Zhang, Yue-Jie
    Zhang, Tao
    PROCEEDINGS OF 2007 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2007, : 3448 - +
  • [50] How to compare bilingual to monolingual cross-language information retrieval
    Crivellari, Franco
    Di Nunzio, Giorgio Maria
    Ferro, Nicola
    ADVANCES IN INFORMATION RETRIEVAL, 2007, 4425 : 533 - +