The BETTER Cross-Language Information Retrieval Datasets

被引:0
|
作者
Soboroff, Ian [1 ]
机构
[1] NIST, Gaithersburg, MD 20899 USA
关键词
information retrieval; test collection; information extraction;
D O I
10.1145/3539618.3591910
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The IARPA BETTER (Better Extraction from Text Through Enhanced Retrieval) program held three evaluations of information retrieval (IR) and information extraction (IE). For both tasks, the only training data available was in English, but systems had to perform cross-language retrieval and extraction from Arabic, Farsi, Chinese, Russian, and Korean. Pooled assessment and information extraction annotation were used to create reusable IR test collections. These datasets are freely available to researchers working in cross-language retrieval, information extraction, or the conjunction of IR and IE. This paper describes the datasets, how they were constructed, and how they might be used by researchers.
引用
收藏
页码:3047 / 3053
页数:7
相关论文
共 50 条
  • [1] Cross-language information retrieval
    Nie J.-Y.
    [J]. Synthesis Lectures on Human Language Technologies, 2010, 3 (01): : 1 - 142
  • [2] Cross-Language Information Retrieval
    Federico, Marcello
    [J]. COMPUTATIONAL LINGUISTICS, 2011, 37 (02) : 411 - 412
  • [3] Cross-language information retrieval
    Oard, DW
    Diekema, AR
    [J]. ANNUAL REVIEW OF INFORMATION SCIENCE AND TECHNOLOGY, 1998, 33 : 223 - 256
  • [4] Study on cross-language information retrieval
    Si, Shen
    [J]. PROCEEDINGS OF 2008 INTERNATIONAL PRE-OLYMPIC CONGRESS ON COMPUTER SCIENCE, VOL I: COMPUTER SCIENCE AND ENGINEERING, 2008, : 6 - 10
  • [5] Cross-language multimedia information retrieval
    Flank, S
    [J]. 6TH APPLIED NATURAL LANGUAGE PROCESSING CONFERENCE/1ST MEETING OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE AND PROCEEDINGS OF THE ANLP-NAACL 2000 STUDENT RESEARCH WORKSHOP, 2000, : 13 - 20
  • [6] Cross-language Information Retrieval Based on Multiple Information
    Liu, Pengyuan
    Zheng, Zhijun
    Su, Qi
    [J]. 2018 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2018), 2018, : 623 - 626
  • [7] Translation Techniques in Cross-Language Information Retrieval
    Zhou, Dong
    Truran, Mark
    Brailsford, Tim
    Wade, Vincent
    Ashman, Helen
    [J]. ACM COMPUTING SURVEYS, 2012, 45 (01)
  • [8] Translation Ambiguity in Cross-Language Information Retrieval
    Sadat, Fatiha
    [J]. BUSINESS TRANSFORMATION THROUGH INNOVATION AND KNOWLEDGE MANAGEMENT: AN ACADEMIC PERSPECTIVE, VOLS 1-2, 2010, : 301 - 303
  • [9] Neural Methods for Cross-Language Information Retrieval
    Yang, Eugene
    Lawrie, Dawn
    Mayfield, James
    Nair, Suraj
    Oard, Douglas W.
    [J]. PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, : 3430 - 3431
  • [10] Arabic Cross-Language Information Retrieval: A Review
    Elayeb, Bilel
    Bounhas, Ibrahim
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2016, 15 (03)