The BETTER Cross-Language Information Retrieval Datasets

被引:3
|
作者
Soboroff, Ian [1 ]
机构
[1] NIST, Gaithersburg, MD 20899 USA
关键词
information retrieval; test collection; information extraction;
D O I
10.1145/3539618.3591910
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The IARPA BETTER (Better Extraction from Text Through Enhanced Retrieval) program held three evaluations of information retrieval (IR) and information extraction (IE). For both tasks, the only training data available was in English, but systems had to perform cross-language retrieval and extraction from Arabic, Farsi, Chinese, Russian, and Korean. Pooled assessment and information extraction annotation were used to create reusable IR test collections. These datasets are freely available to researchers working in cross-language retrieval, information extraction, or the conjunction of IR and IE. This paper describes the datasets, how they were constructed, and how they might be used by researchers.
引用
收藏
页码:3047 / 3053
页数:7
相关论文
共 50 条
  • [21] Adjusting Machine Translation Datasets for Document-Level Cross-Language Information Retrieval: Methodology
    Shtekh, Gennady
    Kazakova, Polina
    Nikitinsky, Nikita
    TEXT, SPEECH, AND DIALOGUE (TSD 2018), 2018, 11107 : 84 - 94
  • [22] Using Mutual Information Technique in Cross-Language Information Retrieval
    Sari, Syandra
    Adriani, Mirna
    DIGITAL LIBRARIES: UNIVERSAL AND UBIQUITOUS ACCESS TO INFORMATION, PROCEEDINGS, 2008, 5362 : 276 - +
  • [23] Word sense disambiguation for cross-language information retrieval
    Liu, MX
    Diamond, T
    Diekema, AR
    6TH APPLIED NATURAL LANGUAGE PROCESSING CONFERENCE/1ST MEETING OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE AND PROCEEDINGS OF THE ANLP-NAACL 2000 STUDENT RESEARCH WORKSHOP, 2000, : B35 - B40
  • [24] Online Learning to Rank for Cross-Language Information Retrieval
    Rahimi, Razieh
    Shakery, Azadeh
    SIGIR'17: PROCEEDINGS OF THE 40TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2017, : 1033 - 1036
  • [25] Comparative evaluation of cross-language information retrieval systems
    Peters, Carol
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2005, 3379 LNCS : 152 - 161
  • [26] Exploiting Comparable Corpora for Cross-Language Information Retrieval
    Sadat, Fatiha
    PRICAI 2010: TRENDS IN ARTIFICIAL INTELLIGENCE, 2010, 6230 : 662 - 667
  • [27] Query Expansion for Personalized Cross-Language Information Retrieval
    Zhou, Dong
    Lawless, Seamus
    Liu, Jianxun
    Zhang, Sanrong
    Xu, Yu
    10TH INTERNATIONAL WORKSHOP ON SEMANTIC AND SOCIAL MEDIA ADAPTATION AND PERSONALIZATION SMAP 2015, 2015, : 18 - 22
  • [28] Fast document translation for cross-language information retrieval
    McCarley, JS
    Roukos, S
    MACHINE TRANSLATION AND THE INFORMATION SOUP, 1998, 1529 : 150 - 157
  • [29] Term Discrimination Value for Cross-Language Information Retrieval
    Montazeralghaem, Ali
    Rahimi, Razieh
    Allan, James
    PROCEEDINGS OF THE 2019 ACM SIGIR INTERNATIONAL CONFERENCE ON THEORY OF INFORMATION RETRIEVAL (ICTIR'19), 2019, : 136 - 139
  • [30] On-Demand Associative Cross-Language Information Retrieval
    Geraldo, Andre Pinto
    Moreira, Viviane P.
    Goncalves, Marcos A.
    STRING PROCESSING AND INFORMATION RETRIEVAL, PROCEEDINGS, 2009, 5721 : 165 - +