High-Recall Information Retrieval from Linked Big Data

Cited by: 18
Authors
Cuzzocrea, Alfredo [1, 2]
Lee, Wookey [3]
Leung, Carson K. [4]
Affiliations
[1] CNR, ICAR, Arcavacata Di Rende, CS, Italy
[2] Univ Calabria, Arcavacata Di Rende, CS, Italy
[3] Inha Univ, Inchon, South Korea
[4] Univ Manitoba, Winnipeg, MB, Canada
Keywords
Information retrieval; recall; big data; linked data; applications; search
DOI
10.1109/COMPSAC.2015.152
Chinese Library Classification number
TP31 [Computer software]
Discipline classification code
081202; 0835
Abstract
In the current era of big data, high volumes of valuable information are available in collections of documents, the web, social networks, and high varieties of linked data. To search and retrieve useful information from these linked data, users often enter queries into information retrieval (IR) systems. Among the information retrieved by these systems, some is relevant to the user queries (i.e., of interest to the users), but some is not. Moreover, some relevant information may not be retrieved by the systems. The effectiveness of these IR systems is often measured by metrics such as precision and recall. Most conventional IR systems (e.g., for web searches) aim to achieve high precision (i.e., a high percentage of the retrieved information is relevant) at the price of low recall (i.e., a low percentage of the relevant information is retrieved). However, there are real-life situations (e.g., patent searches) in which having high recall is desirable. In this paper, we present two high-recall IR systems. Results of our evaluation show the effectiveness of our systems in providing high-recall IR from linked big data.
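As an illustration of the two metrics named in the abstract, the following is a minimal sketch of how precision and recall are conventionally computed from a set of retrieved items and a set of relevant items. The function name `precision_recall` and the document IDs are hypothetical and are not taken from the paper; the paper's own systems and evaluation procedure are not reproduced here.

```python
def precision_recall(retrieved, relevant):
    """Return (precision, recall) for one query.

    precision = |retrieved AND relevant| / |retrieved|
    recall    = |retrieved AND relevant| / |relevant|
    Both are reported as 0.0 when their denominator is empty (an assumption
    made here to avoid division by zero).
    """
    retrieved, relevant = set(retrieved), set(relevant)
    hits = retrieved & relevant
    precision = len(hits) / len(retrieved) if retrieved else 0.0
    recall = len(hits) / len(relevant) if relevant else 0.0
    return precision, recall


# Hypothetical query: everything retrieved is relevant (high precision),
# but only half of the relevant documents were found (low recall).
p, r = precision_recall(["d1", "d2"], ["d1", "d2", "d3", "d4"])
print(p, r)
```

In a high-recall setting such as the patent search mentioned in the abstract, a result like this would be unsatisfactory despite the perfect precision, because half of the relevant documents were missed.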
Pages: 712-717
Number of pages: 6
Related papers
50 in total
  • [1] A System for Efficient High-Recall Retrieval
    Abualsaud, Mustafa
    Ghelani, Nimesh
    Zhang, Haotian
    Smucker, Mark D.
    Cormack, Gordon V.
    Grossman, Maura R.
    [J]. ACM/SIGIR PROCEEDINGS 2018, 2018, : 1317 - 1320
  • [2] Interactive clustering and high-recall information retrieval using language models
    Rezaeipourfarsangi, Sima
    Pei, Ningyuan
    Sherkat, Ehsan
    Milios, Evangelos
    [J]. PROCEEDINGS OF THE WORKING CONFERENCE ON ADVANCED VISUAL INTERFACES AVI 2022, 2022
  • [3] Evaluating sentence-level relevance feedback for high-recall information retrieval
    Zhang, Haotian
    Cormack, Gordon V.
    Grossman, Maura R.
    Smucker, Mark D.
    [J]. INFORMATION RETRIEVAL JOURNAL, 2020, 23 (01): : 1 - 26
  • [4] Impact of Surrogate Assessments on High-Recall Retrieval
    Roegiest, Adam
    Cormack, Gordon V.
    Clarke, Charles L. A.
    Grossman, Maura R.
    [J]. SIGIR 2015: PROCEEDINGS OF THE 38TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2015, : 555 - 564
  • [5] Evaluating sentence-level relevance feedback for high-recall information retrieval
    Haotian Zhang
    Gordon V. Cormack
    Maura R. Grossman
    Mark D. Smucker
    [J]. Information Retrieval Journal, 2020, 23 : 1 - 26
  • [6] Active High-Recall Information Retrieval from Domain-Specific Text Corpora based on Query Documents
    Chen, Sitong
    Mohd, Abidalrahman
    Nourashrafeddin, Seyednaser
    Milios, Evangelos
    [J]. PROCEEDINGS OF THE ACM SYMPOSIUM ON DOCUMENT ENGINEERING (DOCENG 2018), 2018
  • [7] Effective User Interaction for High-Recall Retrieval: Less is More
    Zhang, Haotian
    Abualsaud, Mustafa
    Ghelani, Nimesh
    Smucker, Mark D.
    Cormack, Gordon V.
    Grossman, Maura R.
    [J]. CIKM'18: PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2018, : 187 - 196
  • [8] Learning Query-Space Document Representations for High-Recall Retrieval
    Salamat, Sara
    Arabzadeh, Negar
    Zarrinkalam, Fattane
    Zihayat, Morteza
    Bagheri, Ebrahim
    [J]. ADVANCES IN INFORMATION RETRIEVAL, ECIR 2023, PT II, 2023, 13981 : 599 - 607
  • [9] An Architecture for Privacy-Preserving and Replicable High-Recall Retrieval Experiments
    Roegiest, Adam
    Cormack, Gordon V.
    [J]. SIGIR'16: PROCEEDINGS OF THE 39TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2016, : 1085 - 1088
  • [10] Relevance maximization for high-recall retrieval problem: finding all needles in a haystack
    Song, Justin JongSu
    Lee, Wookey
    [J]. JOURNAL OF SUPERCOMPUTING, 2020, 76 (10): : 7734 - 7757