Leveraging Document-Level and Query-Level Passage Cumulative Gain for Document Ranking

Cited by: 0
Authors
Zhi-Jing Wu
Yi-Qun Liu
Jia-Xin Mao
Min Zhang
Shao-Ping Ma
Affiliations
[1] Tsinghua University, Department of Computer Science and Technology
[2] Tsinghua University, Beijing National Research Center for Information Science and Technology
[3] Renmin University of China, Gaoling School of Artificial Intelligence
Keywords
document ranking; neural network; passage cumulative gain
DOI
Not available
Chinese Library Classification number
Subject classification number
Abstract
Document ranking is one of the most studied yet most challenging problems in information retrieval (IR). A growing number of studies address it through fine-grained document modeling. However, most of them focus on context-independent passage-level relevance signals and ignore context information. In this paper, we investigate how information gain accumulates across passages and propose the context-aware Passage Cumulative Gain (PCG). The fine-grained PCG avoids the need to split documents into independent passages. We investigate PCG patterns at the document level (DPCG) and the query level (QPCG). Based on these patterns, we propose a BERT-based sequential model called the Passage-level Cumulative Gain Model (PCGM) and show that PCGM can effectively predict PCG sequences. Finally, we apply PCGM to the document ranking task in two ways. The first leverages DPCG sequences to estimate the gain of an individual document; experimental results on two public ad hoc retrieval datasets show that PCGM outperforms most existing ranking models. The second considers cross-document effects and leverages QPCG sequences to estimate marginal relevance; experimental results show that the predictions are highly consistent with users’ preferences. We believe that this work contributes to improving ranking performance and to providing more explainability for document ranking.
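To make the two uses of cumulative gain in the abstract concrete, the following is a minimal sketch and not the authors' PCGM (which is BERT-based and is not reproduced here). It assumes that a PCG sequence is a list of integer gain levels, one per passage, that never decreases, and that a document's gain can be read off as the last value of its DPCG sequence. The function names, the gain scale, and the example sequences are all hypothetical.

```python
from typing import Dict, List


def is_non_decreasing(seq: List[int]) -> bool:
    """Assumption: cumulative gain never drops as more passages are read."""
    return all(a <= b for a, b in zip(seq, seq[1:]))


def document_gain(dpcg: List[int]) -> int:
    """Assumed readout: the gain after the last passage (0 if there are none)."""
    return dpcg[-1] if dpcg else 0


def rank_by_dpcg(dpcg_by_doc: Dict[str, List[int]]) -> List[str]:
    """Order document ids by their final cumulative gain, highest first."""
    return sorted(
        dpcg_by_doc,
        key=lambda doc_id: document_gain(dpcg_by_doc[doc_id]),
        reverse=True,
    )


def marginal_gains(qpcg: List[int]) -> List[int]:
    """Per-step increase of a cumulative sequence that carries across documents."""
    gains: List[int] = []
    previous = 0
    for value in qpcg:
        gains.append(value - previous)
        previous = value
    return gains


# Made-up sequences for illustration only; they are not data from the paper.
dpcg_by_doc = {
    "doc_a": [0, 1, 1, 3],
    "doc_b": [0, 0, 1, 1],
    "doc_c": [2, 2, 2, 2],
}
ranking = rank_by_dpcg(dpcg_by_doc)

# A hypothetical query-level sequence: gain keeps accumulating across documents,
# so a step with zero marginal gain adds nothing new for the user.
query_level = [1, 1, 3, 3]
steps = marginal_gains(query_level)
```

In this toy setting, a step whose marginal gain is zero corresponds to content that adds nothing beyond what was already read, which is the intuition behind using a query-level sequence to account for cross-document effects.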
Pages: 814 - 838
Number of pages: 24
Related papers
50 in total
  • [1] Leveraging Document-Level and Query-Level Passage Cumulative Gain for Document Ranking
    Wu, Zhi-Jing
    Liu, Yi-Qun
    Mao, Jia-Xin
    Zhang, Min
    Ma, Shao-Ping
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2022, 37 (04) : 814 - 838
  • [2] Leveraging Passage-level Cumulative Gain for Document Ranking
    Wu, Zhijing
    Mao, Jiaxin
    Liu, Yiqun
    Zhan, Jingtao
    Zheng, Yukun
    Zhang, Min
    Ma, Shaoping
    WEB CONFERENCE 2020: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2020), 2020, : 2421 - 2431
  • [3] Recurrent event query decoder for document-level event extraction
    Kong, Jing
    Yang, Zhouwang
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 139
  • [4] Leveraging Document-Level Label Consistency for Named Entity Recognition
    Gui, Tao
    Ye, Jiacheng
    Zhang, Qi
    Zhou, Yaqian
    Gong, Yeyun
    Huang, Xuanjing
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 3976 - 3982
  • [5] Investigating Passage-level Relevance and Its Role in Document-level Relevance Judgment
    Wu, Zhijing
    Mao, Jiaxin
    Liu, Yiqun
    Zhang, Min
    Ma, Shaoping
    PROCEEDINGS OF THE 42ND INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '19), 2019, : 605 - 614
  • [6] Query-Level Stability of Ranking SVM for Replacement Case
    Gao, Yun
    Gao, Wei
    Zhang, Yungang
    CEIS 2011, 2011, 15
  • [7] Survey on Document-Level Relation Extraction
    Zhou Y.
    Huang H.
    Liu H.
    Hao Z.
    Huanan Ligong Daxue Xuebao/Journal of South China University of Technology (Natural Science), 2022, 50 (04) : 10 - 25
  • [8] Document-Level Relation Extraction with Reconstruction
    Xu, Wang
    Chen, Kehai
    Zhao, Tiejun
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 14167 - 14175
  • [9] Document-Level Planning for Text Simplification
    Cripwell, Liam
    Legrand, Joel
    Gardent, Claire
    17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 993 - 1006
  • [10] Generalization Bounds of Ranking via Query-Level Stability I
    He, Xiangguang
    Gao, Wei
    Jia, Zhiyang
    INFORMATION AND MANAGEMENT ENGINEERING, PT VI, 2011, 236 : 188 - +