The effect of citation analysis on query expansion for patent retrieval

被引:13
|
作者
Mahdabi, Parvaz [1 ]
Crestani, Fabio [1 ]
机构
[1] Univ Lugano, Fac Informat, Lugano, Switzerland
来源
INFORMATION RETRIEVAL | 2014年 / 17卷 / 5-6期
关键词
Citation analysis; Patent retrieval; Query expansion;
D O I
10.1007/s10791-013-9232-5
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Patent prior art search is a type of search in the patent domain where documents are searched for that describe the work previously carried out related to a patent application. The goal of this search is to check whether the idea in the patent application is novel. Vocabulary mismatch is one of the main problems of patent retrieval which results in low retrievability of similar documents for a given patent application. In this paper we show how the term distribution of the cited documents in an initially retrieved ranked list can be used to address the vocabulary mismatch. We propose a method for query modeling estimation which utilizes the citation links in a pseudo relevance feedback set. We first build a topic dependent citation graph, starting from the initially retrieved set of feedback documents and utilizing citation links of feedback documents to expand the set. We identify the important documents in the topic dependent citation graph using a citation analysis measure. We then use the term distribution of the documents in the citation graph to estimate a query model by identifying the distinguishing terms and their respective weights. We then use these terms to expand our original query. We use CLEF-IP 2011 collection to evaluate the effectiveness of our query modeling approach for prior art search. We also study the influence of different parameters on the performance of the proposed method. The experimental results demonstrate that the proposed approach significantly improves the recall over a state-of-the-art baseline which uses the link-based structure of the citation graph but not the term distribution of the cited documents.
引用
收藏
页码:412 / 429
页数:18
相关论文
共 50 条
  • [31] Query Expansion for Effective Geographic Information Retrieval
    Pu, Qiang
    He, Daqing
    Li, Qi
    [J]. EVALUATING SYSTEMS FOR MULTILINGUAL AND MULTIMODAL INFORMATION ACCESS, 2009, 5706 : 843 - +
  • [32] Query expansion techniques for information retrieval: A survey
    Azad, Hiteshwar Kumar
    Deepak, Akshay
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2019, 56 (05) : 1698 - 1735
  • [33] An information retrieval model based on query expansion
    Huang, Mingxuan
    Zhang, Shichao
    Yan, Xiaowei
    Huang, Faliang
    [J]. RECENT ADVANCE OF CHINESE COMPUTING TECHNOLOGIES, 2007, : 217 - 221
  • [34] A new approach to query expansion in information retrieval
    李卫疆
    [J]. High Technology Letters, 2008, 14 (01) : 77 - 80
  • [35] Query Expansion for Effective Retrieval from Microblog
    Patel, Sindur
    Bhatt, Nirav
    Shah, Chandni
    [J]. 2017 INTERNATIONAL CONFERENCE ON COMPUTING METHODOLOGIES AND COMMUNICATION (ICCMC), 2017, : 394 - 397
  • [36] Patent Citation Analysis With Google
    Kousha, Kayvan
    Thelwall, Mike
    [J]. JOURNAL OF THE ASSOCIATION FOR INFORMATION SCIENCE AND TECHNOLOGY, 2017, 68 (01) : 48 - 61
  • [37] Patent citation network analysis
    Lee, Minjung
    Kim, Yongdai
    Jang, Woncheol
    [J]. KOREAN JOURNAL OF APPLIED STATISTICS, 2016, 29 (04) : 613 - 625
  • [38] RETRACTED ARTICLE: Query expansion for patent retrieval using a modified stellar-mass black hole optimization
    G. David Raj
    Saswati Mukherjee
    G. V. Uma
    R. L. Jasmine
    R. Balamurugan
    [J]. Journal of Ambient Intelligence and Humanized Computing, 2021, 12 : 4841 - 4853
  • [39] Retraction Note to: Query expansion for patent retrieval using a modified stellar-mass black hole optimization
    G. David Raj
    Saswati Mukherjee
    G. V. Uma
    R. L. Jasmine
    R. Balamurugan
    [J]. Journal of Ambient Intelligence and Humanized Computing, 2023, 14 (Suppl 1) : 347 - 347
  • [40] A query expansion framework in image retrieval domain based on local and global analysis
    Rahman, M. M.
    Antani, S. K.
    Thoma, G. R.
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2011, 47 (05) : 676 - 691