Query expansion based on clustering and personalized information retrieval

被引:7
|
作者
Khalifi, Hamid [1 ]
Cherif, Walid [2 ]
El Qadi, Abderrahim [3 ]
Ghanou, Youssef [1 ]
机构
[1] Moulay Ismail Univ, High Sch Technol, TIM Team, Meknes, Morocco
[2] Natl Inst Stat & Appl Econ, Lab SI2M, Rabat, Morocco
[3] Mohammed V Univ, High Sch Technol, Rabat, Morocco
关键词
Information retrieval; Personalized information retrieval; Automatic query completion; Clustering; Performance evaluation; Support vector machines; MODELS;
D O I
10.1007/s13748-019-00178-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Information retrieval systems are used to describe a variety of processes involving the delivery of information to people who need it. Although several mathematical approaches have been studied in order to formalize the main components of an information retrieval system: queries representation, information items representations and the retrieval process, such systems still face many difficulties to extract relevant information for users especially when the processed data are texts. This is due to the complex nature of text databases. Generally, an information retrieval system reformulates queries according to associations among information items before matching them to dataset items. In this sense, semantic relationships or machine learning techniques can be applied to refine the returned results. This paper presents a formal model to organize data, and a new search algorithm to browse it. It incorporates a natural language preprocessing stage, a statistical representation of short documents and queries and a machine learning model to select relevant results. We propose later in this paper two further optimizations that proved quite interesting and returned significantly satisfying results on two datasets in a reasonable computation time. The first optimization concerns queries expansions, while the second one concerns dataset restructuration. Thus, we formally evaluate the impact of each optimization by computing the performance of the information retrieval system with and without it; the highest reached recall and precision were 96.2% and 99.2%, respectively.
引用
收藏
页码:241 / 251
页数:11
相关论文
共 50 条
  • [21] Query expansion techniques for information retrieval: A survey
    Azad, Hiteshwar Kumar
    Deepak, Akshay
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2019, 56 (05) : 1698 - 1735
  • [22] Query expansion for intelligent information retrieval on Internet
    Lim, JH
    Seung, HW
    Hwang, J
    Kim, YC
    Kim, HN
    [J]. 1997 INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS, PROCEEDINGS, 1997, : 656 - 662
  • [23] A new approach to query expansion in information retrieval
    李卫疆
    [J]. High Technology Letters, 2008, 14 (01) : 77 - 80
  • [24] An information retrieval system based on automatic query expansion and Hopfield network
    Sheng, XW
    Jiang, MH
    [J]. PROCEEDINGS OF 2003 INTERNATIONAL CONFERENCE ON NEURAL NETWORKS & SIGNAL PROCESSING, PROCEEDINGS, VOLS 1 AND 2, 2003, : 1624 - 1627
  • [25] An information retrieval system based on automatic query expansion and hopfield network
    Wang, Lin
    Jiang, Minghu
    Sheng, Xiaowei
    Lu, Yinghua
    [J]. DYNAMICS OF CONTINUOUS DISCRETE AND IMPULSIVE SYSTEMS-SERIES B-APPLICATIONS & ALGORITHMS, 2006, 13E : 1519 - 1524
  • [26] Design and implementation of ontology-based query expansion for information retrieval
    Fang Wu
    Guoshi Wu
    Xangling Fu
    [J]. RESEARCH AND PRACTICAL ISSUES OF ENTERPRISE INFORMATION SYSTEMS II, VOL 1, 2008, 254 : 293 - +
  • [27] An improved VSM based information retrieval system and fuzzy query expansion
    Wu, JN
    Tanioka, H
    Wang, SZ
    Pan, DH
    Yamamoto, K
    Wang, ZT
    [J]. FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, PT 1, PROCEEDINGS, 2005, 3613 : 537 - 546
  • [28] RESEARCH ON THE WEB INFORMATION RETRIEVAL MODEL BASED ON METADATA AND QUERY EXPANSION
    Hu, Changxia
    Liu, Xiaoxing
    Jin, Weiying
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON NETWORK INFRASTRUCTURE AND DIGITAL CONTENT, PROCEEDINGS, 2009, : 384 - +
  • [29] Query Expansion based on Word Embeddings and Ontologies for Efficient Information Retrieval
    Rastogi, Namrata
    Verma, Parul
    Kumar, Pankaj
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (11) : 367 - 373
  • [30] Design and implementation of ontology-based query expansion for information retrieval
    School of Software Engineering, Beijing University of Posts and Telecommunications, Beijing
    100879, China
    不详
    061001, China
    [J]. IFIP Advances in Information and Communication Technology, 2007, (293-298)