Xu: An Automated Query Expansion and Optimization Tool

被引:6
|
作者
Gallant, Morgan [1 ]
Isah, Haruna [1 ]
Zulkernine, Farhana [1 ]
Khan, Shahzad [2 ]
机构
[1] Queens Univ, Sch Comp, Kingston, ON, Canada
[2] Gnowit Inc, Ottawa, ON, Canada
关键词
boolean query; datamuse api; high-dimensional clustering; information retrieval; search engine; user query; MEAN SHIFT;
D O I
10.1109/COMPSAC.2019.00070
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The exponential growth of information on the Internet is a big challenge for information retrieval systems towards generating relevant results. Novel approaches are required to reformat or expand user queries to generate a satisfactory response and increase recall and precision. Query expansion (QE) is a technique to broaden users' queries by introducing additional tokens or phrases based on some semantic similarity metrics. The tradeoff is the added computational complexity to find semantically similar words and a possible increase in noise in information retrieval. Despite several research efforts on this topic, QE has not yet been explored enough and more work is needed on similarity matching and composition of query terms with an objective to retrieve a small set of most appropriate responses. QE should be scalable, fast, and robust in handling complex queries with a good response time and noise ceiling. In this paper, we propose Xu, an automated QE technique, using high dimensional clustering of word vectors and Datamuse API, an open source query engine to find semantically similar words. We implemented Xu as a command line tool and evaluated its performances using datasets containing news articles and human-generated QEs. The evaluation results show that Xu was better than Datamuse by achieving about 88% accuracy with reference to the human-generated QE.
引用
收藏
页码:443 / 452
页数:10
相关论文
共 50 条
  • [41] A new approach to query expansion
    Li, JF
    Guo, MZ
    Tian, SH
    [J]. PROCEEDINGS OF 2005 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-9, 2005, : 2302 - 2306
  • [42] Semantic approaches for query expansion
    Sharma, Dilip Kumar
    Pamula, Rajendra
    Chauhan, D. S.
    [J]. EVOLUTIONARY INTELLIGENCE, 2021, 14 (02) : 1101 - 1116
  • [43] Query Expansion in Enterprise Search
    Domke, Eric M.
    Leidig, Jonathan P.
    Schymik, Gregory
    Wolffe, Greg
    [J]. PROCEEDINGS OF THE ACM SYMPOSIUM ON DOCUMENT ENGINEERING (DOCENG 2018), 2018,
  • [44] KNN dynamic expansion query based on artificial intelligence and popular music curriculum optimization
    Yan, Zhang
    [J]. SOFT COMPUTING, 2023,
  • [45] A New Modeling of Query Expansion Using an Effective Bat-Inspired Optimization Algorithm
    Khennak, Ilyes
    Drias, Habiba
    Kechid, Samir
    [J]. IFAC PAPERSONLINE, 2016, 49 (12): : 1791 - 1796
  • [46] Web query expansion by WordNet
    Gong, ZG
    Cheang, CW
    Hou, UL
    [J]. DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2005, 3588 : 166 - 175
  • [47] A Framework for Automatic Query Expansion
    Imran, Hazra
    Sharan, Aditi
    [J]. WEB INFORMATION SYSTEMS AND MINING, 2010, 6318 : 386 - +
  • [48] Query Expansion Associated with Clustering
    Yin, Jie
    Xu, Weiran
    [J]. MACHINERY ELECTRONICS AND CONTROL ENGINEERING III, 2014, 441 : 647 - 650
  • [49] Query optimization-oriented lateral expansion method of distributed geological borehole database
    Luo, Qingjia
    [J]. OPEN GEOSCIENCES, 2023, 15 (01)
  • [50] Techniques for efficient query expansion
    Billerbeck, B
    Zobel, J
    [J]. STRING PROCESSING AND INFORMATION RETRIEVAL, PROCEEDINGS, 2004, 3246 : 30 - 42