Xu: An Automated Query Expansion and Optimization Tool

被引:6
|
作者
Gallant, Morgan [1 ]
Isah, Haruna [1 ]
Zulkernine, Farhana [1 ]
Khan, Shahzad [2 ]
机构
[1] Queens Univ, Sch Comp, Kingston, ON, Canada
[2] Gnowit Inc, Ottawa, ON, Canada
关键词
boolean query; datamuse api; high-dimensional clustering; information retrieval; search engine; user query; MEAN SHIFT;
D O I
10.1109/COMPSAC.2019.00070
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The exponential growth of information on the Internet is a big challenge for information retrieval systems towards generating relevant results. Novel approaches are required to reformat or expand user queries to generate a satisfactory response and increase recall and precision. Query expansion (QE) is a technique to broaden users' queries by introducing additional tokens or phrases based on some semantic similarity metrics. The tradeoff is the added computational complexity to find semantically similar words and a possible increase in noise in information retrieval. Despite several research efforts on this topic, QE has not yet been explored enough and more work is needed on similarity matching and composition of query terms with an objective to retrieve a small set of most appropriate responses. QE should be scalable, fast, and robust in handling complex queries with a good response time and noise ceiling. In this paper, we propose Xu, an automated QE technique, using high dimensional clustering of word vectors and Datamuse API, an open source query engine to find semantically similar words. We implemented Xu as a command line tool and evaluated its performances using datasets containing news articles and human-generated QEs. The evaluation results show that Xu was better than Datamuse by achieving about 88% accuracy with reference to the human-generated QE.
引用
收藏
页码:443 / 452
页数:10
相关论文
共 50 条
  • [1] Google Query Optimization Tool
    Aliyu, Farouq Muhammad
    Mabu, Audu Musa
    [J]. PROCEEDINGS OF THE 2014 IEEE 6TH INTERNATIONAL CONFERENCE ON ADAPTIVE SCIENCE AND TECHNOLOGY (ICAST 2014), 2014,
  • [2] Optimization of query expansion source in formal concept analysis
    Wang, Chang
    Du, Yajun
    Zhang, Peiying
    [J]. Journal of Convergence Information Technology, 2010, 5 (07)
  • [3] Optimization of some factors affecting the performance of query expansion
    Chung, YM
    Lee, JY
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2004, 40 (06) : 891 - 917
  • [4] Combining fields for query expansion and adaptive query expansion
    He, Ben
    Ounis, Iadh
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2007, 43 (05) : 1294 - 1307
  • [5] Minerva: Automated Hardware Optimization Tool
    Farahmand, Farnoud
    Ferozpuri, Ahmed
    Diehl, William
    Gaj, Kris
    [J]. 2017 INTERNATIONAL CONFERENCE ON RECONFIGURABLE COMPUTING AND FPGAS (RECONFIG), 2017,
  • [6] Guided automated learning for query workload re-optimization
    Damasio, Guilherme
    Corvinelli, Vincent
    Godfrey, Parke
    Mierzejewski, Piotr
    Mihaylov, Alexandar
    Szlichta, Jaroslaw
    Zuzarte, Calisto
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2019, 12 (12): : 2010 - 2021
  • [7] Query expansion
    Efthimiadis, EN
    [J]. ANNUAL REVIEW OF INFORMATION SCIENCE AND TECHNOLOGY, 1996, 31 : 121 - 187
  • [8] QMapper: A Tool for SQL Optimization on Hive Using Query Rewriting
    Xu, Yingzhong
    Hu, Songlin
    [J]. PROCEEDINGS OF THE 22ND INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'13 COMPANION), 2013, : 211 - 212
  • [9] ODB-QOPTIMIZER: A tool for semantic query optimization in OODB
    Bergamaschi, S
    Beneventano, D
    Vincini, M
    Sartori, C
    [J]. 13TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING - PROCEEDINGS, 1997, : 578 - 578
  • [10] IPO: a tool for automated optimization of XCMS parameters
    Libiseller, Gunnar
    Dvorzak, Michaela
    Kleb, Ulrike
    Gander, Edgar
    Eisenberg, Tobias
    Madeo, Frank
    Neumann, Steffen
    Trausinger, Gert
    Sinner, Frank
    Pieber, Thomas
    Magnes, Christoph
    [J]. BMC BIOINFORMATICS, 2015, 16