Evaluating Geographical Knowledge Re-Ranking, Linguistic Processing and Query Expansion Techniques for Geographical Information Retrieval

被引:3
|
作者
Ferres, Daniel [1 ]
Rodriguez, Horacio [1 ]
机构
[1] Univ Politecn Cataluna, TALP Res Ctr, ES-08034 Barcelona, Spain
关键词
Information retrieval; Geographical gazetteers; Natural language processing; Toponym disambiguation; Query expansion; Efectiveness measures; GEOCLEF;
D O I
10.1007/978-3-319-23826-5_30
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper describes and evaluates the use of Geographical Knowledge Re-Ranking, Linguistic Processing, and Query Expansion techniques to improve Geographical Information Retrieval effectiveness. Geographical Knowledge Re-Ranking is performed with Geographical Gazetteers and conservative Toponym Disambiguation techniques that boost the ranking of the geographically relevant documents retrieved by standard state-of-the-art Information Retrieval algorithms. Linguistic Processing is performed in two ways: 1) Part-of-Speech tagging and Named Entity Recognition and Classification are applied to analyze the text collections and topics to detect toponyms, 2) Stemming (Porter's algorithm) and Lemmatization are also applied in combination with default stopwords filtering. The Query Expansion methods tested are the Bose-Einstein (Bo1) and Kullback-Leibler term weighting models. The experiments have been performed with the English Monolingual test collections of the GeoCLEF evaluations (from years 2005, 2006, 2007, and 2008) using the TF-IDF, BM25, and InL2 Information Retrieval algorithms over unprocessed texts as baselines. The experiments have been performed with each GeoCLEF test collection (25 topics per evaluation) separately and with the fusion of all these collections (100 topics). The results of evaluating separately Geographical Knowledge Re-Ranking, Linguistic Processing (lemmatization, stemming, and the combination of both), and Query Expansion with the fusion of all the topics show that all these processes improve the Mean Average Precision (MAP) and RPrecision effectiveness measures in all the experiments and show statistical significance over the baselines in most of them. The best results in MAP and RPrecision are obtained with the InL2 algorithm using the following techniques: Geographical Knowledge Re-Ranking, Lemmatization with Stemming, and Kullback-Leibler Query Expansion. Some configurations with Geographical Knowledge Re-Ranking, Linguistic Processing and Query Expansion have improved the MAP of the best official results at GeoCLEF evaluations of 2005, 2006, and 2007.
引用
下载
收藏
页码:311 / 323
页数:13
相关论文
共 34 条
  • [1] Analysis of NLP techniques of query expansion applied to the Geographical Information Retrieval task
    Perea-Ortega, Jose M.
    Garcia-Cumbreras, Miguel A.
    Alfonso Urena-Lopez, L.
    Montejo-Raez, Arturo
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2012, (49): : 41 - 48
  • [2] Re-ranking Image Retrieval in Challenging Geographical Iconographic Heritage Collections
    Blettery, Emile
    Gouet-Brunet, Valerie
    20TH INTERNATIONAL CONFERENCE ON CONTENT-BASED MULTIMEDIA INDEXING, CBMI 2023, 2023, : 97 - 103
  • [3] Local Stretch Similarity Measure and Auto Query Expansion for Re-Ranking of Image Retrieval
    Feng, Lin
    Luo, Sen
    Liu, Guochao
    Liu, Shenglan
    INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND APPLICATION ENGINEERING (CSAE), 2017, 190 : 181 - 187
  • [4] Adaptive Query Re-ranking Based on ImageGraph for Image Retrieval
    Fan, Haonan
    Hu, Hai-Miao
    Wang, Rong
    Zhang, Yugui
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 4593 - 4599
  • [5] An automatic learning for re-ranking in social information retrieval
    Haddad, Rihab
    Hlaoua, Lobna
    2020 15TH INTERNATIONAL WORKSHOP ON SEMANTIC AND SOCIAL MEDIA ADAPTATION AND PERSONALIZATION (SMAP 2020), 2020, : 50 - 55
  • [6] A multilevel searching and re-ranking framework for information retrieval
    Wen, Miao
    Huang, Xiangji
    2006 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING, 2006, : 619 - +
  • [7] Query specific re-ranking for improved cross-modal retrieval
    Mandal, Devraj
    Biswas, Soma
    PATTERN RECOGNITION LETTERS, 2017, 98 : 110 - 116
  • [8] BERT-QE: Contextualized Query Expansion for Document Re-ranking
    Zheng, Zhi
    Hui, Kai
    He, Ben
    Han, Xianpei
    Sun, Le
    Yates, Andrew
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020, : 4718 - 4728
  • [9] Re-ranking vehicle re-identification with orientation-guide query expansion
    Zhang, Xue
    Nie, Xiushan
    Sun, Ziruo
    Li, Xiaofeng
    Wang, Chuntao
    Tao, Peng
    Hussain, Sumaira
    INTERNATIONAL JOURNAL OF DISTRIBUTED SENSOR NETWORKS, 2022, 18 (03)
  • [10] Visual Re-Ranking for Multi-Aspect Information Retrieval
    Klouche, Khalil
    Ruotsalo, Tuukka
    Micallef, Luana
    Andolina, Salvatore
    Jacucci, Giulio
    CHIIR'17: PROCEEDINGS OF THE 2017 CONFERENCE HUMAN INFORMATION INTERACTION AND RETRIEVAL, 2017, : 57 - 66