Evaluating Geographical Knowledge Re-Ranking, Linguistic Processing and Query Expansion Techniques for Geographical Information Retrieval

Cited by: 3
Authors
Ferres, Daniel [1]
Rodriguez, Horacio [1]
Affiliations
[1] Univ Politecn Cataluna, TALP Res Ctr, ES-08034 Barcelona, Spain
Keywords
Information retrieval; Geographical gazetteers; Natural language processing; Toponym disambiguation; Query expansion; Effectiveness measures; GeoCLEF
DOI
10.1007/978-3-319-23826-5_30
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
This paper describes and evaluates the use of Geographical Knowledge Re-Ranking, Linguistic Processing, and Query Expansion techniques to improve Geographical Information Retrieval effectiveness. Geographical Knowledge Re-Ranking is performed with Geographical Gazetteers and conservative Toponym Disambiguation techniques that boost the ranking of the geographically relevant documents retrieved by standard state-of-the-art Information Retrieval algorithms. Linguistic Processing is performed in two ways: 1) Part-of-Speech tagging and Named Entity Recognition and Classification are applied to the text collections and topics to detect toponyms; 2) Stemming (Porter's algorithm) and Lemmatization are applied in combination with default stopword filtering. The Query Expansion methods tested are the Bose-Einstein (Bo1) and Kullback-Leibler term weighting models. The experiments were performed with the English monolingual test collections of the GeoCLEF evaluations (2005, 2006, 2007, and 2008), using the TF-IDF, BM25, and InL2 Information Retrieval algorithms over unprocessed texts as baselines. Each GeoCLEF test collection (25 topics per evaluation) was evaluated separately, and the four collections were also evaluated together as a single fused set (100 topics). When Geographical Knowledge Re-Ranking, Linguistic Processing (lemmatization, stemming, and the combination of both), and Query Expansion are evaluated separately on the fused set of topics, each of them improves the Mean Average Precision (MAP) and R-Precision effectiveness measures in all the experiments, and the improvement over the baselines is statistically significant in most of them. The best MAP and R-Precision results are obtained with the InL2 algorithm using the following techniques: Geographical Knowledge Re-Ranking, Lemmatization with Stemming, and Kullback-Leibler Query Expansion.
Some configurations with Geographical Knowledge Re-Ranking, Linguistic Processing, and Query Expansion improve on the MAP of the best official results at the GeoCLEF evaluations of 2005, 2006, and 2007.
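The two effectiveness measures named in the abstract, MAP and R-Precision, have standard definitions that can be sketched in a few lines of Python. This is only an illustrative sketch of the textbook formulas, not the evaluation code used in the paper; the function names and the toy document and topic identifiers are hypothetical.

```python
from typing import Dict, List, Set


def average_precision(ranking: List[str], relevant: Set[str]) -> float:
    """Average of the precision values at each rank where a relevant document appears."""
    if not relevant:
        return 0.0
    hits = 0
    precision_sum = 0.0
    for rank, doc_id in enumerate(ranking, start=1):
        if doc_id in relevant:
            hits += 1
            precision_sum += hits / rank
    # Relevant documents that were never retrieved contribute zero.
    return precision_sum / len(relevant)


def r_precision(ranking: List[str], relevant: Set[str]) -> float:
    """Precision at rank R, where R is the number of relevant documents for the topic."""
    r = len(relevant)
    if r == 0:
        return 0.0
    return sum(1 for doc_id in ranking[:r] if doc_id in relevant) / r


def mean_average_precision(
    runs: Dict[str, List[str]], qrels: Dict[str, Set[str]]
) -> float:
    """Mean of the per-topic average precision over all judged topics."""
    if not qrels:
        return 0.0
    total = sum(
        average_precision(runs.get(topic, []), relevant)
        for topic, relevant in qrels.items()
    )
    return total / len(qrels)


# Toy data (hypothetical identifiers, not GeoCLEF data).
toy_runs = {"topic1": ["d1", "d2", "d3", "d4"], "topic2": ["d9", "d7"]}
toy_qrels = {"topic1": {"d1", "d3"}, "topic2": {"d7"}}
toy_map = mean_average_precision(toy_runs, toy_qrels)
```

For `topic1`, the relevant documents sit at ranks 1 and 3, so the average precision is (1/1 + 2/3) / 2, and R-Precision looks only at the top two results because the topic has two relevant documents.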
Pages: 311-323
Page count: 13