Information Retrieval in Wikipedia with Conceptual Directions

Cited by: 0
Author
Szymanski, Julian [1 ]
Affiliation
[1] Gdansk Univ Technol, Dept Comp Syst Architecture, Gdansk, Poland
Keywords
information retrieval; Wikipedia; documents clustering;
DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
The paper describes our algorithm for retrieving textual information from Wikipedia. Experiments show that the algorithm improves typical measures of retrieval quality. The improvement is achieved with a two-phase approach. In the first phase, the algorithm extends the set of content indexed by the specified keywords and thus increases Recall. In the second phase, the search results are refined through interaction with the user, who is presented with so-called Conceptual Directions, which increases Precision. A preliminary evaluation on multi-sense test phrases indicates that the algorithm is able to increase Precision within the result set without loss of Recall. We also describe an additional method for extending the result set, based on creating cluster prototypes and finding the most similar content in the text repository that has not yet been retrieved. In our demo implementation, a web portal, clustering is used to present the search results organized into thematic groups instead of a ranked list.
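The two-phase idea in the abstract can be illustrated with a toy sketch. This is not the author's implementation: the corpus, the hand-assigned clusters, the bag-of-words vectors, the cosine threshold of 0.3, and all function names are assumptions made only for illustration. It shows how extending each cluster with similar, not yet retrieved documents raises Recall, and how selecting one thematic group (standing in for a Conceptual Direction chosen by the user) then raises Precision.

```python
from collections import Counter
from math import sqrt

def precision_recall(retrieved, relevant):
    """Standard set-based Precision and Recall."""
    hits = len(set(retrieved) & set(relevant))
    precision = hits / len(retrieved) if retrieved else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall

def vectorize(text):
    """Bag-of-words term counts (an assumed, simplistic representation)."""
    return Counter(text.lower().split())

def cosine(a, b):
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0

def prototype(vectors):
    """Cluster prototype as the sum of member vectors (cosine ignores scale)."""
    total = Counter()
    for v in vectors:
        total.update(v)
    return total

def expand_cluster(cluster_ids, retrieved_ids, corpus, threshold=0.3):
    """Return not-yet-retrieved documents similar to the cluster prototype."""
    proto = prototype(vectorize(corpus[d]) for d in cluster_ids)
    return {d for d, text in corpus.items()
            if d not in retrieved_ids and cosine(proto, vectorize(text)) >= threshold}

# Hypothetical corpus for the multi-sense query "jaguar".
corpus = {
    "a": "jaguar big cat jungle predator",
    "b": "jaguar cat habitat jungle",
    "c": "panthera onca big cat jungle predator",  # relevant, but lacks the keyword
    "d": "jaguar car engine british",
    "e": "british luxury car engine",
}
relevant = {"a", "b", "c"}  # the user means the animal

# Keyword search: only documents indexed by the query term.
retrieved = {d for d, text in corpus.items() if "jaguar" in text.split()}

# Clusters are assigned by hand here; a real system would compute them.
clusters = {"animal": {"a", "b"}, "car": {"d"}}

# Phase 1: extend every cluster with similar, not retrieved content.
expanded = {name: ids | expand_cluster(ids, retrieved, corpus)
            for name, ids in clusters.items()}
phase1 = set().union(*expanded.values())

# Phase 2: the user picks one thematic group, which narrows the results.
phase2 = expanded["animal"]

for label, result in [("keyword", retrieved), ("phase 1", phase1), ("phase 2", phase2)]:
    print(label, sorted(result), precision_recall(result, relevant))
```

In this toy setting the first phase trades some Precision for full Recall, and the second phase restores Precision without losing Recall, which mirrors the behavior the abstract reports at a much smaller scale.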
Pages: 391-402
Page count: 12
Related papers
50 in total
  • [1] Exploiting Wikipedia for Information Retrieval Tasks
    Shapira, Bracha
    Ofek, Nir
    Makarenkov, Victor
    [J]. SIGIR 2015: PROCEEDINGS OF THE 38TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2015, : 1137 - 1140
  • [2] Conceptual information retrieval
    dos Santos, EL
    Hasegawa, FM
    Avila, BC
    Enembreck, F
    [J]. ADVANCED DISTRIBUTED SYSTEMS, 2004, 3061 : 137 - 144
  • [3] WikiMirs: A Mathematical Information Retrieval System for Wikipedia
    Hu, Xuan
    Gao, Liangcai
    Lin, Xiaoyan
    Tang, Zhi
    Lin, Xiaofan
    Baker, Josef B.
    [J]. JCDL'13: PROCEEDINGS OF THE 13TH ACM/IEEE-CS JOINT CONFERENCE ON DIGITAL LIBRARIES, 2013, : 11 - 20
  • [4] A Wikipedia-based approach to conceptual indexing and retrieval of documents
    Chahine, Carlo Abi
    Chaignaud, Nathalie
    Kotowicz, Jean-Philippe
    Pecuchet, Jean-Pierre
    [J]. INTERNATIONAL JOURNAL OF KNOWLEDGE AND LEARNING, 2014, 9 (1-2) : 87 - 103
  • [5] Conceptual clustering in information retrieval
    Bhatia, SK
    Deogun, JS
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 1998, 28 (03) : 427 - 436
  • [6] Conceptual guidance in information retrieval
    Seol, YH
    Johnson, SB
    Cimino, JJ
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2001, : 1026 - 1026
  • [7] Research Area Classification using Wikipedia and Information Retrieval
    Al-Ballaa, Hailah
    Al-Dossari, Hmood
    Mirza, Abdulrahman
    [J]. PROCEEDINGS OF THE 1ST INTERNATIONAL CONFERENCE ON INTERNET OF THINGS AND MACHINE LEARNING (IML'17), 2017,
  • [8] Exploiting Wikipedia in integrating semantic annotation with information retrieval
    Fernandez-Garcia, Norberto
    Blazquez-del-Toro, Jose M.
    Sanchez-Fernandez, Luis
    Luque, Vicente
    [J]. ADVANCES IN WEB INTELLIGENCE AND DATA MINING, 2006, 23 : 61 - +
  • [9] Enhancing document modeling for information retrieval using wikipedia
    Luo, Jing
    Meng, Bo
    Tu, Xinhui
    [J]. International Journal of Advancements in Computing Technology, 2012, 4 (23) : 266 - 273
  • [10] Issues and directions in visual information retrieval
    Del Bimbo, A
    [J]. 15TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 4, PROCEEDINGS: APPLICATIONS, ROBOTICS SYSTEMS AND ARCHITECTURES, 2000, : 31 - 38