Exploiting Wikipedia for Information Retrieval Tasks

被引:6
|
作者
Shapira, Bracha [1 ]
Ofek, Nir [1 ]
Makarenkov, Victor [1 ]
机构
[1] Ben Gurion Univ Negev, Dept Informat Syst Engn, Beer Sheva, Israel
关键词
Wikipedia; Information Retrieval; Machine Learning;
D O I
10.1145/2766462.2767879
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Wikipedia - the online encyclopedia - has long been used as a source of information for researchers, as well as being a subject of research itself [11, 12, 23, 5, 6]. Wikipedia has been shown to be effective in recommender systems, sentiment analysis, validation and multiple domains in information retrieval. One of the reasons for Wikipedia's popularity among researchers and practitioners is the multiple types of information it contains, which enables practitioners to select the right "tool" for their respective tasks. In addition to its great potential, this multitude of information sources also poses a challenge: which sources of information are best suited for a specific problem and how can different types of data be combined? This tutorial aims to provide a holistic view of Wikipedia's different features - text, links, categories, page views, editing history etc. - and explore the different ways they can be utilized in a machine learning framework. By presenting and contrasting the latest works that utilize Wikipedia in multiple domains, this tutorial aims to increase the awareness among researchers and practitioners in these fields to the benefits of utilizing Wikipedia in their respective domains, in particular to the use of multiple sources of information simultaneously.
引用
收藏
页码:1137 / 1140
页数:4
相关论文
共 50 条
  • [1] Exploiting Wikipedia in integrating semantic annotation with information retrieval
    Fernandez-Garcia, Norberto
    Blazquez-del-Toro, Jose M.
    Sanchez-Fernandez, Luis
    Luque, Vicente
    [J]. ADVANCES IN WEB INTELLIGENCE AND DATA MINING, 2006, 23 : 61 - +
  • [2] Exploiting Wikipedia for cross-lingual and multilingual information retrieval
    Sorg, P.
    Cimiano, P.
    [J]. DATA & KNOWLEDGE ENGINEERING, 2012, 74 : 26 - 45
  • [3] Exploiting Information Extraction Annotations for Document Retrieval in Distillation Tasks
    Hakkani-Tuer, Dilek
    Tur, Gokhan
    Levit, Michael
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2660 - +
  • [4] Exploiting Wikipedia API for Hindi-English Cross-Language Information Retrieval
    Sharma, Vijay Kumar
    Mittal, Namita
    [J]. TWELFTH INTERNATIONAL CONFERENCE ON COMMUNICATION NETWORKS, ICCN 2016 / TWELFTH INTERNATIONAL CONFERENCE ON DATA MINING AND WAREHOUSING, ICDMW 2016 / TWELFTH INTERNATIONAL CONFERENCE ON IMAGE AND SIGNAL PROCESSING, ICISP 2016, 2016, 89 : 434 - 440
  • [5] Information Retrieval in Wikipedia with Conceptual Directions
    Szymanski, Julian
    [J]. DISTRIBUTED COMPUTING AND INTERNET TECHNOLOGY, ICDCIT 2015, 2015, 8956 : 391 - 402
  • [6] WikiMirs: A Mathematical Information Retrieval System for Wikipedia
    Hu, Xuan
    Gao, Liangcai
    Lin, Xiaoyan
    Tang, Zhi
    Lin, Xiaofan
    Baker, Josef B.
    [J]. JCDL'13: PROCEEDINGS OF THE 13TH ACM/IEEE-CS JOINT CONFERENCE ON DIGITAL LIBRARIES, 2013, : 11 - 20
  • [7] Information retrieval models in the context of retrieval tasks
    O. L. Golitsyna
    N. V. Maksimov
    [J]. Automatic Documentation and Mathematical Linguistics, 2011, 45 (1) : 20 - 32
  • [8] Information Retrieval Models in the Context of Retrieval Tasks
    Golitsyna, O. L.
    Maksimov, N. V.
    [J]. AUTOMATIC DOCUMENTATION AND MATHEMATICAL LINGUISTICS, 2011, 45 (01) : 20 - 32
  • [9] Research Area Classification using Wikipedia and Information Retrieval
    Al-Ballaa, Hailah
    Al-Dossari, Hmood
    Mirza, Abdulrahman
    [J]. PROCEEDINGS OF THE 1ST INTERNATIONAL CONFERENCE ON INTERNET OF THINGS AND MACHINE LEARNING (IML'17), 2017,
  • [10] Enhancing document modeling for information retrieval using wikipedia
    Luo, Jing
    Meng, Bo
    Tu, Xinhui
    [J]. International Journal of Advancements in Computing Technology, 2012, 4 (23) : 266 - 273