Exploiting Wikipedia for Information Retrieval Tasks

Cited by: 6
Authors
Shapira, Bracha [1]
Ofek, Nir [1]
Makarenkov, Victor [1]
Affiliation
[1] Ben-Gurion University of the Negev, Department of Information Systems Engineering, Beer-Sheva, Israel
Keywords
Wikipedia; Information Retrieval; Machine Learning
DOI
10.1145/2766462.2767879
Chinese Library Classification (CLC) number
TP301 [Theory, methods]
Discipline classification code
081202
Abstract
Wikipedia - the online encyclopedia - has long been used as a source of information for researchers, as well as being a subject of research itself [11, 12, 23, 5, 6]. Wikipedia has been shown to be effective in recommender systems, sentiment analysis, validation, and multiple domains in information retrieval. One reason for Wikipedia's popularity among researchers and practitioners is the multiple types of information it contains, which enable practitioners to select the right "tool" for their respective tasks. In addition to its great potential, this multitude of information sources also poses a challenge: which sources of information are best suited to a specific problem, and how can different types of data be combined? This tutorial aims to provide a holistic view of Wikipedia's different features - text, links, categories, page views, editing history, etc. - and to explore the different ways they can be utilized in a machine learning framework. By presenting and contrasting the latest works that utilize Wikipedia in multiple domains, this tutorial aims to increase awareness among researchers and practitioners in these fields of the benefits of utilizing Wikipedia in their respective domains, and in particular of using multiple sources of information simultaneously.
Pages: 1137-1140
Page count: 4
Related papers
50 in total
  • [21] WikiAutoCat: Information Retrieval System for Automatic Categorization of Wikipedia Articles
    Nesma Refaei
    Elsayed E. Hemayed
    Riham Mansour
    [J]. Arabian Journal for Science and Engineering, 2018, 43 : 8095 - 8109
  • [22] Exploiting Semantic Annotations in Math Information Retrieval
    Sojka, Petr
    [J]. PROCEEDINGS OF THE FIFTH WORKSHOP ON EXPLOITING SEMANTIC ANNOTATIONS IN INFORMATION RETRIEVAL, 2012, : 15 - 16
  • [23] Framework for Logging and Exploiting the Information Retrieval Dialog
    Landwich, Paul
    Klas, Claus-Peter
    Hemmje, Matthias
    [J]. RESEARCH AND ADVANCED TECHNOLOGY FOR DIGITAL LIBRARIES, 2010, 6273 : 470 - 473
  • [24] EXPLOITING DISPARITY INFORMATION FOR STEREO IMAGE RETRIEVAL
    Chaker, A.
    Kaaniche, M.
    Benazza-Benyahia, A.
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 2993 - 2997
  • [25] Exploiting Ontology for Concept Based Information Retrieval
    Sharan, Aditi
    Joshi, Manju Lata
    Pandey, Anupama
    [J]. INFORMATION SYSTEMS FOR INDIAN LANGUAGES, 2011, 139 : 157 - 164
  • [26] Exploiting Semantic Coherence Features for Information Retrieval
    Tu, Xinhui
    Huang, Jimmy Xiangji
    Luo, Jing
    He, Tingting
    [J]. SIGIR'16: PROCEEDINGS OF THE 39TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2016, : 837 - 840
  • [27] Exploiting salient semantic analysis for information retrieval
    Luo, Jing
    Meng, Bo
    Quan, Changqin
    Tu, Xinhui
    [J]. ENTERPRISE INFORMATION SYSTEMS, 2016, 10 (09) : 959 - 969
  • [28] Exploiting Disambiguation and Discrimination in Information Retrieval Systems
    Basile, Pierpaolo
    Caputo, Annalina
    Semeraro, Giovanni
    [J]. 2009 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCES ON WEB INTELLIGENCE (WI) AND INTELLIGENT AGENT TECHNOLOGIES (IAT), VOL 3, 2009, : 539 - 542
  • [29] Exploiting Temporal Information in Retrieval of Archived Documents
    Kanhabua, Nattiya
    [J]. PROCEEDINGS 32ND ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2009, : 849 - 849
  • [30] Exploiting syntactic analysis of queries for information retrieval
    Mittendorfer, M
    Winiwarter, W
    [J]. DATA & KNOWLEDGE ENGINEERING, 2002, 42 (03) : 315 - 325