Using Wikipedia and Wiktionary in Domain-Specific Information Retrieval

被引:0
|
作者
Mueller, Christof [1 ]
Gurevych, Iryna [1 ]
机构
[1] Tech Univ Darmstadt, Dept Comp Sci, Ubiquitous Knowledge Proc Lab, D-64289 Darmstadt, Germany
关键词
Information Retrieval; Semantic Relatedness; Collaborative Knowledge Bases; Cross-Language Information Retrieval;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The main objective of our experiments in the domain-specific track at CLEF 2008 is utilizing semantic knowledge from collaborative knowledge bases such as Wikipedia and Wiktionary to improve the effectiveness of information retrieval. While Wikipedia has already been used in IR, the application of Wiktionary in this task is new. We evaluate two retrieval models, i.e. SR-Text and SR-Word, based on semantic relatedness by comparing their performance to a statistical model as implemented by Lucene. We refer to Wikipedia article titles and Wiktionary word entries as concepts and map query and document terms to concept vectors which are then used to compute the document relevance. lit the bilingual task, we translate the English topics into the document language, i.e. German, by rising machine translation. For SR-Text, we alternatively perform the translation process by using cross-language links in Wikipedia, whereby the terms are directly mapped to concept vectors in the target language. The evaluation shows that the latter approach especially improves the retrieval performance in cases where the machine translation system incorrectly translates query terms.
引用
收藏
页码:219 / 226
页数:8
相关论文
共 50 条
  • [1] Domain-Specific Information Retrieval Using Recommenders
    Li, Wei
    PROCEEDINGS OF THE 34TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR'11), 2011, : 1327 - 1327
  • [2] Enhanced Information Retrieval Using Domain-Specific Recommender Models
    Li, Wei
    Ganguly, Debasis
    Jones, Gareth J. F.
    ADVANCES IN INFORMATION RETRIEVAL THEORY, 2011, 6931 : 201 - 212
  • [3] WikiDoMiner: Wikipedia Domain-Specific Miner
    Ezzini, Saad
    Abualhaija, Sallam
    Sabetzadeh, Mehrdad
    PROCEEDINGS OF THE 30TH ACM JOINT MEETING EUROPEAN SOFTWARE ENGINEERING CONFERENCE AND SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING, ESEC/FSE 2022, 2022, : 1706 - 1710
  • [4] Patent Information Retrieval An Instance of Domain-specific Search
    Lupu, Mihai
    SIGIR 2012: PROCEEDINGS OF THE 35TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2012, : 1189 - 1190
  • [5] Medical Information Retrieval An Instance of Domain-Specific Search
    Hanbury, Allan
    SIGIR 2012: PROCEEDINGS OF THE 35TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2012, : 1191 - 1192
  • [6] Domain-specific knowledge base enrichment using Wikipedia tables
    Ran, Chenwei
    Shen, Wei
    Wang, Jianyong
    Zhu, Xuan
    2015 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2015, : 349 - 358
  • [7] Toward a Semantic Granularity Model for Domain-Specific Information Retrieval
    Yan, Xin
    Lau, Raymond Y. K.
    Song, Dawei
    Li, Xue
    Ma, Jian
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2011, 29 (03)
  • [8] Domain-specific retrieval of source information in the medial temporal lobe
    Peters, Jan
    Suchan, Boris
    Koester, Odo
    Daum, Irene
    EUROPEAN JOURNAL OF NEUROSCIENCE, 2007, 26 (05) : 1333 - 1343
  • [9] Domain-specific information retrieval based on improved language model
    Kang, Kai
    Lin, Kunhui
    Zhou, Changle
    Guo, Feng
    FOURTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 2, PROCEEDINGS, 2007, : 374 - +
  • [10] Domain-Specific Automatic Scholar Profiling Based on Wikipedia
    Chuai, Ziang
    Geng, Qian
    Jin, Jian
    WWW'20: COMPANION PROCEEDINGS OF THE WEB CONFERENCE 2020, 2020, : 786 - 793