Using Wikipedia and Wiktionary in Domain-Specific Information Retrieval

被引:0
|
作者
Mueller, Christof [1 ]
Gurevych, Iryna [1 ]
机构
[1] Tech Univ Darmstadt, Dept Comp Sci, Ubiquitous Knowledge Proc Lab, D-64289 Darmstadt, Germany
关键词
Information Retrieval; Semantic Relatedness; Collaborative Knowledge Bases; Cross-Language Information Retrieval;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The main objective of our experiments in the domain-specific track at CLEF 2008 is utilizing semantic knowledge from collaborative knowledge bases such as Wikipedia and Wiktionary to improve the effectiveness of information retrieval. While Wikipedia has already been used in IR, the application of Wiktionary in this task is new. We evaluate two retrieval models, i.e. SR-Text and SR-Word, based on semantic relatedness by comparing their performance to a statistical model as implemented by Lucene. We refer to Wikipedia article titles and Wiktionary word entries as concepts and map query and document terms to concept vectors which are then used to compute the document relevance. lit the bilingual task, we translate the English topics into the document language, i.e. German, by rising machine translation. For SR-Text, we alternatively perform the translation process by using cross-language links in Wikipedia, whereby the terms are directly mapped to concept vectors in the target language. The evaluation shows that the latter approach especially improves the retrieval performance in cases where the machine translation system incorrectly translates query terms.
引用
收藏
页码:219 / 226
页数:8
相关论文
共 50 条
  • [21] Transactions in Domain-Specific Information Systems
    Zacek, Jaroslav
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON NUMERICAL ANALYSIS AND APPLIED MATHEMATICS 2016 (ICNAAM-2016), 2017, 1863
  • [22] Domain-specific information extraction structures
    Lyons, S
    Smith, D
    13TH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2002, : 80 - 84
  • [23] Segmentation Fusion for Building Detection Using Domain-Specific Information
    Karadag, Ozge Oztimur
    Senaras, Caglar
    Vural, Fatos T. Yarman
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2015, 8 (07) : 3305 - 3315
  • [24] Graph-Based Domain-Specific Semantic Relatedness from Wikipedia
    Sajadi, Armin
    ADVANCES IN ARTIFICIAL INTELLIGENCE, CANADIAN AI 2014, 2014, 8436 : 381 - 386
  • [25] A concept-based information retrieval approach for engineering domain-specific technical documents
    Lin, Hsien-Tang
    Chi, Nai-Wen
    Hsieh, Shang-Hsien
    ADVANCED ENGINEERING INFORMATICS, 2012, 26 (02) : 349 - 360
  • [26] Domain-specific data mining for residents' transit pattern retrieval from incomplete information
    Liu, Yongxin
    Li, Jianqiang
    Ming, Zhong
    Song, Houbing
    Weng, Xiaoxiong
    Wang, Jian
    JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2019, 134 : 62 - 71
  • [27] A Sequential Latent Topic-Based Readability Model for Domain-Specific Information Retrieval
    Zhang, Wenya
    Song, Dawei
    Zhang, Peng
    Zhao, Xiaozhao
    Hou, Yuexian
    INFORMATION RETRIEVAL TECHNOLOGY, AIRS 2015, 2015, 9460 : 241 - 252
  • [28] A Social Media Tool for Domain-Specific Information Retrieval - A Case Study in Human Trafficking
    Grine, Tito
    Lopes, Carla Teixeira
    MACHINE LEARNING AND PRINCIPLES AND PRACTICE OF KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2022, PT I, 2023, 1752 : 23 - 38
  • [29] Automatic Domain-specific Corpora Generation from Wikipedia - A Replication Study
    Ruwanpura, Seniru
    Morash, Cale
    Khan, Momin Ali
    Ahmad, Adnan
    Ginde, Gouri
    2023 IEEE 31ST INTERNATIONAL REQUIREMENTS ENGINEERING CONFERENCE WORKSHOPS, REW, 2023, : 85 - 94
  • [30] A novel approach for building Domain-specific Lexical Repository with Chinese Wikipedia
    Ruan, Zhijian
    Li, Xiu
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON EDUCATION, MANAGEMENT AND COMPUTING TECHNOLOGY, 2015, 30 : 1097 - 1104