Using Wikipedia and Wiktionary in Domain-Specific Information Retrieval

Cited by: 0
Authors
Mueller, Christof [1]
Gurevych, Iryna [1]
Affiliations
[1] Tech Univ Darmstadt, Dept Comp Sci, Ubiquitous Knowledge Proc Lab, D-64289 Darmstadt, Germany
Keywords
Information Retrieval; Semantic Relatedness; Collaborative Knowledge Bases; Cross-Language Information Retrieval;
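DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract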
The main objective of our experiments in the domain-specific track at CLEF 2008 is to utilize semantic knowledge from collaborative knowledge bases such as Wikipedia and Wiktionary to improve the effectiveness of information retrieval. While Wikipedia has already been used in IR, the application of Wiktionary to this task is new. We evaluate two retrieval models based on semantic relatedness, SR-Text and SR-Word, by comparing their performance to a statistical model as implemented by Lucene. We refer to Wikipedia article titles and Wiktionary word entries as concepts, and map query and document terms to concept vectors, which are then used to compute document relevance. In the bilingual task, we translate the English topics into the document language, German, by using machine translation. For SR-Text, we alternatively perform the translation by using cross-language links in Wikipedia, whereby the terms are directly mapped to concept vectors in the target language. The evaluation shows that the latter approach improves retrieval performance especially in cases where the machine translation system translates query terms incorrectly.
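The concept-vector idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' SR-Text or SR-Word implementation: the toy concept texts, the term-frequency weighting, the cosine scoring, and all function names below are assumptions chosen only to show how terms can be mapped to vectors over concepts and how those vectors can be compared to estimate relevance.

```python
import math
from collections import Counter, defaultdict

# Hypothetical toy data: each key stands in for a concept (e.g. a Wikipedia
# article title), each value for the text describing that concept.
CONCEPTS = {
    "Unemployment": "unemployment jobless labour market work",
    "Labour market": "labour market employment work wage jobless",
    "Election": "election vote party parliament",
}

def build_term_vectors(concepts):
    """Map each term to a concept vector (assumed weight: term frequency)."""
    vectors = defaultdict(dict)
    for concept, text in concepts.items():
        for term, tf in Counter(text.lower().split()).items():
            vectors[term][concept] = float(tf)
    return vectors

def text_vector(text, term_vectors):
    """Sum the concept vectors of all terms in a query or document."""
    vec = defaultdict(float)
    for term in text.lower().split():
        for concept, weight in term_vectors.get(term, {}).items():
            vec[concept] += weight
    return dict(vec)

def cosine(a, b):
    """Cosine similarity between two sparse concept vectors."""
    dot = sum(w * b.get(c, 0.0) for c, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)

def rank(query, docs, term_vectors):
    """Return (score, doc_id) pairs sorted by descending relevance."""
    q = text_vector(query, term_vectors)
    scored = [(cosine(q, text_vector(text, term_vectors)), doc_id)
              for doc_id, text in docs.items()]
    return sorted(scored, reverse=True)

term_vectors = build_term_vectors(CONCEPTS)
docs = {"d1": "jobless wage", "d2": "vote parliament"}
ranking = rank("unemployment", docs, term_vectors)
```

In this toy setup the query "unemployment" shares no term with document d1, yet d1 is ranked first because both map to the concept "Unemployment". A purely term-matching model would give both documents the same score of zero.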
Pages: 219-226
Number of pages: 8
Related papers
50 in total
  • [41] Domain-specific cross-language relevant question retrieval
    Bowen Xu
    Zhenchang Xing
    Xin Xia
    David Lo
    Shanping Li
    Empirical Software Engineering, 2018, 23 : 1084 - 1122
  • [42] Domain-specific cross-language relevant question retrieval
    Xu, Bowen
    Xing, Zhenchang
    Xia, Xin
    Lo, David
    Li, Shanping
    EMPIRICAL SOFTWARE ENGINEERING, 2018, 23 (02) : 1084 - 1122
  • [43] Domain-Specific Cross-Language Relevant Question Retrieval
    Xu, Bowen
    Xing, Zhenchang
    Xia, Xin
    Lo, David
    Wang, Qingye
    Li, Shanping
    13TH WORKING CONFERENCE ON MINING SOFTWARE REPOSITORIES (MSR 2016), 2016, : 413 - 424
  • [44] Defining and Using Domain-Specific Languages
    Lyytinen, Kalle
    Welke, Richard
    IEEE SOFTWARE, 2010, 27 (01) : 8 - 8
  • [45] Transferrable Framework Based on Knowledge Graphs for Generating Explainable Results in Domain-Specific, Intelligent Information Retrieval
    Abu-Rasheed, Hasan
    Weber, Christian
    Zenkert, Johannes
    Dornhofer, Mareike
    Fathi, Madjid
    INFORMATICS-BASEL, 2022, 9 (01):
  • [46] DFT-Extractor: A System to Extract Domain-specific Faceted Taxonomies from Wikipedia
    Wei, Bifan
    Liu, Jun
    Ma, Jian
    Zheng, Qinghua
    Zhang, Wei
    Feng, Boqin
    PROCEEDINGS OF THE 22ND INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'13 COMPANION), 2013, : 277 - 280
  • [47] A food safety prescreening method with domain-specific information using online reviews
    Zuo, Enguang
    Aysa, Alimjan
    Muhammat, Mahpirat
    Zhao, Yuxia
    Chen, Bing
    Ubul, Kurban
    JOURNAL OF CONSUMER PROTECTION AND FOOD SAFETY, 2022, 17 (02) : 163 - 175
  • [48] Domain-specific Chinese word segmentation using suffix tree and mutual information
    Daniel Zeng
    Donghua Wei
    Michael Chau
    Feiyue Wang
    Information Systems Frontiers, 2011, 13 : 115 - 125
  • [49] A food safety prescreening method with domain-specific information using online reviews
    Enguang Zuo
    Alimjan Aysa
    Mahpirat Muhammat
    Yuxia Zhao
    Bing Chen
    Kurban Ubul
    Journal of Consumer Protection and Food Safety, 2022, 17 : 163 - 175
  • [50] Domain-specific Chinese word segmentation using suffix tree and mutual information
    Zeng, Daniel
    Wei, Donghua
    Chau, Michael
    Wang, Feiyue
    INFORMATION SYSTEMS FRONTIERS, 2011, 13 (01) : 115 - 125