Complementing WordNet with Roget's and corpus-based thesauri for information retrieval

被引:0
|
作者
Mandala, R [1 ]
Tokunaga, T [1 ]
Tanaka, H [1 ]
机构
[1] Tokyo Inst Technol, Dept Comp Sci, Meguro Ku, Tokyo 1528522, Japan
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a method to overcome the drawbacks of WordNet when applied to information retrieval by complementing it with Roget's thesaurus and corpus-derived thesauri. Words and relations which are not included in WordNet can be found in the corpus-derived thesauri. Effects of polysemy can be minimized with weighting method considering all query terms and all of the thesauri. Experimental results show that our method enhances information retrieval performance significantly.
引用
收藏
页码:94 / 101
页数:8
相关论文
共 50 条
  • [21] An object-based information retrieval model: Toward the structural construction of thesauri
    Han, JJ
    Choi, JH
    Park, JJ
    Yang, JD
    Lee, JK
    IEEE INTERNATIONAL FORUM ON RESEARCH AND TECHNOLOGY ADVANCES IN DIGITAL LIBRARIES -ADL'98-, PROCEEDINGS, 1998, : 117 - 125
  • [23] Survey and prospect of China's corpus-based research
    Yang, XJ
    CORPUS LINGUISTICS AROUND THE WORLD, 2006, (56): : 219 - 233
  • [24] A corpus-based relevance feedback approach to cross-language image retrieval
    Chang, Yih-Chen
    Lin, Wen-Cheng
    Chen, Hsin-Hsi
    ACCESSING MULTILINGUAL INFORMATION REPOSITORIES, 2006, 4022 : 592 - 601
  • [25] Beyonce's style in Lemonade: a corpus-based analysis
    Sidoruk, Natasha Barth
    Rebechi, Rozane Rodrigues
    ANTARES-LETRAS E HUMANIDADES, 2020, 12 (25): : 4 - 27
  • [26] A latent semantic indexing and WordNet based information retrieval model for digital forensics
    Du, Lan
    Jin, Huidong
    de Vel, Olivier
    Liu, Nianjun
    ISI 2008: 2008 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENCE AND SECURITY INFORMATICS, 2008, : 70 - +
  • [27] Methods and trends of biomedical and genomic information retrieval based on semantic relations of thesauri and MeSH
    Moran Reyes, Ariel Antonio
    Naumis Pena, Catalina
    INVESTIGACION BIBLIOTECOLOGICA, 2016, 30 (68): : 109 - 123
  • [28] Italian medical language: A corpus-based study on patient information leaflets
    Nitti, Paolo
    FORUM ITALICUM, 2025,
  • [29] Corpus-Based Information Extraction and Opinion Mining for the Restaurant Recommendation System
    Pronoza, Ekaterina
    Yagunova, Elena
    Volskaya, Svetlana
    STATISTICAL LANGUAGE AND SPEECH PROCESSING, SLSP 2014, 2014, 8791 : 272 - 284
  • [30] A Corpus-Based Study of the Chinese Translations of Shakespeare's Plays
    Li, Limin
    MULTICULTURAL SHAKESPEARE-TRANSLATION APPROPRIATION AND PERFORMANCE, 2020, 22 (37) : 191 - 196