Complementing WordNet with Roget's and corpus-based thesauri for information retrieval

被引：0

作者：

Mandala, R ^{[1
]}

Tokunaga, T ^{[1
]}

Tanaka, H ^{[1
]}

机构：

[1] Tokyo Inst Technol, Dept Comp Sci, Meguro Ku, Tokyo 1528522, Japan

来源：

NINTH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS | 1999年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper proposes a method to overcome the drawbacks of WordNet when applied to information retrieval by complementing it with Roget's thesaurus and corpus-derived thesauri. Words and relations which are not included in WordNet can be found in the corpus-derived thesauri. Effects of polysemy can be minimized with weighting method considering all query terms and all of the thesauri. Experimental results show that our method enhances information retrieval performance significantly.

引用

页码：94 / 101

页数：8

共 50 条

[21] An object-based information retrieval model: Toward the structural construction of thesauri
Han, JJ
Choi, JH
Park, JJ
Yang, JD
Lee, JK
IEEE INTERNATIONAL FORUM ON RESEARCH AND TECHNOLOGY ADVANCES IN DIGITAL LIBRARIES -ADL'98-, PROCEEDINGS, 1998, : 117 - 125
[22] ORGANIZATION OF THE INVERTED FILES IN A DISTRIBUTED INFORMATION-RETRIEVAL SYSTEM BASED ON THESAURI
MAZUR, Z
INFORMATION PROCESSING & MANAGEMENT, 1986, 22 (03) : 243 - 250
[23] Survey and prospect of China's corpus-based research
Yang, XJ
CORPUS LINGUISTICS AROUND THE WORLD, 2006, (56): : 219 - 233
[24] A corpus-based relevance feedback approach to cross-language image retrieval
Chang, Yih-Chen
Lin, Wen-Cheng
Chen, Hsin-Hsi
ACCESSING MULTILINGUAL INFORMATION REPOSITORIES, 2006, 4022 : 592 - 601
[25] Beyonce's style in Lemonade: a corpus-based analysis
Sidoruk, Natasha Barth
Rebechi, Rozane Rodrigues
ANTARES-LETRAS E HUMANIDADES, 2020, 12 (25): : 4 - 27
[26] A latent semantic indexing and WordNet based information retrieval model for digital forensics
Du, Lan
Jin, Huidong
de Vel, Olivier
Liu, Nianjun
ISI 2008: 2008 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENCE AND SECURITY INFORMATICS, 2008, : 70 - +
[27] Methods and trends of biomedical and genomic information retrieval based on semantic relations of thesauri and MeSH
Moran Reyes, Ariel Antonio
Naumis Pena, Catalina
INVESTIGACION BIBLIOTECOLOGICA, 2016, 30 (68): : 109 - 123
[28] Italian medical language: A corpus-based study on patient information leaflets
Nitti, Paolo
FORUM ITALICUM, 2025,
[29] Corpus-Based Information Extraction and Opinion Mining for the Restaurant Recommendation System
Pronoza, Ekaterina
Yagunova, Elena
Volskaya, Svetlana
STATISTICAL LANGUAGE AND SPEECH PROCESSING, SLSP 2014, 2014, 8791 : 272 - 284
[30] A Corpus-Based Study of the Chinese Translations of Shakespeare's Plays
Li, Limin
MULTICULTURAL SHAKESPEARE-TRANSLATION APPROPRIATION AND PERFORMANCE, 2020, 22 (37) : 191 - 196

← 1 2 3 4 5 →