Ontology-based Tamil–English cross-lingual information retrieval system

被引:0
|
作者
D Thenmozhi
Chandrabose Aravindan
机构
[1] SSN College of Engineering,Department of Computer Science and Engineering
来源
Sādhanā | 2018年 / 43卷
关键词
Cross-lingual information retrieval system; ontology; Tamil–English query translation; query expansion; semantic web;
D O I
暂无
中图分类号
学科分类号
摘要
Cross-lingual information retrieval (CLIR) systems facilitate users to query for information in one language and retrieve relevant documents in another language. In general, CLIR systems translate query in source language to target language and retrieve documents in target language based on the keywords present in the translated query. However, the presence of ambiguity in source and translated queries reduces the performance of the system. Ontology can be used to address this problem. The current approaches to ontology-based CLIR systems use manually constructed multilingual ontology, which is expensive. However, many methods exist to automatically construct ontology for any domain in English but not in other languages like Tamil. We propose a methodology for Tamil–English CLIR system by translating the Tamil query to English and retrieve pages in English to address these issues. Our approach uses a word sense disambiguation module to resolve the ambiguity in Tamil query. An automatically constructed ontology in English is used to address the ambiguity of English query. We have developed a morphological analyser for Tamil language, Tamil–English bilingual dictionary and named entity database to translate a Tamil query to English. The translated query is reformulated using ontology and the reformulated queries are given to a search engine to retrieve English documents from the Internet. We have evaluated our methodology for agriculture domain and the evaluation results show that our approach outperforms other approaches in terms of precision.
引用
下载
收藏
相关论文
共 50 条
  • [11] Ontology-based intelligent information retrieval system
    Yang, Yue-Hua
    Du, Jun-Ping
    Ping, Yuan
    Ruan Jian Xue Bao/Journal of Software, 2015, 26 (07): : 1675 - 1687
  • [12] Visual Ontology-based Information Retrieval System
    Zhuhadar, Leyla
    Nasraoui, Olfa
    Wyatt, Robert
    INFORMATION VISUALIZATION, IV 2009, PROCEEDINGS, 2009, : 419 - 426
  • [13] Ontology-based intelligent information retrieval system
    Pan, Ying
    Wang, Tianjiang
    Jiang, Xueling
    Journal of Computational Information Systems, 2008, 4 (01): : 91 - 96
  • [14] Dictionary methods for cross-lingual information retrieval
    Ballesteros, L
    Croft, B
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, 1996, 1134 : 791 - 801
  • [15] Cross-lingual Dysarthria Severity Classification for English, Korean, and Tamil
    Yeo, Eun Jung
    Choi, Kwanghee
    Kim, Sunhee
    Chung, Minhwa
    PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 566 - 574
  • [16] The application of the comparable corpora in Chinese-English Cross-Lingual Information Retrieval
    Du, L
    Zhang, YB
    Sun, L
    Sun, YF
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2001, 16 (04) : 351 - 358
  • [17] The Application of the Comparable Corpora in Chinese-English Cross-Lingual Information Retrieval
    杜林
    张毅波
    孙乐
    孙玉芳
    Journal of Computer Science & Technology, 2001, (04) : 351 - 358
  • [18] The application of the comparable corpora in Chinese-English Cross-Lingual Information Retrieval
    Lin Du
    Yibo Zhang
    Le Sun
    Yufang Sun
    Journal of Computer Science and Technology, 2001, 16 : 351 - 358
  • [19] Cross-lingual information retrieval model based on bilingual topic correlation
    Luo, Yuansheng
    Le, Zhongjian
    Wang, Mingwen
    Journal of Computational Information Systems, 2013, 9 (06): : 2433 - 2440
  • [20] SemOIR: An ontology-based semantic information retrieval system
    Tang, Mingwei
    Chen, Jiangping
    Chen, Haihua
    COMPANION OF THE 2020 IEEE 20TH INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY, AND SECURITY (QRS-C 2020), 2020, : 204 - 208