Ontology-based Tamil–English cross-lingual information retrieval system

被引:0
|
作者
D Thenmozhi
Chandrabose Aravindan
机构
[1] SSN College of Engineering,Department of Computer Science and Engineering
来源
Sādhanā | 2018年 / 43卷
关键词
Cross-lingual information retrieval system; ontology; Tamil–English query translation; query expansion; semantic web;
D O I
暂无
中图分类号
学科分类号
摘要
Cross-lingual information retrieval (CLIR) systems facilitate users to query for information in one language and retrieve relevant documents in another language. In general, CLIR systems translate query in source language to target language and retrieve documents in target language based on the keywords present in the translated query. However, the presence of ambiguity in source and translated queries reduces the performance of the system. Ontology can be used to address this problem. The current approaches to ontology-based CLIR systems use manually constructed multilingual ontology, which is expensive. However, many methods exist to automatically construct ontology for any domain in English but not in other languages like Tamil. We propose a methodology for Tamil–English CLIR system by translating the Tamil query to English and retrieve pages in English to address these issues. Our approach uses a word sense disambiguation module to resolve the ambiguity in Tamil query. An automatically constructed ontology in English is used to address the ambiguity of English query. We have developed a morphological analyser for Tamil language, Tamil–English bilingual dictionary and named entity database to translate a Tamil query to English. The translated query is reformulated using ontology and the reformulated queries are given to a search engine to retrieve English documents from the Internet. We have evaluated our methodology for agriculture domain and the evaluation results show that our approach outperforms other approaches in terms of precision.
引用
收藏
相关论文
共 50 条
  • [1] Ontology-based Tamil-English cross-lingual information retrieval system
    Thenmozhi, D.
    Aravindan, Chandrabose
    [J]. SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2018, 43 (10):
  • [2] Chinese-English cross-lingual information retrieval based on domain ontology knowledge
    Yu, Feng
    Zheng, Dequan
    Zhao, Tiejun
    Li, Sheng
    Yu, Hao
    [J]. 2006 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY, PTS 1 AND 2, PROCEEDINGS, 2006, : 1460 - 1463
  • [3] A system for supporting cross-lingual information retrieval
    Capstick, J
    Diagne, AK
    Erbach, G
    Uszkoreit, H
    Leisenberg, A
    Leisenberg, M
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2000, 36 (02) : 275 - 289
  • [4] English-Malayalam Cross-Lingual Information Retrieval - an experience
    Nikesh, P. L.
    Sumam, Mary Idicula
    David, Peter S.
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ELECTRO/INFORMATION TECHNOLOGY, 2008, : 271 - 275
  • [5] Cross-Lingual Information Retrieval System for Indian Languages
    Jagarlamudi, Jagadeesh
    Kumaran, A.
    [J]. ADVANCES IN MULTILINGUAL AND MULTIMODAL INFORMATION RETRIEVAL, 2008, 5152 : 80 - 87
  • [6] A Learning to rank framework based on cross-lingual loss function for cross-lingual information retrieval
    Ghanbari, Elham
    Shakery, Azadeh
    [J]. APPLIED INTELLIGENCE, 2022, 52 (03) : 3156 - 3174
  • [7] An ontology-based information retrieval system
    Varga, P
    Mészáros, T
    Dezsényi, C
    Dobrowiecki, TP
    [J]. DEVELOPMENTS IN APPLIED ARTIFICIAL INTELLIGENCE, 2003, 2718 : 359 - 368
  • [8] A Learning to rank framework based on cross-lingual loss function for cross-lingual information retrieval
    Elham Ghanbari
    Azadeh Shakery
    [J]. Applied Intelligence, 2022, 52 : 3156 - 3174
  • [9] Semantic Cross-Lingual Information Retrieval
    Pourmahmoud, Solmaz
    Shamsfard, Mehrnoush
    [J]. 23RD INTERNATIONAL SYMPOSIUM ON COMPUTER AND INFORMATION SCIENCES, 2008, : 80 - +
  • [10] Cross-lingual information retrieval by feature vectors
    Lilleng, Jeanine
    Tomassen, Stein L.
    [J]. NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, PROCEEDINGS, 2007, 4592 : 229 - +