Research on Intelligent Retrieval Model of Multilingual Text Information in Corpus

被引:0
|
作者
Wu, Ri-han [1 ]
Cao, Yi-jie [2 ]
机构
[1] Northwest Minzu Univ, Sch Chinese Language & Literature, Lanzhou 730030, Peoples R China
[2] Northwest Minzu Univ, Sch Ethnol & Sociol, Lanzhou 730030, Peoples R China
关键词
Corpus; Language; Information retrieval;
D O I
10.1007/978-3-030-94551-0_3
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Cross language information retrieval focuses on how to use the query expressed in one language to search the information expressed in another language. One of the key problems is to adopt different methods to establish bilingual semantic correspondence. In recent years, topic model has become an effective method in machine learning, information retrieval and natural language processing. This paper systematically studies the cross language retrieval model, cross language text classification method and cross language text clustering method. Without the help of cross language resources such as machine translation and bilingual dictionaries, it can effectively solve the many to many problem of Vocabulary Translation in CLIR and the problem of partial decomposition of unknown words. The experimental results on the cross language text classification evaluation corpus established in this paper show that the performance of cross language and single language text classification on the bilingual topic space constructed by this method is close to or better than that of single language classification on the original feature space, and the performance of cross language text clustering is close to or better than that of single language document clustering.
引用
收藏
页码:26 / 40
页数:15
相关论文
共 50 条
  • [1] Experiences in evaluating multilingual and text-image information retrieval
    Garcia-Serrano, Ana M.
    Martinez-Fernandez, Jose L.
    Martinez, Paloma
    [J]. INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2006, 21 (07) : 655 - 677
  • [2] Using corpus-based approaches in a system for multilingual information retrieval
    Braschler, M
    Schäuble, P
    [J]. INFORMATION RETRIEVAL, 2000, 3 (03): : 273 - 284
  • [3] Using Corpus-Based Approaches in a System for Multilingual Information Retrieval
    Martin Braschler
    Peter Schäuble
    [J]. Information Retrieval, 2000, 3 : 273 - 284
  • [4] Learning a merge model for multilingual information retrieval
    Tsai, Ming-Feng
    Chen, Hsin-Hsi
    Wang, Yu-Ting
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2011, 47 (05) : 635 - 646
  • [5] A multilingual approach to multilingual information retrieval
    Nie, JY
    Jin, F
    [J]. ADVANCES IN CROSS-LANGUAGE INFORMATION RETRIEVAL, 2003, 2785 : 101 - 110
  • [6] An intelligent information retrieval system model
    Chen, JJ
    Liu, LZ
    Song, HT
    Yu, XL
    [J]. PROCEEDINGS OF THE 4TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-4, 2002, : 2500 - 2503
  • [7] Advances in information retrieval: Recent research from the center for intelligent information retrieval
    Harabagiu, S
    [J]. COMPUTATIONAL LINGUISTICS, 2001, 27 (02) : 301 - 303
  • [8] The Personalized Information Retrieval Research Based on Intelligent
    Tian, Qiu-Yan
    Fei, Long
    Tian, Wei
    [J]. INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND COMMUNICATION ENGINEERING (CSCE 2015), 2015, : 699 - 704
  • [9] Intelligent information retrieval: some research trends
    Pasi, G
    [J]. ADVANCES IN SOFT COMPUTING: ENGINEERING DESIGN AND MANUFACTURING, 2003, : 159 - 171
  • [10] Combination approaches for multilingual text retrieval
    Braschler, M
    [J]. INFORMATION RETRIEVAL, 2004, 7 (1-2): : 183 - 204