Research on Intelligent Retrieval Model of Multilingual Text Information in Corpus

Authors
Wu, Ri-han [1]
Cao, Yi-jie [2]
Affiliations
[1] Northwest Minzu Univ, Sch Chinese Language & Literature, Lanzhou 730030, Peoples R China
[2] Northwest Minzu Univ, Sch Ethnol & Sociol, Lanzhou 730030, Peoples R China
Keywords
Corpus; Language; Information retrieval
DOI
10.1007/978-3-030-94551-0_3
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Cross-language information retrieval (CLIR) focuses on how to use a query expressed in one language to search for information expressed in another language. One of the key problems is establishing bilingual semantic correspondence, for which different methods have been adopted. In recent years, topic models have become an effective method in machine learning, information retrieval, and natural language processing. This paper systematically studies a cross-language retrieval model, a cross-language text classification method, and a cross-language text clustering method. Without the help of cross-language resources such as machine translation and bilingual dictionaries, the proposed method can effectively address the many-to-many problem of vocabulary translation in CLIR and the problem of partial decomposition of unknown words. Experimental results on the cross-language text classification evaluation corpus established in this paper show that the performance of cross-language and monolingual text classification on the bilingual topic space constructed by this method is close to or better than that of monolingual classification on the original feature space, and that the performance of cross-language text clustering is close to or better than that of monolingual document clustering.
Pages: 26-40
Number of pages: 15