Research on Cross-language Text Similarity Calculation

被引:0
|
作者
Yuan, Sun [1 ]
Qian, Zhao [1 ]
机构
[1] Minzu Univ China, Sch Informat Engn, Natl Language Resource & Monitoring Res Ctr, Minor Languages Branch, Beijing, Peoples R China
关键词
text similarity; cross-language; tibetan-chinese; LDA model;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cross-language text similarity calculation is a critical and fundamental problem in natural language processing. It is widely used in cross-language research, such as cross-language information retrieval. In this paper, we used the LDA (Latent Dirichlet Allocation) model to calculate similarities of Tibetan and Chinese texts at the topic level. Through topic modelling and forecasting, the texts are mapped to the feature space of topics. This method reduced the dimensions of text space vector and improved the speed and efficiency of computation.
引用
收藏
页码:423 / 426
页数:4
相关论文
共 50 条
  • [1] Calculation of Chinese-Thai Cross-Language Similarity Based on Sentence Embedding
    Feng Yinhan
    Zhan Gang
    Mao Weixiu
    Lin Shunbao
    Yu Shijie
    Zhang Kui
    [J]. 2020 5TH INTERNATIONAL CONFERENCE ON SMART GRID AND ELECTRICAL AUTOMATION (ICSGEA 2020), 2020, : 268 - 271
  • [2] An Automatic Measure of Cross-Language Text Structures
    Kim K.
    [J]. Technology, Knowledge and Learning, 2018, 23 (2) : 301 - 314
  • [3] WordNet based cross-language text categorization
    Amine, Bentaallah Mohamed
    Mimoun, Malki
    [J]. 2007 IEEE/ACS INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS, VOLS 1 AND 2, 2007, : 848 - +
  • [4] Applying EuroWordNet to Cross-Language Text Retrieval
    Julio Gonzalo
    Felisa Verdejo
    Carol Peters
    Nicoletta Calzolari
    [J]. Computers and the Humanities, 1998, 32 : 185 - 207
  • [5] Applying EuroWordNet to cross-language text retrieval
    Gonzalo, J
    Verdejo, F
    Peters, C
    Calzolari, N
    [J]. COMPUTERS AND THE HUMANITIES, 1998, 32 (2-3): : 185 - 207
  • [6] Adaptive support for cross-language text retrieval
    De Luca, Ernesto William
    Nuernberger, Andreas
    [J]. ADAPTIVE HYPERMEDIA AND ADAPTIVE WEB-BASED SYSTEMS, PROCEEDINGS, 2006, 4018 : 425 - 429
  • [7] CROSS-LANGUAGE ADAPTATION: COLLOCATION IN MEDIA TEXT
    Yilmaz, Elvira Rafilovna
    [J]. LAPLAGE EM REVISTA, 2020, 6 : 1 - 6
  • [8] Cross-Language Similarity Modulates Effectiveness of Second Language Grammar Instruction
    Tolentino, Leida C.
    Tokowicz, Natasha
    [J]. LANGUAGE LEARNING, 2014, 64 (02) : 279 - 309
  • [9] Text-Independent Cross-Language Voice Conversion
    Suendermann, David
    Hoege, Harald
    Bonafonte, Antonio
    Ney, Hermann
    Hirschberg, Julia
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 2262 - +
  • [10] Wikipedia-based cross-language text classification
    Mourino Garcia, Marcos Antonio
    Perez Rodriguez, Roberto
    Anido Rifon, Luis
    [J]. INFORMATION SCIENCES, 2017, 406 : 12 - 28