Exploring Neural Translation Models for Cross-Lingual Text Similarity

被引:2
|
作者
Seki, Kazuhiro [1 ]
机构
[1] Konan Univ, Kobe, Hyogo, Japan
关键词
Sequence-to-sequence models; distributed representation; cross-lingual information retrieval;
D O I
10.1145/3269206.3269262
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper explores a neural network-based approach to computing similarity of two texts written in different languages. Such similarity can be useful for a variety of applications including cross-lingual information retrieval and cross-lingual text classification. To compute similarity, we focus on neural machine translation models and examine the utility of their intermediate states. Through experiments on an English-Japanese translation corpus, it is demonstrated that the intermediate states of input texts are indeed beneficial for computing cross-lingual text similarity, outperforming other approaches including a strong machine translation-based baseline.
引用
收藏
页码:1591 / 1594
页数:4
相关论文
共 50 条
  • [31] Cross-lingual Transfer of Monolingual Models
    Gogoulou, Evangelia
    Ekgren, Ariel
    Isbister, Tim
    Sahlgren, Magnus
    [J]. LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 948 - 955
  • [32] Cross-lingual text filtering based on text concepts and kNN
    Li, SZ
    Su, WF
    Li, TQ
    Chen, HW
    [J]. PACLIC 17: Language, Information and Computation, Proceedings, 2003, : 166 - 173
  • [33] Exploring Timbre Disentanglement in Non-Autoregressive Cross-Lingual Text-to-Speech
    Zhan, Haoyue
    Yu, Xinyuan
    Zhang, Haitong
    Zhang, Yang
    Lin, Yue
    [J]. INTERSPEECH 2022, 2022, : 4247 - 4251
  • [34] Document Similarity for Arabic and Cross-Lingual Web Content
    Salhi, Ali
    Yahya, Adnan H.
    [J]. ARABIC LANGUAGE PROCESSING: FROM THEORY TO PRACTICE, 2018, 782 : 134 - 146
  • [35] Linear transformations for cross-lingual semantic textual similarity
    Brychcin, Tomas
    [J]. KNOWLEDGE-BASED SYSTEMS, 2020, 187
  • [36] Fractional Similarity: Cross-Lingual Feature Selection for Search
    Jagarlamudi, Jagadeesh
    Bennett, Paul N.
    [J]. ADVANCES IN INFORMATION RETRIEVAL, 2011, 6611 : 226 - +
  • [37] Exploring Cross-lingual Textual Style Transfer with Large Multilingual Language Models
    Moskovskiy, Daniil
    Dementieva, Daryna
    Panchenko, Alexander
    [J]. PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022): STUDENT RESEARCH WORKSHOP, 2022, : 346 - 354
  • [38] A Sense Based Similarity Measure for Cross-Lingual Documents
    Huang, Hsun-Hui
    Yang, Horng-Chang
    Kuo, Yau-Hwang
    [J]. ISDA 2008: EIGHTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, VOL 1, PROCEEDINGS, 2008, : 9 - +
  • [39] Cross-Lingual Semantic Similarity Measure for Comparable Articles
    Saad, Motaz
    Langlois, David
    Smaili, Kamel
    [J]. ADVANCES IN NATURAL LANGUAGE PROCESSING, 2014, 8686 : 105 - +
  • [40] Data Augmentation with Unsupervised Machine Translation Improves the Structural Similarity of Cross-lingual Word Embeddings
    Nishikawa, Sosuke
    Ri, Ryokan
    Tsuruoka, Yoshimasa
    [J]. ACL-IJCNLP 2021: THE 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING: PROCEEDINGS OF THE STUDENT RESEARCH WORKSHOP, 2021, : 163 - 173