Cross-lingual transfer learning: A PARAFAC2 approach

被引:1
|
作者
Pantraki, Evangelia [1 ]
Tsingalis, Ioannis [1 ]
Kotropoulos, Constantine [1 ]
机构
[1] Aristotle Univ Thessaloniki, Dept Informat, Thessaloniki, Greece
关键词
PARAFAC2; Cross-lingual transfer learning; Cross-lingual document classification; Cross-lingual authorship attribution; Language processing; EMBEDDINGS;
D O I
10.1016/j.patrec.2022.05.008
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The proposed framework addresses the problem of cross-lingual transfer learning resorting to Parallel Factor Analysis 2 (PARAFAC2). To avoid the need for multilingual parallel corpora, a pairwise setting is adopted where a PARAFAC2 model is fitted to documents written in English (source language) and a different target language. Firstly, an unsupervised PARAFAC2 model is fitted to parallel unlabelled corpora pairs to learn the latent relationship between the source and target language. The fitted model is used to create embeddings for a text classification task (document classification or authorship attribution). Subsequently, a logistic regression classifier is fitted to the training source language embeddings and tested on the training target language embeddings. Following the zero-shot setting, no labels are exploited for the target language documents. The proposed framework incorporates a self-learning process by utilizing the predicted labels as pseudo-labels to train a new, pseudo-supervised PARAFAC2 model, which aims to extract latent class-specific information while fusing language-specific information. Thorough evaluation is conducted on cross-lingual document classification and cross-lingual authorship attribution. Remarkably, the proposed framework achieves competitive results when compared to deep learning methods in cross-lingual transfer learning tasks. (C) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页码:167 / 173
页数:7
相关论文
共 50 条
  • [11] Exploring Cross-Lingual Transfer Learning with Unsupervised Machine Translation
    Wang, Chao
    Gaspers, Judith
    Do, Quynh
    Jiang, Hui
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 2011 - 2020
  • [12] Cross-Lingual Transfer Learning for Affective Spoken Dialogue Systems
    Gjoreski, Kristijan
    Gjoreski, Aleksandar
    Kraljevski, Ivan
    Hirschfeld, Diane
    INTERSPEECH 2019, 2019, : 1916 - 1920
  • [13] Getting to the core of PARAFAC2, a nonnegative approach
    Van Benthem, Mark H.
    Keller, Timothy J.
    Gillispie, Gregory D.
    DeJong, Stephanie A.
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2020, 206
  • [14] Cross-Lingual Lexico-Semantic Transfer in Language Learning
    Kochmar, Ekaterina
    Shutova, Ekaterina
    PROCEEDINGS OF THE 54TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, 2016, : 974 - 983
  • [15] Cross-lingual Transfer Learning for Japanese Named Entity Recognition
    Johnson, Andrew
    Karanasou, Penny
    Gaspers, Judith
    Klakow, Dietrich
    2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES(NAACL HLT 2019), VOL. 2 (INDUSTRY PAPERS), 2019, : 182 - 189
  • [16] Cross-Lingual Transfer Learning for Multilingual Task Oriented Dialog
    Schuster, Sebastian
    Gupta, Sonal
    Shah, Rushin
    Lewis, Mike
    2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 3795 - 3805
  • [17] Probabilistic PARAFAC2
    Jorgensen, Philip J. H.
    Nielsen, Soren F.
    Hinrich, Jesper L.
    Schmidt, Mikkel N.
    Madsen, Kristoffer H.
    Morup, Morten
    ENTROPY, 2024, 26 (08)
  • [18] Cross-lingual Transfer Learning for Semantic Role Labeling in Russian
    Alimova, Ilseyar
    Tutubalina, Elena
    Kirillovich, Alexander
    PROCEEDINGS OF THE FOURTH INTERNATIONAL CONFERENCE COMPUTATIONAL LINGUISTICS IN BULGARIA (CLIB '20), 2020, : 72 - 80
  • [19] Zero-Shot Cross-Lingual Transfer with Meta Learning
    Nooralahzadeh, Farhad
    Bekoulis, Giannis
    Bjerva, Johannes
    Augenstein, Isabelle
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 4547 - 4562
  • [20] Cross-Lingual Transfer Learning for Medical Named Entity Recognition
    Ding, Pengjie
    Wang, Lei
    Liang, Yaobo
    Lu, Wei
    Li, Linfeng
    Wang, Chun
    Tang, Buzhou
    Yan, Jun
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2020), PT I, 2020, 12112 : 403 - 418