Cross-Lingual Adaptation Using Structural Correspondence Learning

被引:28
|
作者
Prettenhofer, Peter [1 ]
Stein, Benno [1 ]
机构
[1] Bauhaus Univ Weimar, D-99421 Weimar, Germany
关键词
Algorithms; Experimentation; Performance; Cross-language text classification; cross-lingual adaptation; structural correspondence learning; SELECTION; RETRIEVAL;
D O I
10.1145/2036264.2036277
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cross-lingual adaptation is a special case of domain adaptation and refers to the transfer of classification knowledge between two languages. In this article we describe an extension of Structural Correspondence Learning (SCL), a recently proposed algorithm for domain adaptation, for cross-lingual adaptation in the context of text classification. The proposed method uses unlabeled documents from both languages, along with a word translation oracle, to induce a cross-lingual representation that enables the transfer of classification knowledge from the source to the target language. The main advantages of this method over existing methods are resource efficiency and task specificity. We conduct experiments in the area of cross-language topic and sentiment classification involving English as source language and German, French, and Japanese as target languages. The results show a significant improvement of the proposed method over a machine translation baseline, reducing the relative error due to cross-lingual adaptation by an average of 30% (topic classification) and 59% (sentiment classification). We further report on empirical analyses that reveal insights into the use of unlabeled data, the sensitivity with respect to important hyperparameters, and the nature of the induced cross-lingual word correspondences.
引用
收藏
页数:22
相关论文
共 50 条
  • [1] Cross-lingual adaptation using structural correspondence learning
    Prettenhofer, Peter
    Stein, Benno
    ACM Transactions on Intelligent Systems and Technology, 2011, 3 (01)
  • [2] Cross-lingual Adaptation Using Universal Dependencies
    Taghizadeh, Nasrin
    Faili, Heshaam
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2021, 20 (04)
  • [3] Structural Correspondence Learning for Cross-Lingual Sentiment Classification with One-to-Many Mappings
    Li, Nana
    Zhai, Shuangfei
    Zhang, Zhongfei
    Liu, Boying
    THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 3490 - 3496
  • [4] Cross-Lingual Knowledge Transferring by Structural Correspondence and Space Transfer
    Wang, Deqing
    Wu, Junjie
    Yang, Jingyuan
    Jing, Baoyu
    Zhang, Wenjie
    He, Xiaonan
    Zhang, Hui
    IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (07) : 6555 - 6566
  • [5] Cross-lingual Continual Learning
    M'hamdi, Meryem
    Ren, Xiang
    May, Jonathan
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 3908 - 3943
  • [6] Meta-Learning for Fast Cross-Lingual Adaptation in Dependency Parsing
    Langedijk, Anna
    Dankers, Verna
    Lippe, Phillip
    Bos, Sander
    Guevara, Bryan Cardenas
    Yannakoudakis, Helen
    Shutova, Ekaterina
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 8503 - 8520
  • [7] SimCSum: Joint Learning of Simplification and Cross-lingual Summarization for Cross-lingual Science Journalism
    Fatima, Mehwish
    Kolber, Tim
    Markert, Katja
    Strube, Michael
    NewSumm 2023 - Proceedings of the 4th New Frontiers in Summarization Workshop, Proceedings of EMNLP Workshop, 2023, : 24 - 40
  • [8] Cross-lingual Adaptation for Recipe Retrieval with Mixup
    Zhu, Bin
    Ngo, Chong-Wah
    Chen, Jingjing
    Chan, Wing-Kwong
    PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2022, 2022, : 258 - 267
  • [9] Cross-Lingual Learning with Distributed Representations
    Pikuliak, Matus
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 8032 - 8033
  • [10] Cross-lingual Low Resource Speaker Adaptation Using Phonological Features
    Maniati, Georgia
    Ellinas, Nikolaos
    Markopoulos, Konstantinos
    Vamvoukakis, Georgios
    Sung, June Sig
    Park, Hyoungmin
    Chalamandaris, Aimilios
    Tsiakoulis, Pirros
    INTERSPEECH 2021, 2021, : 1594 - 1598