Zero-shot learning based cross-lingual sentiment analysis for sanskrit text with insufficient labeled data

被引:4
|
作者
Kumar, Puneet [1 ]
Pathania, Kshitij [2 ]
Raman, Balasubramanian [1 ]
机构
[1] Indian Inst Technol Roorkee, Dept Comp Sci & Engn, Roorkee, Uttar Pradesh, India
[2] Indian Inst Technol Roorkee, Dept Math, Roorkee, Uttar Pradesh, India
关键词
Labeled data insufficiency; Cross-lingual sentiment analysis; Sanskrit language analysis; Machine translation;
D O I
10.1007/s10489-022-04046-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, a novel method for analyzing the sentiments portrayed by Sanskrit text has been proposed. Sanskrit is one of the world's most ancient languages; however, natural language processing tasks such as machine translation and sentiment analysis have not been explored for it to the full potential because of the unavailability of sufficient labeled data. We solved this issue using a zero-shot learning-based cross-lingual sentiment analysis (CLSA) approach. The CLSA uses the resources from the source language to enhance the sentiment analysis of the target language having insufficient resources. The proposed work translates the text from Sanskrit, a language with insufficient labeled data, to English, with sufficient labeled data for sentiment analysis using a transformer model. A generative adversarial network-based strategy has been proposed to evaluate the maturity of the translations. Then a bidirectional long short-term memory-based model has been implemented to classify the sentiments using the embeddings obtained through translations. The proposed technique has achieved 87.50% accuracy for machine translation and 92.83% accuracy for sentiment classification. Sanskrit-English translations used in this work have been collected through web scraping techniques. In the absence of the ground-truth sentiment class labels, a strategy for evaluating the sentiment scores of the proposed sentiment analysis model has also been presented. A new dataset of Sanskrit text, along with their English translations and sentiment scores, has been constructed.
引用
收藏
页码:10096 / 10113
页数:18
相关论文
共 50 条
  • [41] Zero-Shot Cross-lingual Aphasia Detection using Automatic Speech Recognition
    Chatzoudis, Gerasimos
    Plitsis, Manos
    Stamouli, Spyridoula
    Dimou, Athanasia-Lida
    Katsamanis, Nassos
    Katsouros, Vassilis
    [J]. INTERSPEECH 2022, 2022, : 2178 - 2182
  • [42] Zero-shot cross-lingual transfer language selection using linguistic similarity
    Eronen, Juuso
    Ptaszynski, Michal
    Masui, Fumito
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2023, 60 (03)
  • [43] Transfer language selection for zero-shot cross-lingual abusive language detection
    Eronen, Juuso
    Ptaszynski, Michal
    Masui, Fumito
    Arata, Masaki
    Leliwa, Gniewosz
    Wroczynski, Michal
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2022, 59 (04)
  • [44] Beyond the EnglishWeb: Zero-Shot Cross-Lingual and Lightweight Monolingual Classification of Registers
    Repo, Liina
    Skantsi, Valtteri
    Ronnqvist, Samuel
    Hellstrom, Saara
    Oinonen, Miika
    Salmela, Anna
    Biber, Douglas
    Egbert, Jesse
    Pyysalo, Sampo
    Laippala, Veronika
    [J]. EACL 2021: THE 16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: PROCEEDINGS OF THE STUDENT RESEARCH WORKSHOP, 2021, : 183 - 191
  • [45] Adversarial Propagation and Zero-Shot Cross-Lingual Transfer of Word Vector Specialization
    Ponti, Edoardo M.
    Vulic, Ivan
    Glavas, Goran
    Mrksic, Nikola
    Korhonen, Anna
    [J]. 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 282 - 293
  • [46] The Impact of Cross-Lingual Adjustment of Contextual Word Representations on Zero-Shot Transfer
    Efimov, Pavel
    Boytsov, Leonid
    Arslanova, Elena
    Braslavski, Pavel
    [J]. ADVANCES IN INFORMATION RETRIEVAL, ECIR 2023, PT III, 2023, 13982 : 51 - 67
  • [47] Zero-Shot Cross-Lingual Knowledge Transfer in VQA via Multimodal Distillation
    Weng, Yu
    Dong, Jun
    He, Wenbin
    Chaomurilige
    Liu, Xuan
    Liu, Zheng
    Gao, Honghao
    [J]. IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024,
  • [48] Feature Aggregation in Zero-Shot Cross-Lingual Transfer Using Multilingual BERT
    Chen, Beiduo
    Guo, Wu
    Liu, Quan
    Tao, Kun
    [J]. 2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 1428 - 1435
  • [49] Cross-Lingual Pre-Training Based Transfer for Zero-Shot Neural Machine Translation
    Ji, Baijun
    Zhang, Zhirui
    Duan, Xiangyu
    Zhang, Min
    Chen, Boxing
    Luo, Weihua
    [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 115 - 122
  • [50] CJE-TIG: Zero-shot cross-lingual text-to-image generation by Corpora-based Joint Encoding
    Zhang, Han
    Yang, Suyi
    Zhu, Hongqing
    [J]. KNOWLEDGE-BASED SYSTEMS, 2022, 239