Zero-shot learning based cross-lingual sentiment analysis for sanskrit text with insufficient labeled data

被引:0
|
作者
Puneet Kumar
Kshitij Pathania
Balasubramanian Raman
机构
[1] Indian Institute of Technology Roorkee,Department of Computer Science and Engineering
[2] Indian Institute of Technology Roorkee,Department of Mathematics
来源
Applied Intelligence | 2023年 / 53卷
关键词
Labeled data insufficiency; Cross-lingual sentiment analysis; Sanskrit language analysis; Machine translation;
D O I
暂无
中图分类号
学科分类号
摘要
In this paper, a novel method for analyzing the sentiments portrayed by Sanskrit text has been proposed. Sanskrit is one of the world’s most ancient languages; however, natural language processing tasks such as machine translation and sentiment analysis have not been explored for it to the full potential because of the unavailability of sufficient labeled data. We solved this issue using a zero-shot learning-based cross-lingual sentiment analysis (CLSA) approach. The CLSA uses the resources from the source language to enhance the sentiment analysis of the target language having insufficient resources. The proposed work translates the text from Sanskrit, a language with insufficient labeled data, to English, with sufficient labeled data for sentiment analysis using a transformer model. A generative adversarial network-based strategy has been proposed to evaluate the maturity of the translations. Then a bidirectional long short-term memory-based model has been implemented to classify the sentiments using the embeddings obtained through translations. The proposed technique has achieved 87.50% accuracy for machine translation and 92.83% accuracy for sentiment classification. Sanskrit-English translations used in this work have been collected through web scraping techniques. In the absence of the ground-truth sentiment class labels, a strategy for evaluating the sentiment scores of the proposed sentiment analysis model has also been presented. A new dataset of Sanskrit text, along with their English translations and sentiment scores, has been constructed.
引用
收藏
页码:10096 / 10113
页数:17
相关论文
共 50 条
  • [31] Cross-Lingual Transfer in Zero-Shot Cross-Language Entity Linking
    Schumacher, Elliot
    Mayfield, James
    Dredze, Mark
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 583 - 595
  • [32] Deep Exploration of Cross-Lingual Zero-Shot Generalization in Instruction Tuning
    Han, Janghoon
    Lee, Changho
    Shin, Joongbo
    Choi, Stanley Jungkyu
    Lee, Honglak
    Bae, Kyunghoon
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 15436 - 15452
  • [33] Mix before Align: Towards Zero-shot Cross-lingual Sentiment Analysis via Soft-mix and Multi-view Learning
    Zhu, Zhihong
    Cheng, Xuxin
    Chen, Dongsheng
    Huang, Zhiqi
    Li, Hongxiang
    Zou, Yuexian
    INTERSPEECH 2023, 2023, : 3969 - 3973
  • [34] Zero-shot Cross-lingual Dialogue Systems with Transferable Latent Variables
    Liu, Zihan
    Shin, Jamin
    Xu, Yan
    Winata, Genta Indra
    Xu, Peng
    Madotto, Andrea
    Fung, Pascale
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 1297 - 1303
  • [35] Zero-shot Cross-lingual Transfer is Under-specified Optimization
    Wu, Shijie
    Van Durme, Benjamin
    Dredze, Mark
    PROCEEDINGS OF THE 7TH WORKSHOP ON REPRESENTATION LEARNING FOR NLP, 2022, : 236 - 248
  • [36] Zero-shot Cross-Lingual Phonetic Recognition with External Language Embedding
    Gao, Heting
    Ni, Junrui
    Zhang, Yang
    Qian, Kaizhi
    Chang, Shiyu
    Hasegawa-Johnson, Mark
    INTERSPEECH 2021, 2021, : 1304 - 1308
  • [37] Exposing the limits of Zero-shot Cross-lingual Hate Speech Detection
    Nozza, Debora
    ACL-IJCNLP 2021: THE 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 2, 2021, : 907 - 914
  • [38] Realistic Zero-Shot Cross-Lingual Transfer in Legal Topic Classification
    Xenouleas, Stratos
    Tsoukara, Alexia
    Panagiotakis, Giannis
    Chalkidis, Ilias
    Androutsopoulos, Ion
    PROCEEDINGS OF THE 12TH HELLENIC CONFERENCE ON ARTIFICIAL INTELLIGENCE, SETN 2022, 2022,
  • [39] Zero-shot Reading Comprehension by Cross-lingual Transfer Learning with Multi-lingual Language Representation Model
    Hsu, Tsung-Yuan
    Liu, Chi-liang
    Lee, Hung-yi
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 5933 - 5940
  • [40] A joint learning approach with knowledge injection for zero-shot cross-lingual hate speech detection
    Pamungkas, Endang Wahyu
    Basile, Valerio
    Patti, Viviana
    INFORMATION PROCESSING & MANAGEMENT, 2021, 58 (04)