Exploring the Cross-Lingual Similarity of Valmiki Ramayana Using Semantic and Sentiment Analysis

被引:0
|
作者
Kulkarni, Pooja [1 ]
Birajdar, Gajanan K. [2 ]
机构
[1] DY Patil Deemed be Univ, Ramrao Adik Inst Technol Nerul, Dept Elect & Telecommun Engn, Navi Mumbai 400706, Maharashtra, India
[2] DY Patil Deemed be Univ, Ramrao Adik Inst Technol, Dept Elect Engn, Navi Mumbai 400706, Maharashtra, India
关键词
Natural language processing; BERT; cross lingual; semantic analysis; sentiment analysis;
D O I
10.1142/S2196888825500010
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Sanskrit language holds significant importance in Indian culture because it has been extensively used in religious literature, primarily in Hinduism. Numerous ancient Hindu texts originally composed in Sanskrit have since been translated into various Indian and non-Indian languages by Indian and foreign authors. These translations offer a renewed cultural perspective and broaden the reach of Indian literature to a global audience. However, the manual translations of these religious texts often lack thorough validation. Recent advancements in semantic and sentiment analysis, powered by deep learning, have provided enhanced tools for understanding language and text. In this paper, we present a framework that uses semantic and sentiment analysis to validate the English translation of the Ramayana against its original Sanskrit version. The "Ramayana" which narrates the journey of the Rama, the king of Ayodhya, is an ancient Hindu epic written by the sage Valmiki. It is known for its contribution to human values for centuries and has universal relevance. Given the importance of Sanskrit in Indian culture and its influence on literature, understanding the translations of key texts like the Ramayana is essential. Multilingual Bidirectional Encoder Representations from Transformers (mBERT) model is utilized to analyze the selected chapters of the English and the Sanskrit versions of Ramayana. Our analysis reveals that sentiment and semantic alignment between the original Sanskrit and English translations remain consistent despite stylistic and vocabulary differences. The study also compares the findings of Bidirectional Encoder Representations from Transformers (BERT) with its other variants to examine which BERT variant is more suitable for validating Sanskrit text. The paper demonstrates the potential of deep learning techniques for cross-lingual validation of ancient texts.
引用
收藏
页数:30
相关论文
共 50 条
  • [41] A Systematic Review of Cross-Lingual Sentiment Analysis: Tasks, Strategies, and Prospects
    Zhao, Chuanjun
    Wu, Meiling
    Yang, Xinyi
    Zhang, Wenyue
    Zhang, Shaoxia
    Wang, Suge
    Li, Deyu
    ACM COMPUTING SURVEYS, 2024, 56 (07)
  • [42] Improved Cross-Lingual Document Similarity Measurement
    Isuranga, Udhan
    Sandaruwan, Janaka
    Athukorala, Udesh
    Dias, Gihan
    2020 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2020), 2020, : 45 - 49
  • [43] Cross-lingual Annotation Projection of Semantic Roles
    Pado, Sebastian
    Lapata, Mirella
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2009, 36 : 307 - 340
  • [44] Data Quality Controlling for Cross-Lingual Sentiment Classification
    Li, Shoushan
    Xue, Yunxia
    Wang, Zhongqing
    Lee, Sophia Yat Mei
    Huang, Chu-Ren
    2013 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2013), 2013, : 125 - 128
  • [45] A Cross-Lingual Approach for Building Multilingual Sentiment Lexicons
    Naderalvojoud, Behzad
    Qasemizadeh, Behrang
    Kallmeyer, Laura
    Sezer, Ebru Akcapinar
    TEXT, SPEECH, AND DIALOGUE (TSD 2018), 2018, 11107 : 259 - 266
  • [46] Using Section Headings to Compute Cross-Lingual Similarity of Wikipedia Articles
    Paramita, Monica Lestari
    Clough, Paul
    Gaizauskas, Robert
    ADVANCES IN INFORMATION RETRIEVAL, ECIR 2017, 2017, 10193 : 633 - 639
  • [47] Building a Cross-lingual Relatedness Thesaurus using a Graph Similarity Measure
    Michelbacher, Lukas
    Laws, Florian
    Dorow, Beate
    Heid, Ulrich
    Schuetze, Hinrich
    LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010,
  • [48] CROSS-LINGUAL TRANSFER FOR SPEECH PROCESSING USING ACOUSTIC LANGUAGE SIMILARITY
    Wu, Peter
    Shi, Jiatong
    Zhong, Yifan
    Watanabe, Shinji
    Black, Alan W.
    2021 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2021, : 1050 - 1057
  • [49] Using semantic data to improve cross-lingual linking of article clusters
    Belyaeva, Evgenia
    Kosmerlj, Aljaz
    Muhic, Andrej
    Rupnik, Jan
    Fuart, Flavio
    JOURNAL OF WEB SEMANTICS, 2015, 35 : 64 - 70
  • [50] Multilingual Semantic Sourcing using Product Images for Cross-lingual Alignment
    Mangrulkar, Sourab
    Ankith, M. S.
    Sembium, Vivek
    COMPANION PROCEEDINGS OF THE WEB CONFERENCE 2022, WWW 2022 COMPANION, 2022, : 41 - 51