Incorporating Transformer Models for Sentiment Analysis and News Classification in Khmer

Authors
Rifat, Md Rifatul Islam [1]
Al Imran, Abdullah [2]
Affiliations
[1] Rajshahi University of Engineering & Technology, Rajshahi, Bangladesh
[2] American International University-Bangladesh, Dhaka, Bangladesh
Keywords
Khmer; Deep learning; Sentiment analysis; News classification; Transformer models
DOI
10.1007/978-3-030-91434-9_10
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In recent years, natural language modeling has achieved a major breakthrough through sophisticated theoretical and technical advances. Leveraging the power of deep learning, transformer models have had a disruptive impact on natural language processing. However, the benefits of these advances are still confined to a few high-resource languages such as English, German, and French. Low-resource languages such as Khmer are still unable to exploit these advances because of a lack of technical support. In this study, our objective is to apply state-of-the-art language models to two empirical use cases in the Khmer language: sentiment analysis and news classification. To perform the classification tasks, we employed FastText and BERT to extract word embeddings and carried out three types of experiments: FastText, BERT feature-based, and BERT fine-tuning-based. A large text corpus of over 100,000 news articles was used to pre-train the transformer model, BERT. Our experiments show that in both use cases, a pre-trained and fine-tuned BERT model produces the best results.
Pages: 106-117
Number of pages: 12