Incorporating Transformer Models for Sentiment Analysis and News Classification in Khmer

Cited by: 0
Authors
Rifat, Md Rifatul Islam [1]
Al Imran, Abdullah [2]
Affiliations
[1] Rajshahi Univ Engn & Technol, Rajshahi, Bangladesh
[2] Amer Int Univ Bangladesh, Dhaka, Bangladesh
Keywords
Khmer; Deep learning; Sentiment analysis; News classification; Transformer models
DOI
10.1007/978-3-030-91434-9_10
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In recent years, natural language modeling has achieved major breakthroughs through theoretical and technical advances. Leveraging the power of deep learning, transformer models have had a disruptive impact on the field of natural language processing. However, the benefits of these advances are still confined to a few high-resource languages such as English, German, and French. Low-resource languages such as Khmer cannot yet take advantage of them because of a lack of technical support for the language. In this study, our objective is to apply state-of-the-art language models to two empirical use cases in the Khmer language: sentiment analysis and news classification. To perform the classification tasks, we employed FastText and BERT to extract word embeddings and carried out three types of experiments: FastText, BERT feature-based, and BERT fine-tuning-based. A large text corpus of over 100,000 news articles was used to pre-train the transformer model, BERT. The results of our experiments show that, in both use cases, a pre-trained and fine-tuned BERT model produces the best results.
Pages: 106-117
Number of pages: 12
Related papers
50 in total
  • [1] Sentiment Polarity Classification for Khmer
    Khim, Sokheng
    Thu, Ye Kyaw
    Sam, Sethserey
    [J]. 2023 18TH INTERNATIONAL JOINT SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND NATURAL LANGUAGE PROCESSING, ISAI-NLP, 2023,
  • [2] Efficient Transformer Based Sentiment Classification Models
    Mathew, Leeja
    Bindu, V.R.
    [J]. Informatica (Slovenia), 2022, 46 (08): 175 - 184
  • [3] Incorporating Pre-trained Transformer Models into TextCNN for Sentiment Analysis on Software Engineering Texts
    Sun, Kexin
    Shi, XiaoBo
    Gao, Hui
    Kuang, Hongyu
    Ma, Xiaoxing
    Rong, Guoping
    Shao, Dong
    Zhao, Zheng
    Zhang, He
    [J]. 13TH ASIA-PACIFIC SYMPOSIUM ON INTERNETWARE, INTERNETWARE 2022, 2022, : 127 - 136
  • [4] Classification of Facebook News Feeds and Sentiment Analysis
    Setty, Shankar
    Jadi, Rajendra
    Shaikh, Sabya
    Mattikalli, Chandan
    Mudenagudi, Vma
    [J]. 2014 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2014, : 18 - 23
  • [5] Text Mining: Sentiment Analysis on news classification
    Gomes, Helder
    Neto, Miguel de Castro
    Henriques, Roberto
    [J]. PROCEEDINGS OF THE 2013 8TH IBERIAN CONFERENCE ON INFORMATION SYSTEMS AND TECHNOLOGIES (CISTI 2013), 2013,
  • [6] Tweets Topic Classification and Sentiment Analysis Based on Transformer-Based Language Models
    Mandal, Ranju
    Chen, Jinyan
    Becken, Susanne
    Stantic, Bela
    [J]. VIETNAM JOURNAL OF COMPUTER SCIENCE, 2023, 10 (02) : 117 - 134
  • [7] Sentiment Analysis of StockTwits Using Transformer Models
    Bozanta, Aysun
    Angco, Sabrina
    Cevik, Mucahit
    Basar, Ayse
    [J]. 20TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2021), 2021, : 1253 - 1258
  • [8] Longitudinal analysis of sentiment and emotion in news media headlines using automated labelling with Transformer language models
    Rozado, David
    Hughes, Ruth
    Halberstadt, Jamin
    [J]. PLOS ONE, 2022, 17 (10):
  • [9] News Classification and Categorization with Smart Function Sentiment Analysis
    Nkongolo, Mike Nkongolo Wa
    [J]. INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2023, 2023
  • [10] Comprehensive review and comparative analysis of transformer models in sentiment analysis
    Bashiri, Hadis
    Naderi, Hassan
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2024,