Employing synthetic data for addressing the class imbalance in aspect-based sentiment classification

被引:1
|
作者
Ganganwar, Vaishali [1 ]
Rajalakshmi, Ratnavel [1 ]
机构
[1] Vellore Inst Technol, Sch Comp Sci & Engn, Chennai, Tamil Nadu, India
关键词
Aspect-based sentiment classification; sentiment analysis; imbalanced data; class Imbalance; paraphrasing; backtranslation; SMOTE;
D O I
10.1080/24751839.2023.2270824
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The class imbalance problem, in which the distribution of different classes in training data is unequal or skewed, is a prevailing problem. This can lead to classifier algorithms being biased, negatively impacting the performance of the minority class. In this paper, we addressed the class imbalance problem in datasets for aspect-based sentiment classification. Aspect-based Sentiment Classification (AbSC) is a type of fine-grained sentiment analysis in which sentiments about particular aspects of an entity are extracted. In this work, we addressed the issue of class imbalance by creating synthetic data. For synthetic data generation, two techniques have been proposed: paraphrasing using the PEGASUS fine-tuned model and backtranslation using the M2M100 neural machine translation model. We compared these techniques with two other class balancing techniques, such as weighted oversampling and cross-entropy loss with class weight. An extensive experimental study has been conducted on three benchmark datasets for restaurant reviews: SemEval-2014, SemEval-2015, and SemEval-2016. We applied these methods to the BERT-based deep learning model for aspect-based sentiment classification and studied the effect of balancing the data on the performance of these models. Our proposed balancing technique, using synthetic data, yielded better results than the other two existing methods for dealing with multi-class imbalance.
引用
收藏
页码:167 / 188
页数:22
相关论文
共 50 条
  • [1] TAWC: Text Augmentation with Word Contributions for Imbalance Aspect-Based Sentiment Classification
    Santoso, Noviyanti
    Mendonça, Israel
    Aritsugi, Masayoshi
    Applied Sciences (Switzerland), 2024, 14 (19):
  • [2] A Survey on Aspect-Based Sentiment Classification
    Brauwers, Gianni
    Frasincar, Flavius
    ACM COMPUTING SURVEYS, 2023, 55 (04)
  • [3] Aspect-based Twitter Sentiment Classification
    Lek, Hsiang Hui
    Poo, Danny C. C.
    2013 IEEE 25TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2013, : 366 - 373
  • [4] Multitask Learning for Aspect-Based Sentiment Classification
    Yao, Chunhua
    Song, Xinyu
    Zhang, Xuelei
    Zhao, Weicheng
    Feng, Ao
    SCIENTIFIC PROGRAMMING, 2021, 2021
  • [5] Hybrid sentiment classification on twitter aspect-based sentiment analysis
    Zainuddin, Nurulhuda
    Selamat, Ali
    Ibrahim, Roliana
    APPLIED INTELLIGENCE, 2018, 48 (05) : 1218 - 1232
  • [6] Hybrid sentiment classification on twitter aspect-based sentiment analysis
    Nurulhuda Zainuddin
    Ali Selamat
    Roliana Ibrahim
    Applied Intelligence, 2018, 48 : 1218 - 1232
  • [7] Data augmentation for aspect-based sentiment analysis
    Guangmin Li
    Hui Wang
    Yi Ding
    Kangan Zhou
    Xiaowei Yan
    International Journal of Machine Learning and Cybernetics, 2023, 14 : 125 - 133
  • [8] Aspect-based sentiment classification model employing whale-optimized adaptive neural network
    Balaganesh, Nallathambi
    Muneeswaran, K.
    BULLETIN OF THE POLISH ACADEMY OF SCIENCES-TECHNICAL SCIENCES, 2021, 69 (03)
  • [9] Data augmentation for aspect-based sentiment analysis
    Li, Guangmin
    Wang, Hui
    Ding, Yi
    Zhou, Kangan
    Yan, Xiaowei
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2023, 14 (01) : 125 - 133
  • [10] Aspect-based Sentiment Classification via Reinforcement Learning
    Wang, Lichen
    Zong, Bo
    Liu, Yunyu
    Qin, Can
    Cheng, Wei
    Yu, Wenchao
    Zhang, Xuchao
    Chen, Haifeng
    Fu, Yun
    2021 21ST IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2021), 2021, : 1391 - 1396