Employing synthetic data for addressing the class imbalance in aspect-based sentiment classification

被引:2
|
作者
Ganganwar, Vaishali [1 ]
Rajalakshmi, Ratnavel [1 ]
机构
[1] Vellore Inst Technol, Sch Comp Sci & Engn, Chennai, Tamil Nadu, India
关键词
Aspect-based sentiment classification; sentiment analysis; imbalanced data; class Imbalance; paraphrasing; backtranslation; SMOTE;
D O I
10.1080/24751839.2023.2270824
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The class imbalance problem, in which the distribution of different classes in training data is unequal or skewed, is a prevailing problem. This can lead to classifier algorithms being biased, negatively impacting the performance of the minority class. In this paper, we addressed the class imbalance problem in datasets for aspect-based sentiment classification. Aspect-based Sentiment Classification (AbSC) is a type of fine-grained sentiment analysis in which sentiments about particular aspects of an entity are extracted. In this work, we addressed the issue of class imbalance by creating synthetic data. For synthetic data generation, two techniques have been proposed: paraphrasing using the PEGASUS fine-tuned model and backtranslation using the M2M100 neural machine translation model. We compared these techniques with two other class balancing techniques, such as weighted oversampling and cross-entropy loss with class weight. An extensive experimental study has been conducted on three benchmark datasets for restaurant reviews: SemEval-2014, SemEval-2015, and SemEval-2016. We applied these methods to the BERT-based deep learning model for aspect-based sentiment classification and studied the effect of balancing the data on the performance of these models. Our proposed balancing technique, using synthetic data, yielded better results than the other two existing methods for dealing with multi-class imbalance.
引用
收藏
页码:167 / 188
页数:22
相关论文
共 50 条
  • [31] Aspect-based sentiment analysis using adaptive aspect-based lexicons
    Mowlaei, Mohammad Erfan
    Abadeh, Mohammad Saniee
    Keshavarz, Hamidreza
    EXPERT SYSTEMS WITH APPLICATIONS, 2020, 148
  • [32] Sentiment Difficulty in Aspect-Based Sentiment Analysis
    Chifu, Adrian-Gabriel
    Fournier, Sebastien
    MATHEMATICS, 2023, 11 (22)
  • [33] Aspect Detection and Sentiment Classification using Deep Neural Network for Indonesian Aspect-Based Sentiment Analysis
    Ilmania, Arfinda
    Abdurrahman
    Cahyawijaya, Samuel
    Purwarianti, Ayu
    2018 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2018, : 62 - 67
  • [34] Aspect-Specific Heterogeneous Graph Convolutional Network for Aspect-Based Sentiment Classification
    Xu, Kuanhong
    Zhao, Hui
    Liu, Tianwen
    IEEE ACCESS, 2020, 8 : 139346 - 139355
  • [35] Data Augmentation in a Hybrid Approach for Aspect-Based Sentiment Analysis
    Liesting, Tomas
    Frasincar, Flavius
    Trusca, Maria Mihaela
    36TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, SAC 2021, 2021, : 828 - 835
  • [36] Cross-domain aspect-based sentiment classification with hybrid prompt
    Yuan, Shi
    Li, Meiqi
    Du, Yifei
    Xie, Yongle
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 255
  • [37] Multilabel Aspect-Based Sentiment Classification for Abilify Drug User Review
    Kumar, Ashok J.
    Abirami, S.
    Trueman, Tina Esther
    2019 11TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING (ICOAC 2019), 2019, : 376 - 380
  • [38] LCF: A Local Context Focus Mechanism for Aspect-Based Sentiment Classification
    Zeng, Biqing
    Yang, Heng
    Xu, Ruyang
    Zhou, Wu
    Han, Xuli
    APPLIED SCIENCES-BASEL, 2019, 9 (16):
  • [39] Aspect-Based Sentiment Classification Using Interactive Gated Convolutional Network
    Kumar, Avinash
    Narapareddy, Vishnu Teja
    Srikanth, Veerubhotla Aditya
    Neti, Lalita Bhanu Murthy
    Malapati, Aruna
    IEEE ACCESS, 2020, 8 : 22445 - 22453
  • [40] Aspect-based Sentiment Classification with Dual Cooperative Graph Attention Networks
    Cui, Xiyong
    Fang, Wei
    2022 5TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND NATURAL LANGUAGE PROCESSING, MLNLP 2022, 2022, : 135 - 141