A Data Augmentation Approach to Sentiment Analysis of MOOC Reviews

被引:0
|
作者
Li, Guangmin [1 ]
Zhou, Long [1 ]
Tong, Qiang [1 ]
Ding, Yi [1 ]
Qi, Xiaolin [2 ]
Liu, Hang [3 ]
机构
[1] Hubei Normal Univ, Sch Comp & Informat Engn, Huangshi, Peoples R China
[2] Wuhan Technol & Business Univ, Acad Affairs Off, Wuhan, Peoples R China
[3] Cent China Normal Univ, Coll Phys Sci & Technol, Wuhan, Peoples R China
关键词
Data augmentation; sentiment analysis; MOOC; natural language processing; deep learning;
D O I
10.14569/IJACSA.2024.01508122
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
To address the lack of Chinese online course review corpora for aspect-based sentiment analysis, we propose Semantic Token Augmentation and Replacement (STAR), a semantic-relative distance-based data augmentation method. STAR leverages natural language processing techniques such as word embedding and semantic similarity to extract high-frequency words near aspect terms, learns their word vectors to obtain synonyms and replaces these words to enhance sentence diversity while maintaining semantic consistency. Experiments on a Chinese MOOC dataset show STAR improves Macro-F1 scores by 3.39%-8.18% for LCFS-BERT and 1.66%-8.37% for LCF-BERT compared to baselines. These results demonstrate STAR's effectiveness in improving the generalization ability of deep learning models for Chinese MOOC sentiment analysis.
引用
收藏
页码:1258 / 1264
页数:7
相关论文
共 50 条
  • [1] Data Augmentation for Sentiment Analysis in English - The Online Approach
    Jungiewicz, Michal
    Smywinski-Pohl, Aleksander
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2020, PT II, 2020, 12397 : 584 - 595
  • [2] Sentiment Analysis of MOOC Reviews Based On Capsule Network
    Liu, Tianyi
    Hu, Wei
    Guo, Hong
    Li, Yining
    2021 4TH INTERNATIONAL CONFERENCE ON INTELLIGENT AUTONOMOUS SYSTEMS (ICOIAS 2021), 2021, : 222 - 227
  • [3] Will sentiment analysis need subculture? A new data augmentation approach
    Wang, Zhenhua
    He, Simin
    Xu, Guang
    Ren, Ming
    JOURNAL OF THE ASSOCIATION FOR INFORMATION SCIENCE AND TECHNOLOGY, 2024, 75 (06) : 655 - 670
  • [4] Lexical data augmentation for sentiment analysis
    Xiang, Rong
    Chersoni, Emmanuele
    Lu, Qin
    Huang, Chu-Ren
    Li, Wenjie
    Long, Yunfei
    JOURNAL OF THE ASSOCIATION FOR INFORMATION SCIENCE AND TECHNOLOGY, 2021, 72 (11) : 1432 - 1447
  • [5] Data Augmentation in a Hybrid Approach for Aspect-Based Sentiment Analysis
    Liesting, Tomas
    Frasincar, Flavius
    Trusca, Maria Mihaela
    36TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, SAC 2021, 2021, : 828 - 835
  • [6] Research on MOOC Reviews Oriented Sentiment Analysis by Awareness of Emotional Distinctions
    Li, Li
    Huang, Yi
    Ren, Chengjuan
    IEEE ACCESS, 2024, 12 : 154823 - 154831
  • [7] Sentiment analysis of MOOC reviews via ALBERT-BiLSTM model
    Wang, Cheng
    Huang, Sirui
    Zhou, Ya
    2020 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE COMMUNICATION AND NETWORK SECURITY (CSCNS2020), 2021, 336
  • [8] Detection of spam reviews: a sentiment analysis approach
    Sunil Saumya
    Jyoti Prakash Singh
    CSI Transactions on ICT, 2018, 6 (2) : 137 - 148
  • [9] Exploratory Data Analysis and Sentiment Analysis of Drug Reviews
    Panda, Bijayalaxmi
    Panigrahi, Chhabi Rani
    Pati, Bibudhendu
    COMPUTACION Y SISTEMAS, 2022, 26 (03): : 1181 - 1189
  • [10] Guide for the application of the data augmentation approach on sets of texts in Spanish for sentiment and emotion analysis
    Benitez, Rodrigo Gutierrez
    Navarrete, Alejandra Segura
    Vidal-Castro, Christian
    Martinez-Araneda, Claudia
    PLOS ONE, 2024, 19 (09):