On the use of text augmentation for stance and fake news detection

Cited by: 9
Authors
Salah, Ilhem [1, 2]
Jouini, Khaled [1]
Korbaa, Ouajdi [1]
Affiliations
[1] Univ Sousse, MARS Res Lab LR17ES05, ISITCom, Hosp Sousse, Sousse, Tunisia
[2] Univ Sousse, Hosp Sousse, MARS Res Lab LR17ES05, ISITCom, Sousse 4011, Tunisia
Keywords
Stance and fake news detection; text augmentation; ensemble learning; class imbalance
DOI
10.1080/24751839.2023.2198820
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Data Augmentation (DA) aims at synthesizing new training instances by applying transformations to available ones. DA has several well-known benefits, such as (i) increasing generalization ability, (ii) mitigating data scarcity, and (iii) helping resolve class imbalance issues. In this work, we investigate the use of DA for stance and fake news detection. In the first part of our work, we explore the effect of various DA techniques on the performance of common classification algorithms. Our study reveals that the motto 'the more, the better' is the wrong approach to text augmentation and that there is no one-size-fits-all text augmentation technique. The second part of our work leverages the results of our study to propose a novel augmentation-based ensemble learning approach. The proposed approach uses text augmentation to enhance the diversity and accuracy of the base learners, and hence the predictive performance of the ensemble. The third part of our work experimentally investigates the use of DA to cope with the class imbalance problem. Class imbalance is very common in stance and fake news detection and often results in biased models. We show how and to what extent text augmentation can help resolve moderate and severe imbalance.
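As a rough illustration of the idea of using text augmentation against class imbalance, the sketch below oversamples minority classes with lightly perturbed copies of their own texts. It is not the method from the paper: the two transformations (random swap and random deletion), the function names, and the toy stance labels are all assumptions chosen for brevity, and only the Python standard library is used.

```python
import random
from collections import Counter


def random_swap(tokens, rng):
    """Return a copy of tokens with two randomly chosen positions swapped."""
    out = list(tokens)
    if len(out) < 2:
        return out
    i, j = rng.sample(range(len(out)), 2)
    out[i], out[j] = out[j], out[i]
    return out


def random_deletion(tokens, rng, p=0.1):
    """Drop each token with probability p, always keeping at least one token."""
    if not tokens:
        return []
    kept = [t for t in tokens if rng.random() >= p]
    return kept if kept else [rng.choice(tokens)]


def augment(text, rng):
    """Apply one randomly chosen transformation to a whitespace-tokenized text."""
    tokens = text.split()
    if not tokens:
        return text
    op = rng.choice([random_swap, random_deletion])
    return " ".join(op(tokens, rng))


def balance_by_augmentation(texts, labels, seed=0):
    """Add augmented copies until every class matches the largest class size.

    Hypothetical helper: original instances are kept first, synthetic
    instances are appended after them.
    """
    rng = random.Random(seed)
    counts = Counter(labels)
    target = max(counts.values())
    new_texts, new_labels = list(texts), list(labels)
    for label, count in counts.items():
        pool = [t for t, y in zip(texts, labels) if y == label]
        for _ in range(target - count):
            new_texts.append(augment(rng.choice(pool), rng))
            new_labels.append(label)
    return new_texts, new_labels


if __name__ == "__main__":
    # Toy data, invented for this example only.
    texts = [
        "the article supports the claim in the headline",
        "the body agrees with the headline completely",
        "the report confirms what the headline says",
        "the story backs the headline with evidence",
        "the article refutes the claim in the headline",
    ]
    labels = ["agree", "agree", "agree", "agree", "disagree"]
    balanced_texts, balanced_labels = balance_by_augmentation(texts, labels)
    print(Counter(balanced_labels))
```

Balancing every class up to the majority size is only one possible policy; the abstract's caution that more augmentation is not always better suggests the amount of synthetic data should be treated as a tunable quantity.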
Pages: 359-375
Number of pages: 17