Novel approach for Arabic fake news classification using embedding from large language features with CNN-LSTM ensemble model and explainable AI

被引:0
|
作者
Omar Ibrahim Aboulola [1 ]
Muhammad Umer [2 ]
机构
[1] University of Jeddah,College of Computer Science and Engineering
[2] The Islamia University of Bahawalpur,Department of Computer Science & Information Technology
关键词
D O I
10.1038/s41598-024-82111-5
中图分类号
学科分类号
摘要
The widespread fake news challenges the management of low-quality information, making effective detection strategies necessary. This study addresses this critical issue by advancing fake news detection in Arabic and overcoming limitations in existing approaches. Deep learning models, Convolutional Neural Networks (CNN) and Long Short-Term Memory (LSTM), EfficientNetB4, Inception, Xception, ResNet, ConvLSTM and a novel voting ensemble framework combining CNN and LSTM are employed for text classification. The proposed framework integrates the ELMO word embedding technique having contextual representation capabilities, which is compared with GloVe, BERT, FastText and FastText subwords. Comprehensive experiments demonstrate that the proposed voting ensemble, combined with ELMo word embeddings, consistently outperforms previous approaches. It achieves an accuracy of 98.42%, precision of 98.54%, recall of 99.5%, and an F1 score of 98.93%, offering an efficient and highly effective solution for text classification tasks.The proposed framework benchmark against state-of-the-art transformer architectures, including BERT and RoBERTa, demonstrates competitive performance with significantly reduced inference time and enhanced interpretability accompanied by a 5-fold cross-validation technique. Furthermore, this research utilizes the LIME XAI technique to provide deeper insights into the contribution of each feature in predicting a specific target class. These findings show the proposed framework’s effectiveness in dealing with the issues of detecting false news, particularly in Arabic text. By generating higher performance metrics and displaying comparable results, this work opens the way for more reliable and interpretable text classification solutions.
引用
收藏
相关论文
共 12 条
  • [1] Diagnosis of Parkinson Disease from EEG Signals Using a CNN-LSTM Model and Explainable AI
    Bdaqli, Mohammad
    Shoeibi, Afshin
    Moridian, Parisa
    Sadeghi, Delaram
    Pouyani, Mozhde Firoozi
    Shalbaf, Ahmad
    Gorriz, Juan M.
    ARTIFICIAL INTELLIGENCE FOR NEUROSCIENCE AND EMOTIONAL SYSTEMS, PT I, IWINAC 2024, 2024, 14674 : 128 - 138
  • [2] Leveraging Arabic sentiment classification using an enhanced CNN-LSTM approach and effective Arabic text preparation
    Alayba, Abdulaziz M.
    Palade, Vasile
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (10) : 9710 - 9722
  • [3] Medical-Based Text Classification Using FastText Features and CNN-LSTM Model
    Zeghdaoui, Mohamed Walid
    Boussaid, Omar
    Bentayeb, Fadila
    Joly, Frederik
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2021, PT I, 2021, 12923 : 155 - 167
  • [4] A novel approach to fake news classification using LSTM-based deep learning models
    Padalko, Halyna
    Chomko, Vasyl
    Chumachenko, Dmytro
    FRONTIERS IN BIG DATA, 2024, 6
  • [5] A Novel Approach for the Detection of Cardiovascular Abnormalities from Electrocardiogram and Phonocardiogram Signals Using Combined CNN-LSTM Techniques
    Gnanapirakasam, Suganthi Brindha
    Manjula, J.
    Traitement du Signal, 2024, 41 (06) : 3131 - 3142
  • [6] CroLSSim: Cross-language software similarity detector using hybrid approach of LSA-based AST-MDrep features and CNN-LSTM model
    Ullah, Farhan
    Naeem, Muhammad Rashid
    Naeem, Hamad
    Cheng, Xiaochun
    Alazab, Mamoun
    International Journal of Intelligent Systems, 2022, 37 (09): : 5768 - 5795
  • [7] CroLSSim: Cross-language software similarity detector using hybrid approach of LSA-based AST-MDrep features and CNN-LSTM model
    Ullah, Farhan
    Naeem, Muhammad Rashid
    Naeem, Hamad
    Cheng, Xiaochun
    Alazab, Mamoun
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2022, 37 (09) : 5768 - 5795
  • [8] Improving Prediction of Arabic Fake News Using ELMO's Features-Based Tri-Ensemble Model and LIME XAI
    Aljrees, Turki
    IEEE ACCESS, 2024, 12 : 63066 - 63076
  • [9] Large-Scale News Classification using BERT Language Model: Spark NLP Approach
    Nugroho, Kuncahyo Setyo
    Sukmadewa, Anantha Yullian
    Yudistira, Novanto
    PROCEEDINGS OF 2021 INTERNATIONAL CONFERENCE ON SUSTAINABLE INFORMATION ENGINEERING AND TECHNOLOGY, SIET 2021, 2021, : 240 - 246
  • [10] Novel Multimodal Data for Enhanced Electricity Spot Price Forecasting Using A CNN-LSTM Ensemble Learning Model for the Japan Electric Power eXchange (JEPX) Spot Market
    Wang, Ziyang
    Mae, Masahiro
    Matsuhashi, Ryuji
    International Conference on the European Energy Market, EEM, 2024,