Feature Selection for Fake News Classification

Authors
Sverdrup-Thygeson, Simen [1]
Haddow, Pauline C. [1]
Affiliations
[1] Norwegian Univ Sci & Technol, CRAB Lab, Trondheim, Norway
Keywords
Fake news; classification; feature selection; term frequency; sentiment analysis; text embeddings; BERT
DOI
10.1109/SSCI50451.2021.9660080
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Recent years have seen explosive growth in misleading and untrustworthy news articles. Such articles are often referred to as fake news and have been found to severely impact fair elections and democratic values. Computational Intelligence models may be applied to the classification of news articles, provided that an efficient feature set is available as input to the model. However, the selection of appropriate feature sets remains an open question for such high-dimensional tasks. A further challenge is the general applicability of feature selection strategies, since testing on a single dataset may give misleading results. This work evaluates a wide range of potential news article features, resulting in twenty-five candidate features. Feature selection, based on a combination of feature scoring, feature ranking, and mutual information, is then applied and evaluated on multiple datasets: Kaggle, Liar, and FakeNewsNet. An Artificial Immune System model is used both in the feature ranking and as the classification model. The accuracy obtained is compared to state-of-the-art fake news classification models, showing that the approach is promising in terms of accuracy despite the small feature sets provided for classification.
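The abstract names mutual information as one ingredient of the feature selection step. As a rough illustration only, the sketch below scores discrete features by their mutual information with the class label and ranks them. It is not the authors' pipeline: the Artificial Immune System ranking is not reproduced, and the feature names (`has_exclamation`, `long_headline`, `constant`) and the toy data are hypothetical.

```python
from collections import Counter
from math import log2


def mutual_information(xs, ys):
    """Mutual information I(X;Y) in bits between two equal-length discrete sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))  # counts of each (x, y) pair
    px = Counter(xs)              # marginal counts of x
    py = Counter(ys)              # marginal counts of y
    mi = 0.0
    for (x, y), c in joint.items():
        # p(x,y) * log2( p(x,y) / (p(x) * p(y)) ), written with raw counts
        mi += (c / n) * log2(c * n / (px[x] * py[y]))
    return mi


def rank_features(features, labels):
    """Return (name, score) pairs sorted by descending mutual information with the labels."""
    scores = {name: mutual_information(values, labels)
              for name, values in features.items()}
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


# Hypothetical toy data: 1 = fake, 0 = real (assumption, not taken from the paper).
labels = [1, 1, 0, 0, 1, 0, 1, 0]
features = {
    "has_exclamation": [1, 1, 0, 0, 1, 0, 1, 0],  # identical to the label
    "long_headline":   [1, 0, 1, 0, 1, 0, 1, 0],  # weakly related to the label
    "constant":        [0, 0, 0, 0, 0, 0, 0, 0],  # carries no information
}

ranking = rank_features(features, labels)
for name, score in ranking:
    print(f"{name}: {score:.3f} bits")
```

A feature that duplicates the label reaches the label entropy (1 bit for this balanced toy set), while a constant feature scores 0. In practice a threshold or a top-k cut on such scores would decide which features are kept; which rule the paper uses is not stated in the abstract.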
Pages: 8
Related papers
  • [1] Machine learning for fake news classification with optimal feature selection
    Muhammad Fayaz
    Atif Khan
    Muhammad Bilal
    Sana Ullah Khan
    [J]. Soft Computing, 2022, 26 : 7763 - 7771
  • [2] Machine learning for fake news classification with optimal feature selection
    Fayaz, Muhammad
    Khan, Atif
    Bilal, Muhammad
    Khan, Sana Ullah
    [J]. SOFT COMPUTING, 2022, 26 (16) : 7763 - 7771
  • [3] Normalized effect size (NES): a novel feature selection model for Urdu fake news classification
    Wasim, Muhammad
    Cheema, Sehrish Munawar
    Pires, Ivan Miguel
    [J]. PEERJ COMPUTER SCIENCE, 2023, 9 : 1 - 23
  • [4] Enhancing Fake News Detection by Multi-Feature Classification
    Almarashy, Ahmed Hashim Jawad
    Feizi-Derakhshi, Mohammad-Reza
    Salehpour, Pedram
    [J]. IEEE ACCESS, 2023, 11 : 139601 - 139613
  • [5] Linguistic feature based learning model for fake news detection and classification
    Choudhary, Anshika
    Arora, Anuja
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2021, 169
  • [6] Automated Fake News Detection by LSTM Enabled with Optimal Feature Selection
    Nithya, S. Hannah
    Sahayadhas, Arun
    [J]. JOURNAL OF INFORMATION & KNOWLEDGE MANAGEMENT, 2022, 21 (03)
  • [7] Feature analysis of fake news: improving fake news detection in social media
    Leung, Johnathan
    Vatsalan, Dinusha
    Arachchilage, Nalin
    [J]. Journal of Cyber Security Technology, 2023, 7 (04) : 224 - 241
  • [8] A Study of the Impact of Evolutionary-Based Feature Selection for Fake News Detection
    Smith, Marcellus
    Richardson, Alexicia
    Brown, Brandon
    Dozier, Gerry
    King, Michael
    Morris, Joshua
    [J]. 2020 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2020, : 1859 - 1865
  • [9] Hybrid Feature Selection for Amharic News Document Classification
    Endalie, Demeke
    Haile, Getamesay
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2021, 2021
  • [10] Research on the Feature Selection Algorithm of Chinese News Classification
    Gong, Jun-peng
    Wen, Yu-jun
    Song, Qing
    [J]. INTERNATIONAL CONFERENCE ON SIMULATION, MODELLING AND MATHEMATICAL STATISTICS (SMMS 2015), 2015, : 455 - 458