Feature Selection for Fake News Classification

被引:0
|
作者
Sverdrup-Thygeson, Simen [1 ]
Haddow, Pauline C. [1 ]
机构
[1] Norwegian Univ Sci & Technol, CRAB Lab, Trondheim, Norway
关键词
Fake news; classification; feature selection; term frequency; sentiment analysis; text embeddings; BERT;
D O I
10.1109/SSCI50451.2021.9660080
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
An explosive growth of misleading and untrustworthy news articles has been observed over the last years. These news articles are often referred to as fake news and have been found to severely impact fair elections and democratic values. Computational Intelligence models may be applied to the classification of news articles, assuming that an efficient feature set is available as input to the model. However, the selection of appropriate feature sets is an open question for such high-dimensional tasks. A further challenge is the general applicability of feature selection strategies, where testing on a single dataset may convey misleading results. The work herein evaluates a wide-range of potential news article features resulting in twenty-five potential features. Feature selection, based on a combination of feature scoring, feature ranking and mutual information is then applied, evaluated on multiple datasets: Kaggle, Liar and FakeNewsNet. An Artificial Immune System model is applied in the feature ranking and as the classification model. The accuracy obtained is compared to state of the art fake news classification models, highlighting that the approach shows promise in terms of accuracy despite the small feature sets provided for classification.
引用
收藏
页数:8
相关论文
共 50 条
  • [31] A Comparative Study on the Swarm Intelligence Based Feature Selection Approaches for Fake And Real Fingerprint Classification
    Sasikala, V.
    LakshmiPrabha, V.
    [J]. PROCEEDINGS OF THE IEEE INTERNATIONAL CONFERENCE ON SOFT-COMPUTING AND NETWORKS SECURITY (ICSNS 2015), 2015,
  • [32] Optimization of Text Feature Selection Process Based on Advanced Searching for News Classification
    Kyaw, Khin Sandar
    Limsiroratana, Somchai
    [J]. INTERNATIONAL JOURNAL OF SWARM INTELLIGENCE RESEARCH, 2020, 11 (04) : 1 - 23
  • [33] Fake News Classification and Topic Modeling in Brazilian Portuguese
    Paixao, Maik
    Lima, Rinaldo
    Espinasse, Bernard
    [J]. 2020 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY (WI-IAT 2020), 2020, : 427 - 432
  • [34] Ensemble Learning Approach on Indonesian Fake News Classification
    Al-Ash, Herley Shaori
    Putri, Mutia Fadhila
    Mursanto, Petrus
    Bustamam, Alhadi
    [J]. 2019 3RD INTERNATIONAL CONFERENCE ON INFORMATICS AND COMPUTATIONAL SCIENCES (ICICOS 2019), 2019,
  • [35] Improving fake news classification using dependency grammar
    Nagy, Kitti
    Kapusta, Jozef
    [J]. PLOS ONE, 2021, 16 (09):
  • [36] An Arabic Corpus of Fake News: Collection, Analysis and Classification
    Alkhair, Maysoon
    Meftouh, Karima
    Smaili, Kamel
    Othman, Nouha
    [J]. ARABIC LANGUAGE PROCESSING: FROM THEORY TO PRACTICE, ICALP 2019, 2019, 1108 : 292 - 302
  • [37] Fake News Classification Based on Content Level Features
    Lai, Chun-Ming
    Chen, Mei-Hua
    Kristiani, Endah
    Verma, Vinod Kumar
    Yang, Chao-Tung
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (03):
  • [38] BerConvoNet: A deep learning framework for fake news classification
    Choudhary, Monika
    Chouhan, Satyendra Singh
    Pilli, S. Emmanuel
    Vipparthi, Santosh Kumar
    [J]. APPLIED SOFT COMPUTING, 2021, 110
  • [39] Fake news: a classification proposal and a future research agenda
    Rahmanian, Emad
    [J]. SPANISH JOURNAL OF MARKETING-ESIC, 2023, 27 (01) : 60 - 78
  • [40] A transformer-based architecture for fake news classification
    Divyam Mehta
    Aniket Dwivedi
    Arunabha Patra
    M. Anand Kumar
    [J]. Social Network Analysis and Mining, 2021, 11