Feature Selection for Fake News Classification

被引：0

作者：

Sverdrup-Thygeson, Simen ^{[1
]}

Haddow, Pauline C. ^{[1
]}

机构：

[1] Norwegian Univ Sci & Technol, CRAB Lab, Trondheim, Norway

来源：

2021 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2021) | 2021年

关键词：

Fake news; classification; feature selection; term frequency; sentiment analysis; text embeddings; BERT;

D O I：

10.1109/SSCI50451.2021.9660080

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

An explosive growth of misleading and untrustworthy news articles has been observed over the last years. These news articles are often referred to as fake news and have been found to severely impact fair elections and democratic values. Computational Intelligence models may be applied to the classification of news articles, assuming that an efficient feature set is available as input to the model. However, the selection of appropriate feature sets is an open question for such high-dimensional tasks. A further challenge is the general applicability of feature selection strategies, where testing on a single dataset may convey misleading results. The work herein evaluates a wide-range of potential news article features resulting in twenty-five potential features. Feature selection, based on a combination of feature scoring, feature ranking and mutual information is then applied, evaluated on multiple datasets: Kaggle, Liar and FakeNewsNet. An Artificial Immune System model is applied in the feature ranking and as the classification model. The accuracy obtained is compared to state of the art fake news classification models, highlighting that the approach shows promise in terms of accuracy despite the small feature sets provided for classification.

引用

页数：8

共 50 条

[31] A Comparative Study on the Swarm Intelligence Based Feature Selection Approaches for Fake And Real Fingerprint Classification
Sasikala, V.
LakshmiPrabha, V.
[J]. PROCEEDINGS OF THE IEEE INTERNATIONAL CONFERENCE ON SOFT-COMPUTING AND NETWORKS SECURITY (ICSNS 2015), 2015,
[32] Optimization of Text Feature Selection Process Based on Advanced Searching for News Classification
Kyaw, Khin Sandar
Limsiroratana, Somchai
[J]. INTERNATIONAL JOURNAL OF SWARM INTELLIGENCE RESEARCH, 2020, 11 (04) : 1 - 23
[33] Fake News Classification and Topic Modeling in Brazilian Portuguese
Paixao, Maik
Lima, Rinaldo
Espinasse, Bernard
[J]. 2020 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY (WI-IAT 2020), 2020, : 427 - 432
[34] Ensemble Learning Approach on Indonesian Fake News Classification
Al-Ash, Herley Shaori
Putri, Mutia Fadhila
Mursanto, Petrus
Bustamam, Alhadi
[J]. 2019 3RD INTERNATIONAL CONFERENCE ON INFORMATICS AND COMPUTATIONAL SCIENCES (ICICOS 2019), 2019,
[35] Improving fake news classification using dependency grammar
Nagy, Kitti
Kapusta, Jozef
[J]. PLOS ONE, 2021, 16 (09):
[36] An Arabic Corpus of Fake News: Collection, Analysis and Classification
Alkhair, Maysoon
Meftouh, Karima
Smaili, Kamel
Othman, Nouha
[J]. ARABIC LANGUAGE PROCESSING: FROM THEORY TO PRACTICE, ICALP 2019, 2019, 1108 : 292 - 302
[37] Fake News Classification Based on Content Level Features
Lai, Chun-Ming
Chen, Mei-Hua
Kristiani, Endah
Verma, Vinod Kumar
Yang, Chao-Tung
[J]. APPLIED SCIENCES-BASEL, 2022, 12 (03):
[38] BerConvoNet: A deep learning framework for fake news classification
Choudhary, Monika
Chouhan, Satyendra Singh
Pilli, S. Emmanuel
Vipparthi, Santosh Kumar
[J]. APPLIED SOFT COMPUTING, 2021, 110
[39] Fake news: a classification proposal and a future research agenda
Rahmanian, Emad
[J]. SPANISH JOURNAL OF MARKETING-ESIC, 2023, 27 (01) : 60 - 78
[40] A transformer-based architecture for fake news classification
Divyam Mehta
Aniket Dwivedi
Arunabha Patra
M. Anand Kumar
[J]. Social Network Analysis and Mining, 2021, 11

← 1 2 3 4 5 →