Word embedding and classification methods and their effects on fake news detection

被引:2
|
作者
Hauschild, Jessica [1 ]
Eskridge, Kent [2 ]
机构
[1] US Air Force Acad, Dept Math Sci, 2345 Fairchild Dr,6D-218, Air Force Acad, CO 80840 USA
[2] Univ Nebraska, Dept Stat, 3310 Holdrege St,343E, Lincoln, NE 68503 USA
来源
关键词
Natural language processing; Text classification; Fake news; Text analysis; REPRESENTATION;
D O I
10.1016/j.mlwa.2024.100566
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Natural language processing contains multiple methods of translating written text or spoken words into numerical information called word embeddings. Some of these embedding methods, such as Bag of Words, assume words are independent of one another. Other embedding methods, such as Bidirectional Encoder Representations from Transformers and Word2Vec, capture the relationship between words in various ways. In this paper, we are interested in comparing methods treating words as independent and methods capturing the relationship between words by looking at the effect these methods have on the classification of fake news. Using various classification methods, we compare the word embedding processes based on their effects on accuracy, precision, sensitivity, and specificity.
引用
收藏
页数:8
相关论文
共 50 条
  • [21] Fake News Detection on Social Media Using Ensemble Methods
    Ilyas, Muhammad Ali
    Rehman, Abdul
    Abbas, Assad
    Kim, Dongsun
    Naseem, Muhammad Tahir
    Allah, Nasro Min
    CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 81 (03): : 4525 - 4549
  • [22] A Survey of Fake News: Fundamental Theories, Detection Methods, and Opportunities
    Zhou, Xinyi
    Zafarani, Reza
    ACM COMPUTING SURVEYS, 2020, 53 (05)
  • [23] Fake news detection: A survey of graph neural network methods
    Phan, Huyen Trang
    Nguyen, Ngoc Thanh
    Hwang, Dosam
    APPLIED SOFT COMPUTING, 2023, 139
  • [24] Rapid detection of fake news based on machine learning methods
    Probierz, Barbara
    Stefanski, Piotr
    Kozak, Jan
    KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS (KSE 2021), 2021, 192 : 2893 - 2902
  • [25] Estimating vulnerability metrics with word embedding and multiclass classification methods
    Kekul, Hakan
    Ergen, Burhan
    Arslan, Halil
    INTERNATIONAL JOURNAL OF INFORMATION SECURITY, 2024, 23 (01) : 247 - 270
  • [26] Estimating vulnerability metrics with word embedding and multiclass classification methods
    Hakan Kekül
    Burhan Ergen
    Halil Arslan
    International Journal of Information Security, 2024, 23 : 247 - 270
  • [27] Word embedding and text classification based on deep learning methods
    Li, Saihan
    Gong, Bing
    2020 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE COMMUNICATION AND NETWORK SECURITY (CSCNS2020), 2021, 336
  • [28] A Multi-Kernel Optimized Convolutional Neural Network With Urdu Word Embedding to Detect Fake News
    Zaheer, Khurram
    Talib, Muhammad Ramzan
    Hanif, Muhammad Kashif
    Sarwar, Muhammad Umer
    IEEE ACCESS, 2023, 11 : 142371 - 142382
  • [29] Fake News Detection Using Time Series and User Features Classification
    Previti, Marialaura
    Rodriguez-Fernandez, Victor
    Camacho, David
    Carchiolo, Vincenza
    Malgeri, Michele
    APPLICATIONS OF EVOLUTIONARY COMPUTATION, EVOAPPLICATIONS 2020, 2020, 12104 : 339 - 353
  • [30] Linguistic feature based learning model for fake news detection and classification
    Choudhary, Anshika
    Arora, Anuja
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 169