Word embedding and classification methods and their effects on fake news detection

被引:2
|
作者
Hauschild, Jessica [1 ]
Eskridge, Kent [2 ]
机构
[1] US Air Force Acad, Dept Math Sci, 2345 Fairchild Dr,6D-218, Air Force Acad, CO 80840 USA
[2] Univ Nebraska, Dept Stat, 3310 Holdrege St,343E, Lincoln, NE 68503 USA
来源
关键词
Natural language processing; Text classification; Fake news; Text analysis; REPRESENTATION;
D O I
10.1016/j.mlwa.2024.100566
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Natural language processing contains multiple methods of translating written text or spoken words into numerical information called word embeddings. Some of these embedding methods, such as Bag of Words, assume words are independent of one another. Other embedding methods, such as Bidirectional Encoder Representations from Transformers and Word2Vec, capture the relationship between words in various ways. In this paper, we are interested in comparing methods treating words as independent and methods capturing the relationship between words by looking at the effect these methods have on the classification of fake news. Using various classification methods, we compare the word embedding processes based on their effects on accuracy, precision, sensitivity, and specificity.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] WELFake: Word Embedding Over Linguistic Features for Fake News Detection
    Verma, Pawan Kumar
    Agrawal, Prateek
    Amorim, Ivone
    Prodan, Radu
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2021, 8 (04) : 881 - 893
  • [3] Enhancing Fake News Detection with Word Embedding: A Machine Learning and Deep Learning Approach
    Al-Tarawneh, Mutaz A. B.
    Al-irr, Omar
    Al-Maaitah, Khaled S.
    Kanj, Hassan
    Aly, Wael Hosny Fouad
    COMPUTERS, 2024, 13 (09)
  • [4] Persian Fake News Detection: Neural Representation and Classification at Word and Text Levels
    Samadi, Mohammadreza
    Mousavian, Maryam
    Momtazi, Saeedeh
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2022, 21 (01)
  • [5] Machine Learning Methods for Fake News Classification
    Ksieniewicz, Pawel
    Choras, Michal
    Kozik, Rafal
    Wozniak, Michal
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING (IDEAL 2019), PT II, 2019, 11872 : 332 - 339
  • [6] Multi-Modal Component Embedding for Fake News Detection
    Kang, SeongKu
    Hwang, Junyoung
    Yu, Hwanjo
    PROCEEDINGS OF THE 2020 14TH INTERNATIONAL CONFERENCE ON UBIQUITOUS INFORMATION MANAGEMENT AND COMMUNICATION (IMCOM), 2020,
  • [7] Deep learning methods for Fake News detection
    Kresnakova, Viera Maslej
    Sarnovsky, Martin
    Butka, Peter
    IEEE JOINT 19TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND INFORMATICS AND 7TH INTERNATIONAL CONFERENCE ON RECENT ACHIEVEMENTS IN MECHATRONICS, AUTOMATION, COMPUTER SCIENCES AND ROBOTICS (CINTI-MACRO 2019), 2019, : 143 - 148
  • [8] Towards universal methods for fake news detection
    Pszona, Maria
    Janicka, Maria
    Wojdyga, Grzegorz
    Wawer, Aleksander
    NATURAL LANGUAGE ENGINEERING, 2023, 29 (04) : 1004 - 1042
  • [9] Text Data Augmentation Techniques for Word Embeddings in Fake News Classification
    Kapusta, Jozef
    Drzik, David
    Steflovic, Kirsten
    Nagy, Kitti Szabo
    IEEE ACCESS, 2024, 12 : 31538 - 31550
  • [10] Active Learning for Text Classification and Fake News Detection
    Sahan, Marko
    Smidl, Vaclav
    Marik, Radek
    2021 INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE AND INTELLIGENT CONTROLS (ISCSIC 2021), 2021, : 87 - 94