We Will Know Them by Their Style: Fake News Detection Based on Masked N-Grams

被引:1
|
作者
Perez-Santiago, Jennifer [1 ]
Villasenor-Pineda, Luis [1 ]
Montes-y-Gomez, Manuel [1 ]
机构
[1] INAOE, Lab Tecnol Lenguaje, Puebla, Mexico
来源
关键词
Fake news; Text classification; Written style;
D O I
10.1007/978-3-031-07750-0_23
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Thanks to the availability of digital media, users receive daily news reports, opinions and information on a wide variety of topics. These same media allow people to easily share and transmit their own opinions, thus enriching the debate and reflection on topics of public interest. Unfortunately, these circumstances have led to the emergence of fake news to misinform. This phenomenon has reached huge proportions, becoming a serious problem. Different approaches have been proposed to automatically detect fake news, based on analyzing their content, source or dispersion. The objective of the present work is to explore whether the written style of news can be used for this task. The proposed method uses a simple strategy based on keeping the most frequent words while masking the rest. The experiments in four collections, in two languages and different topics, have led to the conclusion that there are common lexical stylistic elements among fake news.
引用
收藏
页码:245 / 254
页数:10
相关论文
共 50 条
  • [1] Fake News detection using n-grams for PAN@CLEF competition
    Damian, Sergio
    Calvo, Hiram
    Gelbukh, Alexander
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 42 (05) : 4633 - 4640
  • [2] Using of n-grams from morphological tags for fake news classification
    Kapusta, Jozef
    Drlik, Martin
    Munk, Michal
    [J]. PEERJ COMPUTER SCIENCE, 2021, 7
  • [3] Fake News Identification: A Comparison of Parts-of-Speech and N-grams with Neural Networks
    Stoick, Brandon
    Snell, Nicholas
    Straub, Jeremy
    [J]. BIG DATA: LEARNING, ANALYTICS, AND APPLICATIONS, 2019, 10989
  • [4] Fake news spreaders profiling using N-grams of various types and SHAP-based feature selection
    Balouchzahi, Fazlourrahman
    Sidorov, Grigori
    Shashirekha, Hosahalli Lakshmaiah
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 42 (05) : 4437 - 4448
  • [5] Detection of algorithmically generated malicious domain names using masked N-grams
    Selvi, Jose
    Rodriguez, Ricardo J.
    Soria-Olivas, Emilio
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2019, 124 : 156 - 163
  • [6] On Automatic Plagiarism Detection Based on n-Grams Comparison
    Barron-Cedeno, Alberto
    Rosso, Paolo
    [J]. ADVANCES IN INFORMATION RETRIEVAL, PROCEEDINGS, 2009, 5478 : 696 - 700
  • [7] Musical Style Identification with n-Grams and Neural Networks
    Cruz-Alcazar, Pedro P.
    Castro-Bleda, Maria J.
    [J]. PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS AND APPLICATIONS, PROCEEDINGS, 2008, 5197 : 461 - +
  • [8] Malware Detection and Classification Based on n-grams Attribute Similarity
    Zhang Fuyong
    Zhao Tiezhou
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ENGINEERING (CSE) AND IEEE/IFIP INTERNATIONAL CONFERENCE ON EMBEDDED AND UBIQUITOUS COMPUTING (EUC), VOL 1, 2017, : 793 - 796
  • [9] Detection of changes in literary writing style using N-grams as style markers and supervised machine learning
    Rios-Toledo, German
    Francisco Posadas-Duran, Juan Pablo
    Sidorov, Grigori
    Alejandro Castro-Sanchez, Noe
    [J]. PLOS ONE, 2022, 17 (07):
  • [10] Anomaly Detection for Automotive Diagnostic Applications based on N-grams
    Rumez, Marcel
    Lin, Jinghua
    FuchB, Thomas
    Kriesten, Reiner
    Sax, Eric
    [J]. 2020 IEEE 44TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE (COMPSAC 2020), 2020, : 1423 - 1429