"Liar, Liar Pants on Fire": A New Benchmark Dataset for Fake News Detection

被引:490
|
作者
Wang, William Yang [1 ]
机构
[1] Univ Calif Santa Barbara, Dept Comp Sci, Santa Barbara, CA 93106 USA
关键词
D O I
10.18653/v1/P17-2067
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Automatic fake news detection is a challenging problem in deception detection, and it has tremendous real-world political and social impacts. However, statistical approaches to combating fake news has been dramatically limited by the lack of labeled benchmark datasets. In this paper, we present LIAR: a new, publicly available dataset for fake news detection. We collected a decade-long, 12.8K manually labeled short statements in various contexts from POLITIFACT.COM, which provides detailed analysis report and links to source documents for each case. This dataset can be used for fact-checking research as well. Notably, this new dataset is an order of magnitude larger than previously largest public fake news datasets of similar type. Empirically, we investigate automatic fake news detection based on surface-level linguistic patterns. We have designed a novel, hybrid convolutional neural network to integrate meta-data with text. We show that this hybrid approach can improve a text-only deep learning model.
引用
收藏
页码:422 / 426
页数:5
相关论文
共 50 条
  • [41] A Hybrid Model for Effective Fake News Detection with a Novel COVID-19 Dataset
    Kaliyar, Rohit Kumar
    Goswami, Anurag
    Narang, Pratik
    [J]. ICAART: PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 2, 2021, : 1066 - 1072
  • [42] FakeSV: A Multimodal Benchmark with Rich Social Context for Fake News Detection on Short Video Platforms
    Qi, Peng
    Bu, Yuyan
    Cao, Juan
    Ji, Wei
    Shui, Ruihao
    Xiao, Junbin
    Wang, Danding
    Chua, Tat-Seng
    [J]. THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 12, 2023, : 14444 - 14452
  • [43] Revisiting Shadow Detection: A New Benchmark Dataset for Complex World
    Hu, Xiaowei
    Wang, Tianyu
    Fu, Chi-Wing
    Jiang, Yitong
    Wang, Qiong
    Heng, Pheng-Ann
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 1925 - 1934
  • [44] A Hybrid Multitask Learning Framework with a Fire Hawk Optimizer for Arabic Fake News Detection
    Abd Elaziz, Mohamed
    Dahou, Abdelghani
    Orabi, Dina Ahmed
    Alshathri, Samah
    Soliman, Eman M.
    Ewees, Ahmed A.
    [J]. MATHEMATICS, 2023, 11 (02)
  • [45] Fire SM: new dataset for anomaly detection of fire in video surveillance
    Mali, Shital
    Khot, Uday
    [J]. ACTA IMEKO, 2022, 11 (01):
  • [46] Beyond News Contents: The Role of Social Context for Fake New Detection
    Shu, Kai
    Wang, Suhang
    Liu, Huan
    [J]. PROCEEDINGS OF THE TWELFTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING (WSDM'19), 2019, : 312 - 320
  • [47] Fake news detection models using the largest social media ground-truth dataset (TruthSeeker)
    Maysa Khalil
    Mohammad Azzeh
    [J]. International Journal of Speech Technology, 2024, 27 (2) : 389 - 404
  • [48] New explainability method for BERT-based model in fake news detection
    Szczepanski, Mateusz
    Pawlicki, Marek
    Kozik, Rafal
    Choras, Michal
    [J]. SCIENTIFIC REPORTS, 2021, 11 (01)
  • [49] Contributions to the Study of Fake News in Portuguese: New Corpus and Automatic Detection Results
    Monteiro, Rafael A.
    Santos, Roney L. S.
    Pardo, Thiago A. S.
    de Almeida, Tiago A.
    Ruiz, Evandro E. S.
    Vale, Oto A.
    [J]. COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE, PROPOR 2018, 2018, 11122 : 324 - 334
  • [50] New explainability method for BERT-based model in fake news detection
    Mateusz Szczepański
    Marek Pawlicki
    Rafał Kozik
    Michał Choraś
    [J]. Scientific Reports, 11