Rumor detection in Arabic tweets using semi-supervised and unsupervised expectation-maximization

被引:48
|
作者
Alzanin, Samah M. [1 ]
Azmi, Aqil M. [1 ]
机构
[1] King Saud Univ, Coll Comp & Informat Sci, Dept Comp Sci, Riyadh 11543, Saudi Arabia
关键词
Rumor detection; Arabic; Semi-supervised; Unsupervised; Expectation-maximization; Twitter;
D O I
10.1016/j.knosys.2019.104945
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the continued development of social networks, the spreading of information has become faster than ever. Consequently, this has resulted in a problem with the reliability of the information, where any user can publish whatever he/she wants. Automated systems capable of detecting fake contents with similar striking speed as the information being disseminated are urgently required. Detecting rumors in Arabic language social networks has lagged behind the work on other languages, particularly in English. In this paper, we address the problem of detecting rumors in Arabic tweets. We used a set of features extracted from the user and the content. These features were analyzed to determine their significance. Semi-supervised expectation-maximization (E-M) was used to train the proposed system with topics of newsworthy tweets. A comparison with supervised Gaussian Naive Bayes (NB) showed that our semi-supervised system, using a small base of labeled data, outperforms Gaussian NB achieving an accuracy of 78.6%. The performance of the unsupervised E-M depends on the initial values, and we achieved an F-1 score of 80% in one of our experiments. (C) 2019 Elsevier B.V. All rights reserved.
引用
收藏
页数:9
相关论文
共 50 条
  • [31] SERBoost: Semi-supervised Boosting with Expectation Regularization
    Saffari, Amir
    Grabner, Helmut
    Bischof, Horst
    COMPUTER VISION - ECCV 2008, PT III, PROCEEDINGS, 2008, 5304 : 588 - 601
  • [32] Semi-Supervised Self-Learning for Arabic Hate Speech Detection
    Alsafari, Safa
    Sadaoui, Samira
    2021 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2021, : 863 - 868
  • [33] Automatic image segmentation for concealed object detection using the expectation-maximization algorithm
    Lee, Dong-Su
    Yeom, Seokwon
    Son, Jung-Young
    Kim, Shin-Hwan
    OPTICS EXPRESS, 2010, 18 (10): : 10659 - 10667
  • [34] Sparse Bayesian learning for structural damage detection using expectation-maximization technique
    Hou, Rongrong
    Xia, Yong
    Zhou, Xiaoqing
    Huang, Yong
    STRUCTURAL CONTROL & HEALTH MONITORING, 2019, 26 (05):
  • [35] Semi-supervised information-maximization clustering
    Calandriello, Daniele
    Niu, Gang
    Sugiyama, Masashi
    NEURAL NETWORKS, 2014, 57 : 103 - 111
  • [36] Semi-Supervised Self Training to Assess the Credibility of Tweets
    Gao, Leyu
    Shah, Sandeep
    Assery, Nasser
    Yuan, Xiaohong
    Qu, Xiuli
    Roy, Kaushik
    19TH IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING WITH APPLICATIONS (ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM 2021), 2021, : 1532 - 1537
  • [37] From unsupervised to semi-supervised anomaly detection methods for HRRP targets
    Bauw, Martin
    Velasco-Forero, Santiago
    Angulo, Jesus
    Adnet, Claude
    Airiau, Olivier
    2020 IEEE RADAR CONFERENCE (RADARCONF20), 2020,
  • [38] Refactoring Acoustic Models using Variational Expectation-Maximization
    Dognin, Pierre L.
    Hershey, John R.
    Goel, Vaibhava
    Olsen, Peder A.
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 228 - 231
  • [39] An Effective Approach for Rumor Detection of Arabic Tweets Using eXtreme Gradient Boosting Method
    Gumaei, Abdu
    Al-Rakhami, Mabrook S.
    Hassan, Mohammad Mehedi
    De Albuquerque, Victor Hugo C.
    Camacho, David
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2022, 21 (01)
  • [40] Inexact matching of ontology graphs using expectation-maximization
    Doshi, Prashant
    Kolli, Ravikanth
    Thomas, Christopher
    JOURNAL OF WEB SEMANTICS, 2009, 7 (02): : 90 - 106