Applying Machine Learning Techniques for Religious Extremism Detection on Online User Contents

Cited by: 4
Authors
Mussiraliyeva, Shynar [1]
Omarov, Batyrkhan [1]
Yoo, Paul [1, 2]
Bolatbek, Milana [1]
Affiliations
[1] Al Farabi Kazakh Natl Univ, Alma Ata, Kazakhstan
[2] Univ London, Birkbeck Coll, CSIS, London, England
Source
CMC-COMPUTERS MATERIALS & CONTINUA | 2022, Vol. 70, No. 01
Keywords
Extremism; religious extremism; machine learning; social media; social network; natural language processing; NLP; ISLAMIST;
DOI
10.32604/cmc.2022.019189
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In this paper, we propose a corpus for detecting religious extremism in social networks and open sources, and we compare machine learning algorithms on the binary classification problem using a previously created corpus, thereby checking whether extremist messages in the Kazakh language can be detected. To do this, we trained models with six classic machine learning algorithms: Support Vector Machine, Decision Tree, Random Forest, K-Nearest Neighbors, Naive Bayes, and Logistic Regression. To increase the accuracy of detecting extremist texts, we used several feature types (statistical features, TF-IDF, POS, and LIWC) and applied oversampling and undersampling techniques to handle imbalanced data. As a result, we achieved 98% accuracy in detecting religious extremism in Kazakh texts on the collected dataset. Testing the trained models on collections of texts common in everyday life ("Jokes", "News", "Toxic content", "Spam", "Advertising") also showed high rates of extremism detection.
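Two of the steps named in the abstract, TF-IDF weighting and oversampling of the minority class, can be illustrated with a minimal standard-library sketch. This is not the authors' implementation: the function names `tfidf` and `random_oversample` are hypothetical, the smoothed IDF formula is an assumed variant, whitespace tokenization is a simplification, and the sample texts and labels are placeholders rather than corpus data.

```python
import math
import random
from collections import Counter


def tfidf(docs):
    """Return one {term: weight} dict per document.

    Weight = relative term frequency * smoothed IDF (an assumed variant).
    """
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        tf = Counter(tokens)
        vectors.append({
            term: (count / len(tokens))
            * (math.log((1 + n_docs) / (1 + df[term])) + 1)
            for term, count in tf.items()
        })
    return vectors


def random_oversample(samples, labels, seed=0):
    """Duplicate randomly chosen samples of smaller classes until all
    classes have as many samples as the largest one."""
    rng = random.Random(seed)
    by_label = {}
    for sample, label in zip(samples, labels):
        by_label.setdefault(label, []).append(sample)
    target = max(len(group) for group in by_label.values())
    out_samples, out_labels = [], []
    for label, group in by_label.items():
        extra = [rng.choice(group) for _ in range(target - len(group))]
        out_samples.extend(group + extra)
        out_labels.extend([label] * target)
    return out_samples, out_labels


# Placeholder texts and labels (1 = positive class), purely illustrative.
docs = ["alpha beta beta", "alpha gamma", "alpha delta", "beta epsilon"]
labels = [1, 0, 0, 0]

vectors = tfidf(docs)
balanced_x, balanced_y = random_oversample(vectors, labels)
```

The balanced feature dictionaries could then be passed to any of the classifiers listed in the abstract. Undersampling would instead drop majority-class samples down to the size of the smallest class.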
Pages: 915-934
Number of pages: 20