Cyber risk prediction through social media big data analytics and statistical machine learning

Cited by: 31
Authors
Subroto, Athor [1, 2]
Apriyana, Andri [3 ]
Affiliations
[1] Univ Indonesia, Fac Econ & Business, Dept Management, Depok, Indonesia
[2] Univ Indonesia, Sch Strateg & Global Studies SKSG, Jakarta, Indonesia
[3] Grp Audit & Risk Advisory PT Astra Int Tbk, Jakarta, Indonesia
Keywords
Predictive analytics; Machine learning; Big data; Cyber risks; Social media; Non-traditional actuary
DOI
10.1186/s40537-019-0216-1
Chinese Library Classification number
TP301 [Theory, methods]
Discipline classification code
081202
Abstract
As a natural outcome of achieving equilibrium, digital economic progress will most likely be subject to increased cyber risks. Therefore, the purpose of this study is to present an algorithmic model that uses social media big data analytics and statistical machine learning to predict cyber risks. The data for this study consisted of 83,015 instances from the Common Vulnerabilities and Exposures (CVE) database (early 1999 to March 2017) and 25,599 cases of cyber risks from Twitter (early 2016 to March 2017), after which 1,000 instances from both platforms were selected. The predictions were made by analyzing the software vulnerabilities to threats, based on social media conversations, while prediction accuracy was measured by comparing the cyber risk data from Twitter with that from the CVE database. Using a confusion matrix for evaluation, the best prediction was achieved by using the RWeka package to carry out the machine learning (ML) experiments with an artificial neural network (ANN), reaching an accuracy of 96.73%. Thus, in this paper, we offer new insights into cyber risks and how such vulnerabilities can be adequately understood and predicted. The findings of this study can be used by managers of public and private companies to formulate effective strategies for reducing cyber risks to critical infrastructures.
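The abstract reports accuracy derived from a confusion matrix. As a minimal sketch of that calculation, and not the authors' RWeka workflow, the Python snippet below divides the correct predictions (the diagonal) by the total number of predictions. The function name `accuracy_from_confusion` and the counts in `example` are hypothetical and chosen only for illustration; they are not data from the paper.

```python
def accuracy_from_confusion(matrix):
    """Return overall accuracy: diagonal (correct) counts over all counts."""
    total = sum(sum(row) for row in matrix)
    if total == 0:
        raise ValueError("confusion matrix contains no predictions")
    correct = sum(matrix[i][i] for i in range(len(matrix)))
    return correct / total


# Hypothetical counts, NOT taken from the paper.
# Rows are actual classes, columns are predicted classes.
example = [
    [90, 10],
    [5, 95],
]
accuracy = accuracy_from_confusion(example)
print(f"accuracy = {accuracy:.4f}")
```

The same ratio applies to matrices with more than two classes, since only the diagonal counts as correct.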
Pages: 19
Related papers
50 in total
  • [31] Machine learning and big data analytics in mood disorders
    Yang, Lu
    Chen, Jun
    [J]. FRONTIERS IN PSYCHIATRY, 2024, 15
  • [32] A Smart Social Insurance Big Data Analytics Framework Based on Machine Learning Algorithms
    Senousy, Youssef
    Shehab, Abdulaziz
    Hanna, Wael K.
    Riad, Alaa M.
    El-bakry, Hazem A.
    Elkhamisy, Nashaat
    [J]. CYBERNETICS AND INFORMATION TECHNOLOGIES, 2020, 20 (01) : 95 - 111
  • [33] Incorporating Big Data Tools for Social Media Analytics in a Business Analytics Course
    Zadeh, Amir H.
    Zolbanin, Hamed M.
    Sharda, Ramesh
    [J]. Journal of Information Systems Education, 2021, 32 (03) : 176 - 198
  • [34] Big Data vs. Data Mining for Social Media Analytics
    Danubianu, M.
    Barila, A.
    [J]. SMART 2014 - SOCIAL MEDIA IN ACADEMIA: RESEARCH AND TEACHING, 2015, : 261 - 269
  • [35] Using Big Data-machine learning models for diabetes prediction and flight delays analytics
    Thérence Nibareke
    Jalal Laassiri
    [J]. Journal of Big Data, 7
  • [36] Development of Big Data Predictive Analytics Model for Disease Prediction using Machine learning Technique
    Venkatesh, R.
    Balasubramanian, C.
    Kaliappan, M.
    [J]. JOURNAL OF MEDICAL SYSTEMS, 2019, 43 (08)
  • [37] Development of Big Data Predictive Analytics Model for Disease Prediction using Machine learning Technique
    R. Venkatesh
    C. Balasubramanian
    M. Kaliappan
    [J]. Journal of Medical Systems, 2019, 43
  • [38] Using Big Data-machine learning models for diabetes prediction and flight delays analytics
    Nibareke, Therence
    Laassiri, Jalal
    [J]. JOURNAL OF BIG DATA, 2020, 7 (01)
  • [39] Big data on a smaller scale: A social media analytics assignment
    Fischbach, Sarah
    Zarzosa, Jennifer
    [J]. JOURNAL OF EDUCATION FOR BUSINESS, 2018, 93 (03) : 142 - 148
  • [40] A Systematic Review Towards Big Data Analytics in Social Media
    Md.Saifur Rahman
    Hassan Reza
    [J]. Big Data Mining and Analytics, 2022, (03) : 228 - 244