Sentiment classification for insider threat identification using metaheuristic optimized machine learning classifiers

被引:1
|
作者
Mladenovic, Djordje [1 ]
Antonijevic, Milos [2 ]
Jovanovic, Luka [2 ]
Simic, Vladimir [3 ,4 ,5 ]
Zivkovic, Miodrag [2 ]
Bacanin, Nebojsa [2 ,6 ,7 ]
Zivkovic, Tamara [8 ]
Perisic, Jasmina [2 ]
机构
[1] ICT Coll Vocat Studies, Belgrade 11000, Serbia
[2] Singidunum Univ, Fac Informat & Comp, Belgrade 11000, Serbia
[3] Univ Belgrade, Fac Transport & Traff Engn, Vojvode Stepe 305, Belgrade 11010, Serbia
[4] Yuan Ze Univ, Coll Engn, Dept Ind Engn & Management, Taoyuan City 320315, Taiwan
[5] Korea Univ, Coll Informat, Dept Comp Sci & Engn, Seoul 02841, South Korea
[6] SIMATS, Saveetha Sch Engn, Dept Math, Chennai 602105, Tamilnadu, India
[7] Middle East Univ, MEU Res Unit, Amman 11831, Jordan
[8] Univ Belgrade, Sch Elect Engn, Belgrade 11000, Serbia
来源
SCIENTIFIC REPORTS | 2024年 / 14卷 / 01期
关键词
Insider threat; Natural language processing; Hyperparameter optimization; XGBoost; AdaBoost;
D O I
10.1038/s41598-024-77240-w
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
This study examines the formidable and complex challenge of insider threats to organizational security, addressing risks such as ransomware incidents, data breaches, and extortion attempts. The research involves six experiments utilizing email, HTTP, and file content data. To combat insider threats, emerging Natural Language Processing techniques are employed in conjunction with powerful Machine Learning classifiers, specifically XGBoost and AdaBoost. The focus is on recognizing the sentiment and context of malicious actions, which are considered less prone to change compared to commonly tracked metrics like location and time of access. To enhance detection, a term frequency-inverse document frequency-based approach is introduced, providing a more robust, adaptable, and maintainable method. Moreover, the study acknowledges the significant impact of hyperparameter selection on classifier performance and employs various contemporary optimizers, including a modified version of the red fox optimization algorithm. The proposed approach undergoes testing in three simulated scenarios using a public dataset, showcasing commendable outcomes.
引用
收藏
页数:39
相关论文
共 50 条
  • [41] NLP-based clinical text classification and sentiment analyses of complex medical transcripts using transformer model and machine learning classifiers
    Pratiyush Guleria
    Neural Computing and Applications, 2025, 37 (1) : 341 - 366
  • [42] Insider Threat Detection Based on NLP Word Embedding and Machine Learning
    Haq, Mohd Anul
    Khan, Mohd Abdul Rahim
    Alshehri, Mohammed
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2022, 33 (01): : 619 - 635
  • [43] Research Opportunity of Insider Threat Detection based on Machine Learning Methods
    Prajitno, Noer Tjahja Moekthi
    Hadiyanto, H.
    Rochim, Adian Fatchur
    2023 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE IN INFORMATION AND COMMUNICATION, ICAIIC, 2023, : 292 - 296
  • [44] Automated Boundary Identification for Machine Learning Classifiers
    Dobslaw, Felix
    Feldt, Robert
    2024 IEEE/ACM INTERNATIONAL WORKSHOP ON SEARCH-BASED AND FUZZ TESTING, SBFT 2024, 2024, : 1 - 8
  • [45] Use of Machine Learning in Big Data Analytics for Insider Threat Detection
    Mayhew, Michael
    Atighetchi, Michael
    Adler, Aaron
    Greenstadt, Rachel
    2015 IEEE MILITARY COMMUNICATIONS CONFERENCE (MILCOM 2015), 2015, : 915 - 922
  • [46] A study on Classification of Insider threat using Markov Chain Model
    Kim, Dong-Wook
    Hong, Sung-Sam
    Han, Myung-Mook
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2018, 12 (04): : 1887 - 1898
  • [47] Casting Online Votes: To Predict Offline Results Using Sentiment Analysis by machine learning Classifiers
    Juneja, Pragya
    Ojha, Uma
    2017 8TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2017,
  • [48] Harnessing Twitter for Automatic Sentiment Identification Using Machine Learning Techniques
    Dash, Amiya Kumar
    Rout, Jitendra Kumar
    Jena, Sanjay Kumar
    PROCEEDINGS OF 3RD INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING, NETWORKING AND INFORMATICS, ICACNI 2015, VOL 2, 2016, 44 : 507 - 514
  • [49] Statistical Features Identification for Sentiment Analysis using Machine Learning Techniques
    Kamal, Ahmad
    Abulaish, Muhammad
    2013 INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL AND BUSINESS INTELLIGENCE (ISCBI), 2013, : 178 - 181
  • [50] Sentiment analysis of financial Twitter posts on Twitter with the machine learning classifiers
    Cam, Handan
    Cam, Alper Veli
    Demirel, Ugur
    Ahmed, Sana
    HELIYON, 2024, 10 (01)