Improving ML Detection of IoT Botnets using Comprehensive Data and Feature Sets

被引:2
|
作者
Mehra, Misha [1 ]
Paranjape, Jay N. [1 ]
Ribeiro, Vinay J. [1 ]
机构
[1] Indian Inst Technol Delhi, Comp Sci & Engn, Delhi, India
关键词
IoT Botnet; IoT Security; Machine Learning; Malware Analysis; Sandboxing;
D O I
10.1109/COMSNETS51098.2021.9352943
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In recent times, the world has seen a tremendous increase in the number of attacks on IoT devices. A majority of these attacks have been botnet attacks, where an army of compromised IoT devices is used to launch DDoS attacks on targeted systems. In this paper, we study how the choice of a dataset and the extracted features determine the performance of a Machine Learning model, given the task of classifying Linux Binaries (ELFs) as being benign or malicious. Our work focuses on Linux systems since embedded Linux is the more popular choice for building today's IoT devices and systems. We propose using 4 different types of files as the dataset for any ML model. These include system files, IoT application files, IoT botnet files and general malware files. Further, we propose using static, dynamic as well as network features to do the classification task. We show that existing methods leave out one or the other features, or file types and hence, our model outperforms them in terms of accuracy in detecting these files. While enhancing the dataset adds to the robustness of a model, utilizing all 3 types of features decreases the false positive and false negative rates non-trivially. We employ an exhaustive scenario based method for evaluating a ML model and show the importance of including each of the proposed files in a dataset. We also analyze the features and try to explain their importance for a model, using observed trends in different benign and malicious files. We perform feature extraction using the open source Limon sandbox, which prior to this work has been tested only on Ubuntu 14. We installed and configured it for Ubuntu 18, the documentation of which has been shared on Github.
引用
收藏
页码:438 / 446
页数:9
相关论文
共 50 条
  • [41] Evaluating Standard Feature Sets Towards Increased Generalisability and Explainability of ML-Based Network Intrusion Detection
    Sarhan, Mohanad
    Layeghy, Siamak
    Portmann, Marius
    BIG DATA RESEARCH, 2022, 30
  • [42] Chinese Accent Detection Using Acoustic Feature Sets with Context Features
    Zhao YunXue
    Zheng ShiJie
    Zhang Long
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON LOGISTICS, ENGINEERING, MANAGEMENT AND COMPUTER SCIENCE, 2014, 101 : 1015 - 1018
  • [43] Improving Pavement Anomaly Detection Using Backward Feature Elimination
    Lin, Jun-Lin
    Peng, Zhi-Qiang
    Lai, Robert K.
    BUSINESS INFORMATION SYSTEMS (BIS 2017), 2017, 288 : 341 - 349
  • [44] Improving feature selection in anomaly intrusion detection using specifications
    Wang, Y
    Miner, A
    Wong, J
    Uppuluri, P
    DISTRIBUTED COMPUTING AND INTERNET TECHNOLOGY, PROCEEDINGS, 2004, 3347 : 468 - 468
  • [45] Efficient Hierarchical ML-Based IoT Intrusion Detection System Leveraging PSO and Sequential Forward Feature Selection
    Van Thinh Pham
    Khac-Tuan Nguyen
    Chien Trinh Nguyen
    Hai-Chau Le
    INTELLIGENCE OF THINGS: TECHNOLOGIES AND APPLICATIONS, ICIT 2024, VOL 2, 2025, 230 : 318 - 327
  • [46] Distributed Feature Selection for Big Data Using Fuzzy Rough Sets
    Kong, Linghe
    Qu, Wenhao
    Yu, Jiadi
    Zuo, Hua
    Chen, Guihai
    Xiong, Fei
    Pan, Shirui
    Lin, Siyu
    Qiu, Meikang
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2020, 28 (05) : 846 - 857
  • [47] FIID: Feature-Based Implicit Irregularity Detection Using Unsupervised Learning From IoT Data for Homecare of Elderly
    Shang, Cuijuan
    Chang, Chih-Yung
    Liu, Jinjun
    Zhao, Shenghui
    Sinha Roy, Diptendu
    IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (11) : 10884 - 10896
  • [48] Feature selection for IoT botnet detection using equilibrium and Battle Royale Optimization
    Bani Baker, Qanita
    Samarneh, Alaa
    Computers and Security, 2024, 147
  • [49] DFE: efficient IoT network intrusion detection using deep feature extraction
    Basati, Amir
    Faghih, Mohammad Mehdi
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (18): : 15175 - 15195
  • [50] Intrusion Detection System Using Feature Extraction with Machine Learning Algorithms in IoT
    Musleh, Dhiaa
    Alotaibi, Meera
    Alhaidari, Fahd
    Rahman, Atta
    Mohammad, Rami M.
    JOURNAL OF SENSOR AND ACTUATOR NETWORKS, 2023, 12 (02)