Improving ML Detection of IoT Botnets using Comprehensive Data and Feature Sets

被引：2

作者：

Mehra, Misha ^{[1
]}

Paranjape, Jay N. ^{[1
]}

Ribeiro, Vinay J. ^{[1
]}

机构：

[1] Indian Inst Technol Delhi, Comp Sci & Engn, Delhi, India

来源：

2021 INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS & NETWORKS (COMSNETS) | 2021年

关键词：

IoT Botnet; IoT Security; Machine Learning; Malware Analysis; Sandboxing;

D O I：

10.1109/COMSNETS51098.2021.9352943

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

In recent times, the world has seen a tremendous increase in the number of attacks on IoT devices. A majority of these attacks have been botnet attacks, where an army of compromised IoT devices is used to launch DDoS attacks on targeted systems. In this paper, we study how the choice of a dataset and the extracted features determine the performance of a Machine Learning model, given the task of classifying Linux Binaries (ELFs) as being benign or malicious. Our work focuses on Linux systems since embedded Linux is the more popular choice for building today's IoT devices and systems. We propose using 4 different types of files as the dataset for any ML model. These include system files, IoT application files, IoT botnet files and general malware files. Further, we propose using static, dynamic as well as network features to do the classification task. We show that existing methods leave out one or the other features, or file types and hence, our model outperforms them in terms of accuracy in detecting these files. While enhancing the dataset adds to the robustness of a model, utilizing all 3 types of features decreases the false positive and false negative rates non-trivially. We employ an exhaustive scenario based method for evaluating a ML model and show the importance of including each of the proposed files in a dataset. We also analyze the features and try to explain their importance for a model, using observed trends in different benign and malicious files. We perform feature extraction using the open source Limon sandbox, which prior to this work has been tested only on Ubuntu 14. We installed and configured it for Ubuntu 18, the documentation of which has been shared on Github.

引用

页码：438 / 446

页数：9

共 50 条

[41] Evaluating Standard Feature Sets Towards Increased Generalisability and Explainability of ML-Based Network Intrusion Detection
Sarhan, Mohanad
Layeghy, Siamak
Portmann, Marius
BIG DATA RESEARCH, 2022, 30
[42] Chinese Accent Detection Using Acoustic Feature Sets with Context Features
Zhao YunXue
Zheng ShiJie
Zhang Long
PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON LOGISTICS, ENGINEERING, MANAGEMENT AND COMPUTER SCIENCE, 2014, 101 : 1015 - 1018
[43] Improving Pavement Anomaly Detection Using Backward Feature Elimination
Lin, Jun-Lin
Peng, Zhi-Qiang
Lai, Robert K.
BUSINESS INFORMATION SYSTEMS (BIS 2017), 2017, 288 : 341 - 349
[44] Improving feature selection in anomaly intrusion detection using specifications
Wang, Y
Miner, A
Wong, J
Uppuluri, P
DISTRIBUTED COMPUTING AND INTERNET TECHNOLOGY, PROCEEDINGS, 2004, 3347 : 468 - 468
[45] Efficient Hierarchical ML-Based IoT Intrusion Detection System Leveraging PSO and Sequential Forward Feature Selection
Van Thinh Pham
Khac-Tuan Nguyen
Chien Trinh Nguyen
Hai-Chau Le
INTELLIGENCE OF THINGS: TECHNOLOGIES AND APPLICATIONS, ICIT 2024, VOL 2, 2025, 230 : 318 - 327
[46] Distributed Feature Selection for Big Data Using Fuzzy Rough Sets
Kong, Linghe
Qu, Wenhao
Yu, Jiadi
Zuo, Hua
Chen, Guihai
Xiong, Fei
Pan, Shirui
Lin, Siyu
Qiu, Meikang
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2020, 28 (05) : 846 - 857
[47] FIID: Feature-Based Implicit Irregularity Detection Using Unsupervised Learning From IoT Data for Homecare of Elderly
Shang, Cuijuan
Chang, Chih-Yung
Liu, Jinjun
Zhao, Shenghui
Sinha Roy, Diptendu
IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (11) : 10884 - 10896
[48] Feature selection for IoT botnet detection using equilibrium and Battle Royale Optimization
Bani Baker, Qanita
Samarneh, Alaa
Computers and Security, 2024, 147
[49] DFE: efficient IoT network intrusion detection using deep feature extraction
Basati, Amir
Faghih, Mohammad Mehdi
NEURAL COMPUTING & APPLICATIONS, 2022, 34 (18): : 15175 - 15195
[50] Intrusion Detection System Using Feature Extraction with Machine Learning Algorithms in IoT
Musleh, Dhiaa
Alotaibi, Meera
Alhaidari, Fahd
Rahman, Atta
Mohammad, Rami M.
JOURNAL OF SENSOR AND ACTUATOR NETWORKS, 2023, 12 (02)

← 1 2 3 4 5 →