Application of Machine Learning Algorithms for the Validation of a New CoAP-IoT Anomaly Detection Dataset

被引:8
|
作者
Vigoya, Laura [1 ]
Pardal, Alberto [1 ]
Fernandez, Diego [1 ]
Carneiro, Victor [1 ]
机构
[1] Ctr Informat & Commun Technol Res CITIC, Campus Elvina S-N, La Coruna 15071, Spain
来源
APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 07期
关键词
IoT; CoAP; sensors; dataset validation; machine learning; intrusion detection systems; analysis; metric; algorithm design; INTERNET;
D O I
10.3390/app13074482
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
With the rise in smart devices, the Internet of Things (IoT) has been established as one of the preferred emerging platforms to fulfil their need for simple interconnections. The use of specific protocols such as constrained application protocol (CoAP) has demonstrated improvements in the performance of the networks. However, power-, bandwidth-, and memory-constrained sensing devices constitute a weakness in the security of the system. One way to mitigate these security problems is through anomaly-based intrusion detection systems, which aim to estimate the behaviour of the systems based on their "normal" nature. Thus, to develop anomaly-based intrusion detection systems, it is necessary to have a suitable dataset that allows for their analysis. Due to the lack of a public dataset in the CoAP-IoT environment, this work aims to present a complete and labelled CoAP-IoT anomaly detection dataset (CIDAD) based on real-world traffic, with a sufficient trace size and diverse anomalous scenarios. The modelled data were implemented in a virtual sensor environment, including three types of anomalies in the CoAP data. The validation of the dataset was carried out using five shallow machine learning techniques: logistic regression, naive Bayes, random forest, AdaBoost, and support vector machine. Detailed analyses of the dataset, data conditioning, feature engineering, and hyperparameter tuning are presented. The evaluation metrics used in the performance comparison are accuracy, precision, recall, F1 score, and kappa score. The system achieved 99.9% accuracy for decision tree models. Random forest established itself as the best model, obtaining a 99.9% precision and F1 score, 100% recall, and a Cohen's kappa statistic of 0.99.
引用
收藏
页数:25
相关论文
共 50 条
  • [11] Application and evaluation of selected machine learning algorithms in anomaly detection module for SOC
    Warzynski, A.
    Bienias, P.
    Kolaczek, G.
    [J]. DEVELOPMENTS OF ARTIFICIAL INTELLIGENCE TECHNOLOGIES IN COMPUTATION AND ROBOTICS, 2020, 12 : 971 - 978
  • [12] Application of machine learning algorithms to KDD intrusion detection dataset within misuse detection context
    Sabhnani, M
    Serpen, G
    [J]. MLMTA'03: INTERNATIONAL CONFERENCE ON MACHINE LEARNING; MODELS, TECHNOLOGIES AND APPLICATIONS, 2003, : 209 - 215
  • [14] Evaluating Machine Learning Algorithms for Anomaly Detection in Clouds
    Gulenko, Anton
    Wallschlaeger, Marcel
    Schmidt, Florian
    Kao, Odej
    Liu, Feng
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2016, : 2716 - 2721
  • [15] Anomaly Detection in ICS Datasets with Machine Learning Algorithms
    Mubarak, Sinil
    Habaebi, Mohamed Hadi
    Islam, Md Rafiqul
    Rahman, Farah Diyana Abdul
    Tahir, Mohammad
    [J]. COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2021, 37 (01): : 33 - 46
  • [16] Machine Learning Methods for Anomaly Detection in IoT Networks, with Illustrations
    Bonandrini, Vassia
    Bercher, Jean-Francois
    Zangar, Nawel
    [J]. MACHINE LEARNING FOR NETWORKING (MLN 2019), 2020, 12081 : 287 - 295
  • [17] A machine learning approach for imputation and anomaly detection in IoT environment
    Vangipuram, Radhakrishna
    Gunupudi, Rajesh Kumar
    Puligadda, Veereswara Kumar
    Vinjamuri, Janaki
    [J]. EXPERT SYSTEMS, 2020, 37 (05)
  • [18] Application of advanced machine learning algorithms for anomaly detection and quantitative prediction in protein A chromatography
    Tiwari, Anamika
    Bansode, Vikrant
    Rathore, Anurag S.
    [J]. JOURNAL OF CHROMATOGRAPHY A, 2022, 1682
  • [19] Application of machine learning in BGP anomaly detection
    Dai, Xianbo
    Wang, Na
    Wang, Wenjuan
    [J]. 2018 INTERNATIONAL SEMINAR ON COMPUTER SCIENCE AND ENGINEERING TECHNOLOGY (SCSET 2018), 2019, 1176
  • [20] Application of machine learning algorithms in early prediction of diabetes dataset
    Dou, Yifeng
    [J]. ASIA-PACIFIC JOURNAL OF CLINICAL ONCOLOGY, 2022, 18 : 46 - 47