A Novel Outlier Detection with Feature Selection Enabled Streaming Data Classification

被引:1
|
作者
Rajakumar, R. [1 ]
Devi, S. Sathiya [2 ]
机构
[1] Anna Univ, Chennai 600025, India
[2] Univ Coll Engn, Anna Univ, BIT Campus, Trichirappali 620024, India
来源
关键词
Streaming data classi fi cation; outlier removal; feature selection; machine learning; metaheuristics; BIG DATA;
D O I
10.32604/iasc.2023.028889
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Due to the advancements in information technologies, massive quantity of data is being produced by social media, smartphones, and sensor devices. The investigation of data stream by the use of machine learning (ML) approaches to address regression, prediction, and classification problems have received consid-erable interest. At the same time, the detection of anomalies or outliers and feature selection (FS) processes becomes important. This study develops an outlier detec-tion with feature selection technique for streaming data classification, named ODFST-SDC technique. Initially, streaming data is pre-processed in two ways namely categorical encoding and null value removal. In addition, Local Correla-tion Integral (LOCI) is used which is significant in the detection and removal of outliers. Besides, red deer algorithm (RDA) based FS approach is employed to derive an optimal subset of features. Finally, kernel extreme learning machine (KELM) classifier is used for streaming data classification. The design of LOCI based outlier detection and RDA based FS shows the novelty of the work. In order to assess the classification outcomes of the ODFST-SDC technique, a series of simulations were performed using three benchmark datasets. The experimental results reported the promising outcomes of the ODFST-SDC technique over the recent approaches.
引用
收藏
页码:2101 / 2116
页数:16
相关论文
共 50 条
  • [1] Unsupervised Feature Selection for Outlier Detection on Streaming Data to Enhance Network Security
    Heigl, Michael
    Weigelt, Enrico
    Fiala, Dalibor
    Schramm, Martin
    APPLIED SCIENCES-BASEL, 2021, 11 (24):
  • [2] A novel feature selection approach for intrusion detection data classification
    Ambusaidi, Mohammed A.
    He, Xiangjian
    Tan, Zhiyuan
    Nanda, Priyadarsi
    Lu, Liang Fu
    Nagar, Upasana T.
    2014 IEEE 13TH INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM), 2014, : 82 - 89
  • [3] Outlier detection in classification based on feature-selection-based regression
    Su, Jinxia
    Liu, Qiwen
    Cui, Jingke
    Knowledge and Information Systems, 2025, 67 (02) : 1399 - 1414
  • [4] An Ensemble Filter Feature Selection Method and Outlier Detection Method for Multiclass Classification
    Ndirangu, Dalton
    Mwangi, Waweru
    Nderu, Lawrence
    2019 8TH INTERNATIONAL CONFERENCE ON SOFTWARE AND COMPUTER APPLICATIONS (ICSCA 2019), 2019, : 373 - 379
  • [5] Outlier Detection in Streaming Data A research Perspective
    Chugh, Neeraj
    Chugh, Mitali
    Agarwal, Alok
    2014 INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND GRID COMPUTING (PDGC), 2014, : 429 - 432
  • [6] Clustering Enabled Classification using Ensemble Feature Selection for Intrusion Detection
    Salo, Fadi
    Injadat, MohammadNoor
    Moubayed, Abdallah
    Nassif, Ali Bou
    Essex, Aleksander
    2019 INTERNATIONAL CONFERENCE ON COMPUTING, NETWORKING AND COMMUNICATIONS (ICNC), 2019, : 276 - 281
  • [7] Exploiting the Outcome of Outlier Detection for Novel Attack Pattern Recognition on Streaming Data
    Heigl, Michael
    Weigelt, Enrico
    Urmann, Andreas
    Fiala, Dalibor
    Schramm, Martin
    ELECTRONICS, 2021, 10 (17)
  • [8] Outlier Detection Ensemble with Embedded Feature Selection
    Cheng, Li
    Wang, Yijie
    Liu, Xinwang
    Li, Bin
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 3503 - 3512
  • [9] Covariance Based Outlier Detection with Feature Selection
    Zwilling, Chris E.
    Wang, Michelle Y.
    2016 38TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2016, : 2606 - 2609
  • [10] Online Feature Selection with Streaming Features for Classification
    You, Dian-Long
    Guo, Song
    Zhao, Chun-Hui
    Yuan, Fu-Yong
    Shen, Li-Min
    Chen, Zhen
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2020, 48 (02): : 321 - 332