A Novel Outlier Detection with Feature Selection Enabled Streaming Data Classification

被引:1
|
作者
Rajakumar, R. [1 ]
Devi, S. Sathiya [2 ]
机构
[1] Anna Univ, Chennai 600025, India
[2] Univ Coll Engn, Anna Univ, BIT Campus, Trichirappali 620024, India
来源
关键词
Streaming data classi fi cation; outlier removal; feature selection; machine learning; metaheuristics; BIG DATA;
D O I
10.32604/iasc.2023.028889
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Due to the advancements in information technologies, massive quantity of data is being produced by social media, smartphones, and sensor devices. The investigation of data stream by the use of machine learning (ML) approaches to address regression, prediction, and classification problems have received consid-erable interest. At the same time, the detection of anomalies or outliers and feature selection (FS) processes becomes important. This study develops an outlier detec-tion with feature selection technique for streaming data classification, named ODFST-SDC technique. Initially, streaming data is pre-processed in two ways namely categorical encoding and null value removal. In addition, Local Correla-tion Integral (LOCI) is used which is significant in the detection and removal of outliers. Besides, red deer algorithm (RDA) based FS approach is employed to derive an optimal subset of features. Finally, kernel extreme learning machine (KELM) classifier is used for streaming data classification. The design of LOCI based outlier detection and RDA based FS shows the novelty of the work. In order to assess the classification outcomes of the ODFST-SDC technique, a series of simulations were performed using three benchmark datasets. The experimental results reported the promising outcomes of the ODFST-SDC technique over the recent approaches.
引用
收藏
页码:2101 / 2116
页数:16
相关论文
共 50 条
  • [41] Unsupervised Feature Selection for Outlier Detection by Modelling Hierarchical Value-Feature Couplings
    Pang, Guansong
    Cao, Longbing
    Chen, Ling
    Liu, Huan
    2016 IEEE 16TH INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2016, : 410 - 419
  • [42] Feature Grouping-based Outlier Detection upon Streaming Trajectories (Extended abstract)
    Mao, Jiali
    Wang, Tao
    Jin, Cheqing
    Zhou, Aoying
    2018 IEEE 34TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2018, : 1745 - 1746
  • [43] Feature Interaction for Streaming Feature Selection
    Zhou, Peng
    Li, Peipei
    Zhao, Shu
    Wu, Xindong
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (10) : 4691 - 4702
  • [44] Outlier detection with streaming dyadic decomposition
    Gupta, Chetan
    Grossman, Robert
    ADVANCES IN DATA MINING: THEORETICAL ASPECTS AND APPLICATIONS, PROCEEDINGS, 2007, 4597 : 77 - +
  • [45] A Novel Outlier Detection Method for Multivariate Data
    Almardeny, Yahya
    Boujnah, Noureddine
    Cleary, Frances
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (09) : 4052 - 4062
  • [46] Harnessing the Power of GPUs to Speed Up Feature Selection for Outlier Detection
    Azmandian, Fatemeh
    Yilmazer, Ayse
    Dy, Jennifer G.
    Aslam, Javed A.
    Kaeli, David R.
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2014, 29 (03) : 408 - 422
  • [47] Harnessing the Power of GPUs to Speed Up Feature Selection for Outlier Detection
    Fatemeh Azmandian
    Ayse Yilmazer
    Jennifer G. Dy
    Javed A. Aslam
    David R. Kaeli
    Journal of Computer Science and Technology, 2014, 29 : 408 - 422
  • [48] Harnessing the Power of GPUs to Speed Up Feature Selection for Outlier Detection
    Fatemeh Azmandian
    Ayse Yilmazer
    Jennifer G.Dy
    Javed A.Aslam
    David R.Kaeli
    Journal of Computer Science & Technology, 2014, 29 (03) : 408 - 422
  • [49] A Novel Approach for Outlier Detection in Multivariate Data
    Afzal, Saima
    Afzal, Ayesha
    Amin, Muhammad
    Saleem, Sehar
    Ali, Nouman
    Sajid, Muhammad
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2021, 2021
  • [50] Towards Online Concept Drift Detection with Feature Selection for Data Stream Classification
    Hammoodi, Mahmood
    Stahl, Frederic
    Tennant, Mark
    ECAI 2016: 22ND EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, 285 : 1549 - 1550