A Novel Outlier Detection with Feature Selection Enabled Streaming Data Classification

被引:1
|
作者
Rajakumar, R. [1 ]
Devi, S. Sathiya [2 ]
机构
[1] Anna Univ, Chennai 600025, India
[2] Univ Coll Engn, Anna Univ, BIT Campus, Trichirappali 620024, India
来源
关键词
Streaming data classi fi cation; outlier removal; feature selection; machine learning; metaheuristics; BIG DATA;
D O I
10.32604/iasc.2023.028889
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Due to the advancements in information technologies, massive quantity of data is being produced by social media, smartphones, and sensor devices. The investigation of data stream by the use of machine learning (ML) approaches to address regression, prediction, and classification problems have received consid-erable interest. At the same time, the detection of anomalies or outliers and feature selection (FS) processes becomes important. This study develops an outlier detec-tion with feature selection technique for streaming data classification, named ODFST-SDC technique. Initially, streaming data is pre-processed in two ways namely categorical encoding and null value removal. In addition, Local Correla-tion Integral (LOCI) is used which is significant in the detection and removal of outliers. Besides, red deer algorithm (RDA) based FS approach is employed to derive an optimal subset of features. Finally, kernel extreme learning machine (KELM) classifier is used for streaming data classification. The design of LOCI based outlier detection and RDA based FS shows the novelty of the work. In order to assess the classification outcomes of the ODFST-SDC technique, a series of simulations were performed using three benchmark datasets. The experimental results reported the promising outcomes of the ODFST-SDC technique over the recent approaches.
引用
收藏
页码:2101 / 2116
页数:16
相关论文
共 50 条
  • [31] A Novel Cloud Intrusion Detection System Using Feature Selection and Classification
    Kannan, Anand
    Venkatesan, Karthik Gururajan
    Stagkopoulou, Alexandra
    Li, Sheng
    Krishnan, Sathyavakeeswaran
    Rahman, Arifur
    INTERNATIONAL JOURNAL OF INTELLIGENT INFORMATION TECHNOLOGIES, 2015, 11 (04) : 1 - 15
  • [32] Online streaming feature selection for multigranularity hierarchical classification learning
    Wang, Chenxi
    Zhang, Xiaoqing
    Ye, Liqin
    Mao, Yu
    Li, Shaozi
    Lin, Yaojin
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (17):
  • [33] Microarray classification with hierarchical data representation and novel feature selection criteria
    Bosio, Mattia
    Bellot, Pau
    Salembier, Philippe
    Oliveras Verges, Albert
    IEEE 12TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS & BIOENGINEERING, 2012, : 344 - 349
  • [34] A novel oversampling and feature selection hybrid algorithm for imbalanced data classification
    Fang Feng
    Kuan-Ching Li
    Erfu Yang
    Qingguo Zhou
    Lihong Han
    Amir Hussain
    Mingjiang Cai
    Multimedia Tools and Applications, 2023, 82 : 3231 - 3267
  • [35] A novel oversampling and feature selection hybrid algorithm for imbalanced data classification
    Feng, Fang
    Li, Kuan-Ching
    Yang, Erfu
    Zhou, Qingguo
    Han, Lihong
    Hussain, Amir
    Cai, Mingjiang
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (03) : 3231 - 3267
  • [36] Optimal and Novel Hybrid Feature Selection Framework for Effective Data Classification
    Venkataraman, Sivakumar
    Selvaraj, Rajalakshmi
    ADVANCES IN SYSTEMS, CONTROL AND AUTOMATION, 2018, 442 : 499 - 514
  • [37] Streaming feature selection algorithms for big data: A survey
    AlNuaimi, Noura
    Masud, Mohammad Mehedy
    Serhani, Mohamed Adel
    Zaki, Nazar
    APPLIED COMPUTING AND INFORMATICS, 2022, 18 (1/2) : 113 - 135
  • [38] Local Feature Selection for Data Classification
    Armanfard, Narges
    Reilly, James P.
    Komeili, Majid
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (06) : 1217 - 1227
  • [39] Enhancing mass spectrometry data analysis: A novel framework for calibration, outlier detection, and classification
    Peng, Weili
    Zhou, Tao
    Chen, Yuanyuan
    PATTERN RECOGNITION LETTERS, 2024, 182 : 1 - 8
  • [40] A Hybrid PSO-MiLOF Approach for Outlier Detection in Streaming Data
    Karate, Ankita
    Lazarova, Milena
    Koleva, Pavlina
    Poulkov, Vladimir
    2020 43RD INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2020, : 474 - 479