A Novel Outlier Detection with Feature Selection Enabled Streaming Data Classification

Cited by: 1
Authors
Rajakumar, R. [1 ]
Devi, S. Sathiya [2 ]
Affiliations
[1] Anna Univ, Chennai 600025, India
[2] Univ Coll Engn, Anna Univ, BIT Campus, Trichirappali 620024, India
Source
Keywords
Streaming data classification; outlier removal; feature selection; machine learning; metaheuristics; big data;
DOI
10.32604/iasc.2023.028889
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract
Advances in information technology mean that massive quantities of data are now produced by social media, smartphones, and sensor devices. The analysis of data streams with machine learning (ML) approaches to address regression, prediction, and classification problems has received considerable interest. At the same time, detecting anomalies or outliers and performing feature selection (FS) have become important. This study develops an outlier detection with feature selection technique for streaming data classification, named the ODFST-SDC technique. Initially, the streaming data is pre-processed in two ways, namely categorical encoding and null value removal. Next, the Local Correlation Integral (LOCI) is used to detect and remove outliers. A red deer algorithm (RDA) based FS approach is then employed to derive an optimal subset of features. Finally, a kernel extreme learning machine (KELM) classifier is used for streaming data classification. The design of LOCI-based outlier detection and RDA-based FS constitutes the novelty of the work. To assess the classification outcomes of the ODFST-SDC technique, a series of simulations was performed using three benchmark datasets. The experimental results showed promising outcomes for the ODFST-SDC technique compared with recent approaches.
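To make the first and last stages named in the abstract concrete, the following is a minimal sketch of null value removal followed by a kernel extreme learning machine classifier. It is an illustration under stated assumptions, not the authors' implementation: the RBF kernel, the values of `C` and `gamma`, the helper names `drop_null_rows`, `rbf_kernel`, and `KELM`, and the synthetic two-cluster data are all hypothetical choices. Categorical encoding, the LOCI outlier step, and the RDA feature selection step are omitted.

```python
import numpy as np


def drop_null_rows(X, y):
    """Null value removal: keep only rows of X that contain no NaN."""
    mask = ~np.isnan(X).any(axis=1)
    return X[mask], y[mask]


def rbf_kernel(A, B, gamma=1.0):
    """RBF kernel matrix between the rows of A and the rows of B."""
    sq = (A ** 2).sum(axis=1)[:, None] + (B ** 2).sum(axis=1)[None, :] - 2.0 * A @ B.T
    return np.exp(-gamma * np.maximum(sq, 0.0))


class KELM:
    """Kernel extreme learning machine with a closed-form solution."""

    def __init__(self, C=100.0, gamma=1.0):
        self.C = C          # regularization strength (assumed value)
        self.gamma = gamma  # RBF kernel width (assumed value)

    def fit(self, X, y):
        X = np.asarray(X, dtype=float)
        y = np.asarray(y)
        self.classes_ = np.unique(y)
        # One-hot target matrix, one column per class.
        T = (y[:, None] == self.classes_[None, :]).astype(float)
        K = rbf_kernel(X, X, self.gamma)
        # Output weights: beta = (K + I / C)^{-1} T
        self.beta_ = np.linalg.solve(K + np.eye(len(X)) / self.C, T)
        self.X_ = X
        return self

    def predict(self, X):
        K = rbf_kernel(np.asarray(X, dtype=float), self.X_, self.gamma)
        return self.classes_[np.argmax(K @ self.beta_, axis=1)]


# Synthetic data: two well-separated clusters, with one missing value injected.
rng = np.random.default_rng(0)
X_raw = np.vstack([rng.normal(0.0, 0.5, (50, 2)), rng.normal(5.0, 0.5, (50, 2))])
y_raw = np.array([0] * 50 + [1] * 50)
X_raw[3, 0] = np.nan

X, y = drop_null_rows(X_raw, y_raw)
model = KELM().fit(X, y)
print("rows kept:", len(X))
print("training accuracy:", (model.predict(X) == y).mean())
```

Because the output weights come from a single linear solve rather than iterative training, a model of this kind can be refit quickly on each new window of a stream, although the kernel matrix grows quadratically with the window size.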
Pages: 2101-2116
Page count: 16
Related papers
50 in total
  • [21] Hybrid classification with meta-heuristic-enabled optimal feature selection for thyroid detection
    Bhausaheb, Rajole N.
    Vitthal, Gond J.
    INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2021, 31 (03) : 1468 - 1485
  • [22] PTAOD: A Novel Framework for Supporting Approximate Outlier Detection Over Streaming Data for Edge Computing
    Zhu, Rui
    Yu, Tiantian
    Tan, Zhiyuan
    Du, Wei
    Zhao, Liang
    Li, Jiajia
    Xia, Xiufeng
    IEEE ACCESS, 2020, 8 (08) : 1475 - 1485
  • [23] On the Improvement of the Isolation Forest Algorithm for Outlier Detection with Streaming Data
    Heigl, Michael
    Anand, Kumar Ashutosh
    Urmann, Andreas
    Fiala, Dalibor
    Schramm, Martin
    Hable, Robert
    ELECTRONICS, 2021, 10 (13)
  • [24] MEOD: Memory-Efficient Outlier Detection on Streaming Data
    Karale, Ankita
    Lazarova, Milena
    Koleva, Pavlina
    Poulkov, Vladimir
    SYMMETRY-BASEL, 2021, 13 (03):
  • [25] Outlier Detection in Streaming Data for Telecommunications and Industrial Applications: A Survey
    Mfondoum, Roland N.
    Ivanov, Antoni
    Koleva, Pavlina
    Poulkov, Vladimir
    Manolova, Agata
    ELECTRONICS, 2024, 13 (16)
  • [26] Unsupervised Outlier Detection in Streaming Data Using Weighted Clustering
    Thakran, Yogita
    Toshniwal, Durga
    2012 12TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS (ISDA), 2012, : 947 - 952
  • [27] Real-time Outlier Detection over Streaming Data
    Yu, Kangqing
    Shi, Wei
    Santoro, Nicola
    Ma, Xiangyu
    2019 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING & COMMUNICATIONS, CLOUD & BIG DATA COMPUTING, INTERNET OF PEOPLE AND SMART CITY INNOVATION (SMARTWORLD/SCALCOM/UIC/ATC/CBDCOM/IOP/SCI 2019), 2019, : 125 - 132
  • [28] Concept Drift Detection on Streaming Data with Dynamic Outlier Aggregation
    Zellner, Ludwig
    Richter, Florian
    Sontheim, Janina
    Maldonado, Andrea
    Seidl, Thomas
    PROCESS MINING WORKSHOPS, ICPM 2020 INTERNATIONAL WORKSHOPS, 2021, 406 : 206 - 217
  • [29] Novel Outlier Detection by Integration of Clustering and Classification
    Tripathy, Sarita
    Sahoo, Laxman
    DATA SCIENCE AND BIG DATA ANALYTICS, 2019, 16 : 169 - 176
  • [30] A Robust AUC Maximization Framework With Simultaneous Outlier Detection and Feature Selection for Positive-Unlabeled Classification
    Ren, Ke
    Yang, Haichuan
    Zhao, Yu
    Chen, Wu
    Xue, Mingshan
    Miao, Hongyu
    Huang, Shuai
    Liu, Ji
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2019, 30 (10) : 3072 - 3083