Labelled Classifier with Weighted Drift Trigger Model using Machine Learning for Streaming Data Analysis

被引:0
|
作者
Prasad, Gollanapalli, V [1 ]
Rao, S. Krishna Mohan [2 ]
Sharma, Kapil [3 ]
Venkatadri, M. [4 ]
Krishna, B. Rama [1 ]
机构
[1] GNI Tech Campus, Hyderabad, Telangana, India
[2] Sidartha Inst Enginiring Techol, Hydeabad, Telangana, India
[3] Amity Univ, Comp Sci & Engn, Gwalior, India
[4] Amity Sch Engn & Technol ASET Gwalior, Gwalior, India
关键词
Data Clustering; Data Classification; Data Stream Mining; Streaming Data; Drift Detection; Drift Trigger Model; Labelled Classifier; ENSEMBLE;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The term "data-drift " refers to a difference between the data used to test and validate a model and the data used to deploy it in production. It is possible for data to drift for a variety of reasons. The track of time is an important consideration. Data mining procedures such as classification, clustering, and data stream mining are critical to information extraction and knowledge discovery because of the possibility for significant data type and dimensionality changes over time. The amount of research on mining and analyzing real-time streaming data has risen dramatically in the recent decade. As the name suggests, it's a stream of data that originates from a number of sources. Analyzing information assets has taken on increased significance in the quest for real-time analytics fulfilment. Traditional mining methods are no longer effective since data is acting in a different way. Aside from storage and temporal constraints, data streams provide additional challenges because just a single pass of the data is required. The dynamic nature of data streams makes it difficult to run any mining method, such as classification, clustering, or indexing, in a single iteration of data. This research identifies concept drift in streaming data classification. For data classification techniques, a Labelled Classifier with Weighted Drift Trigger Model (LCWDTM) is proposed that provides categorization and the capacity to tackle concept drift difficulties. The proposed classifier efficiency is contrasted with the existing classifiers and the results represent that the proposed model in data drift detection is accurate and efficient.
引用
收藏
页码:349 / 356
页数:8
相关论文
共 50 条
  • [1] Incremental Bayesian Classifier for Streaming Data with Concept Drift
    Wu, Peng
    Xiong, Ning
    Li, Gang
    Lv, Jinrui
    ADVANCES IN NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, ICNC-FSKD 2022, 2023, 153 : 509 - 518
  • [2] A systematic review on detection and adaptation of concept drift in streaming data using machine learning techniques
    Arora, Shruti
    Rani, Rinkle
    Saxena, Nitin
    WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2024, 14 (04)
  • [3] Active Learning Classifier for Streaming Data
    Wozniak, Michal
    Cyganek, Boguslaw
    Kasprzak, Andrzej
    Ksieniewicz, Pawel
    Walkowiak, Krzysztof
    Hybrid Artificial Intelligent Systems, 2016, 9648 : 186 - 197
  • [4] Forgetful Forests: Data Structures for Machine Learning on Streaming Data under Concept Drift
    Yuan, Zhehu
    Sun, Yinqi
    Shasha, Dennis
    ALGORITHMS, 2023, 16 (06)
  • [5] Robust ecological analysis of camera trap data labelled by a machine learning model
    Whytock, Robin C.
    Swiezewski, Jedrzej
    Zwerts, Joeri A.
    Pambo, Aurelie Flore Koumba
    Rogala, Marek
    Bahaa-el-din, Laila
    Boekee, Kelly
    Brittain, Stephanie
    Cardoso, Anabelle W.
    Henschel, Philipp
    Lehmann, David
    Momboua, Brice
    Opepa, Cisquet Kiebou
    Orbell, Christopher
    Pitman, Ross T.
    Robinson, Hugh S.
    Abernethy, Katharine A.
    METHODS IN ECOLOGY AND EVOLUTION, 2021, 12 (06): : 1080 - 1092
  • [6] SETL: a transfer learning based dynamic ensemble classifier for concept drift detection in streaming data
    Arora, Shruti
    Rani, Rinkle
    Saxena, Nitin
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2024, 27 (03): : 3417 - 3432
  • [7] Concept Drift Detection in Streams of Labelled Data Using the Restricted Boltzmann Machine
    Jaworski, Maciej
    Duda, Piotr
    Rutkowski, Leszek
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [8] Weighted variable kernel support vector machine classifier for metabolomics data analysis
    Huang, Xin
    Xu, Qing-Song
    Yun, Yong-Huan
    Huang, Jian-Hua
    Liang, Yi-Zeng
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2015, 146 : 365 - 370
  • [9] Machine Learning Model Drift Detection Via Weak Data Slices
    Ackermant, Samuel
    Dube, Parijat
    Farchi, Eitan
    Raz, Orna
    Zalmanovici, Marcel
    2021 IEEE/ACM THIRD INTERNATIONAL WORKSHOP ON DEEP LEARNING FOR TESTING AND TESTING FOR DEEP LEARNING (DEEPTEST 2021), 2021, : 1 - 8
  • [10] Machine learning on sequential data using a recurrent weighted average
    Ostmeyer, Jared
    Cowell, Lindsay
    NEUROCOMPUTING, 2019, 331 : 281 - 288