A novel approach using incremental oversampling for data stream mining

被引:5
|
作者
Anupama, N. [1 ]
Jena, Sudarson [2 ]
机构
[1] GITAM Univ, Hyderabad, India
[2] Sambalpur Univ, Inst Informat Technol, Sambalpur, India
关键词
Knowledge discovery; Data streams; Imbalanced data; Oversampling; Increment over sampling for data streams (IOSDS); CLASSIFICATION;
D O I
10.1007/s12530-018-9249-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data stream mining is very popular in recent years with advanced electronic devices generating continuous data streams. The performance of standard learning algorithms is been compromised with imbalance nature present in real world data streams. In this paper we propose a novel algorithm dubbed as increment over sampling for data streams (IOSDS) which uses an unique over sampling technique to almost balance the data sets to minimize the effect of imbalance in stream mining process. The experimental analysis is conducted on 15 data chunks of data streams with varied sizes and different imbalance ratios. The results suggests that the proposed IOSDS algorithm improves the knowledge discovery over benchmark algorithms like C4.5 and Hoeffding tree in terms of standard performance measures namely accuracy, AUC, precision, recall and F-measure.
引用
收藏
页码:351 / 362
页数:12
相关论文
共 50 条
  • [1] A novel approach using incremental oversampling for data stream mining
    N. Anupama
    Sudarson Jena
    [J]. Evolving Systems, 2019, 10 : 351 - 362
  • [2] A novel approach for mining frequent patterns from incremental data
    Jindal, Rajni
    Borah, Malaya Dutta
    [J]. INTERNATIONAL JOURNAL OF DATA MINING MODELLING AND MANAGEMENT, 2016, 8 (03) : 244 - 264
  • [3] A novel approach for data stream maximal frequent itemsets mining
    [J]. Xu, Chong-Huan (talentxch@163.com), 1600, Inderscience Enterprises Ltd., 29, route de Pre-Bois, Case Postale 856, CH-1215 Geneva 15, CH-1215, Switzerland (10):
  • [4] Incremental learning framework for mining big data stream
    Eisa, Alaa
    EL-Rashidy, Nora
    Alshehri, Mohammad Dahman
    El-Bakry, Hazem M.
    Abdelrazek, Samir
    [J]. Computers, Materials and Continua, 2022, 71 (02): : 2901 - 2921
  • [5] Incremental Learning Framework for Mining Big Data Stream
    Eisa, Alaa
    EL-Rashidy, Nora
    Alshehri, Mohammad Dahman
    El-bakry, Hazem M.
    Abdelrazek, Samir
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 71 (02): : 2901 - 2921
  • [6] SPAMS: A Novel Incremental Approach for Sequential Pattern Mining in Data Streams
    Vinceslas, Lionel
    Symphor, Jean-Emile
    Mancheron, Alban
    Poncelet, Pascal
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND MANAGEMENT, 2010, 292 : 201 - 216
  • [7] Classifying Sonar Signals Using an Incremental Data Stream Mining Methodology with Conflict Analysis
    Fong, Simon
    Deb, Suash
    Thampi, Sabu
    [J]. ADVANCES IN SIGNAL PROCESSING AND INTELLIGENT RECOGNITION SYSTEMS, 2014, 264 : 171 - 182
  • [8] A Pristine Clean Cabalistic Foruity Strategize Based Approach for Incremental Data Stream Privacy Preserving Data Mining
    Gitanjali, J.
    Indumathi, J.
    Iyengar, N. Ch. Sriman Narayana
    [J]. 2010 IEEE 2ND INTERNATIONAL ADVANCE COMPUTING CONFERENCE, 2010, : 400 - 405
  • [9] Novel Approach for Generating the Key of Stream Cipher System Using Random Forest Data Mining Algorithm
    Ali, Samaher Hussein
    [J]. 2013 SIXTH INTERNATIONAL CONFERENCE ON DEVELOPMENTS IN ESYSTEMS ENGINEERING (DESE), 2014, : 259 - 269
  • [10] GraSeq: A novel approximate mining approach of sequential patterns over data stream
    Li, Haifeng
    Chen, Hong
    [J]. ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2007, 4632 : 401 - +