An ensemble of the distance-based and Naive Bayes classifiers for the online classification with data reduction

被引:0
|
作者
Jedrzejowicz, Joanna [1 ]
Jedrzejowicz, Piotr [2 ]
机构
[1] Univ Gdansk, Inst Informat, Fac Math Phys & Informat, PL-80308 Gdansk, Poland
[2] Gdynia Maritime Univ, Dept Informat Syst, Gdynia, Poland
关键词
FUZZY; ALGORITHM;
D O I
10.3233/JIFS-169127
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The paper proposes two variants of the ensemble distance-based and Naive-Bayes online classifiers with data reduction. In the first variant the reduced dataset is obtained through applying bias-correction fuzzy clustering. In the second we used the kernel-based fuzzy clustering as the data reduction tool. It is assumed that vectors of data with unknown class label arrive one by one, and that there is available an initial chunk of data with known class labels serving as the initial training set. Classification is carried-out in rounds. Each round involves a number of the classification decisions equal to the chunk size. For each round a set of base classifiers is constructed using different distance metrics. Set of base classifiers is extended with the Naive-Bayes classifier. The unknown label of each incoming vector is determined through weighted majority voting. After each round has been completed the training set is replaced by the fresh one and the classification process is continued. The approach is validated through computational experiment involving a number of datasets often used for testing data streams mining algorithms.
引用
收藏
页码:1289 / 1296
页数:8
相关论文
共 50 条
  • [21] SSV criterion based discretization for naive Bayes classifiers
    Grabczewski, K
    [J]. ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING - ICAISC 2004, 2004, 3070 : 574 - 579
  • [22] Human activity classification using Decision Tree and Naive Bayes classifiers
    Maswadi, Kholoud
    Ghani, Norjihan Abdul
    Hamid, Suraya
    Rasheed, Muhammads Babar
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (14) : 21709 - 21726
  • [23] COMPARISON OF NAIVE BAYES AND SUPPORT VECTOR MACHINE CLASSIFIERS ON DOCUMENT CLASSIFICATION
    Moe, Zun Hlaing
    San, Thida
    Khin, Mie Mie
    Tin, Hlaing May
    [J]. 2018 IEEE 7TH GLOBAL CONFERENCE ON CONSUMER ELECTRONICS (GCCE 2018), 2018, : 466 - 467
  • [24] Classification for Authorship of Tweets by Comparing Logistic Regression and Naive Bayes Classifiers
    Aborisade, Opeyemi Mulikat
    Anwar, Mohd
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION (IRI), 2018, : 269 - 276
  • [25] Online Naive Bayes Classification for Network Intrusion Detection
    Gumus, Fatma
    Sakar, C. Okan
    Erdem, Zeki
    Kursun, Olcay
    [J]. 2014 PROCEEDINGS OF THE IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM 2014), 2014, : 670 - 674
  • [26] Ensemble Classifiers based on Kernel ICA for Cancer Data Classification
    Zhou, Jin
    Lin, Yongzheng
    Chen, Yuehui
    [J]. PROCEEDINGS OF THE 2009 2ND INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING AND INFORMATICS, VOLS 1-4, 2009, : 1596 - 1600
  • [27] Local distance-based classification
    Laguia, Manuel
    Castro, Juan Luis
    [J]. KNOWLEDGE-BASED SYSTEMS, 2008, 21 (07) : 692 - 703
  • [28] Distance-based classification methods
    Ekin, O
    Hammer, PL
    Kogan, A
    Winter, P
    [J]. INFOR, 1999, 37 (03) : 337 - 352
  • [29] Ensemble Classifiers Based on Kernel PCA for Cancer Data Classification
    Zhou, Jin
    Pan, Yuqi
    Chen, Yuehui
    Liu, Yang
    [J]. EMERGING INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS: WITH ASPECTS OF ARTIFICIAL INTELLIGENCE, 2009, 5755 : 955 - +
  • [30] The Distance-Based Balancing Ensemble Method for Data With a High Imbalance Ratio
    Chen, Dong
    Wang, Xiao-Jun
    Zhou, Changjun
    Wang, Bin
    [J]. IEEE ACCESS, 2019, 7 : 68940 - 68956