Mahalanobis Distance Metric Learning Algorithm for Instance-based Data Stream Classification

Authors
Rivero Perez, Jorge Luis [1]
Ribeiro, Bernardete [2]
Perez, Carlos Morell [3]
Affiliations
[1] Univ Coimbra, Fac Sci & Technol, Dept Informat Engn, Coimbra, Portugal
[2] Univ Coimbra, CISUC, Dept Informat Engn, Coimbra, Portugal
[3] Cent Univ Las Villas, Fac Comp Sci, Dept Comp Sci, Santa Clara, Cuba
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
With today's massive data challenges and the rapid growth of technology, stream mining has recently received considerable attention. To address the large number of scenarios in which this phenomenon manifests itself, suitable tools are required in various research fields. Instance-based data stream algorithms generally employ the Euclidean distance for the classification task underlying this problem. A novel way to look into this issue is to take advantage of a more flexible metric, given the increased requirements imposed by the data stream scenario. In this paper we present a new algorithm that learns a Mahalanobis metric using similarity and dissimilarity constraints in an online manner. This approach hybridizes a Mahalanobis distance metric learning algorithm and a k-NN data stream classification algorithm with concept drift detection. First, some basic aspects of Mahalanobis distance metric learning are described, taking into account key properties as well as online distance metric learning algorithms. Second, we implement specific evaluation methodologies and comparative metrics, such as the Q statistic, for data stream classification algorithms. Finally, our algorithm is evaluated on different datasets by comparing its results with one of the best state-of-the-art instance-based data stream classification algorithms. The results demonstrate that our proposal is better in some scenarios and competitive in others.
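As a rough illustration of why a learned metric can matter for instance-based classification, the sketch below computes the Mahalanobis distance sqrt((x - y)^T M (x - y)) and uses it in a plain k-NN vote over a small window of stored instances. It is not the authors' algorithm: the function names, the toy window, and the fixed matrix M are all assumptions made for illustration, and the online learning of M from similarity and dissimilarity constraints and the concept drift detection are omitted entirely.

```python
from collections import Counter

import numpy as np


def mahalanobis(x, y, M):
    """Return sqrt((x - y)^T M (x - y)) for a positive semidefinite matrix M."""
    d = np.asarray(x, dtype=float) - np.asarray(y, dtype=float)
    return float(np.sqrt(d @ M @ d))


def knn_predict(query, window_X, window_y, M, k=3):
    """Majority vote among the k stored instances nearest to query under M."""
    dists = [mahalanobis(query, x, M) for x in window_X]
    nearest = np.argsort(dists)[:k]
    votes = Counter(window_y[i] for i in nearest)
    return votes.most_common(1)[0][0]


# Hypothetical window of two labeled instances (illustrative data only).
window_X = np.array([[0.0, 2.0], [1.5, 0.0]])
window_y = ["a", "b"]
query = np.array([0.0, 0.0])

# With M = I the distance reduces to the Euclidean distance.
euclidean_label = knn_predict(query, window_X, window_y, np.eye(2), k=1)

# An assumed metric that stretches the first feature and shrinks the second.
M = np.diag([4.0, 0.25])
mahalanobis_label = knn_predict(query, window_X, window_y, M, k=1)

print(euclidean_label)    # b
print(mahalanobis_label)  # a
```

With the identity matrix the nearest neighbour is the instance labeled "b" (distance 1.5 versus 2.0); under the assumed diagonal M the distances become 3.0 and 1.0, so the prediction changes to "a".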
Pages: 1857-1862
Number of pages: 6