Imbalance Data Classification Using Local Mahalanobis Distance Learning Based on Nearest Neighbor

被引:0
|
作者
Siddappa N.G. [1 ]
Kampalappa T. [1 ]
机构
[1] Visvesvaraya Technological University, Belagavi
关键词
Anomalies; Classification; Imbalance data; Information loss; Local Mahalanobis distance learning; Nearest neighbor; Sampling;
D O I
10.1007/s42979-020-0085-x
中图分类号
学科分类号
摘要
In the dataset, any one of its classes is normally outnumbered by other classes and is known as class imbalance data. Many standard learning algorithms face the classification problem in performance due to imbalance data. The issues can be solved by many existing conventional methods such as cost-sensitive, sampling or ensemble methods. But these methods alter the original data distribution, which leads to loss of useful information of the users and it may cause unexpected errors or increase the problem of overfitting. In this research, local Mahalanobis distance learning (LMDL) method is applied in the nearest neighbor (NN) for improving the performance of the classification in the imbalance dataset. The multiple distance metrics are used in the LMDL to investigate the data effectively and obtain the relevant features based on the analysis. The distance metric uses the original data for learning the prototype and support the NN. A number of experiments on various datasets are conducted for validating the quality as well as the efficiency of the proposed LMDL method. The experimental results stated that the proposed LMDL achieved nearly 82% in E-coli dataset, 94% in breast cancer dataset and 98% in Iris dataset for all metrics such as accuracy, precision, recall and F-measure. © 2020, The Author(s).
引用
收藏
相关论文
共 50 条
  • [1] Generative Local Metric Learning for Nearest Neighbor Classification
    Noh, Yung-Kyun
    Zhang, Byoung-Tak
    Lee, Daniel D.
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (01) : 106 - 118
  • [2] Nearest neighbor classification using cam weighted distance
    Zhou, CY
    Chen, YQ
    FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, PT 2, PROCEEDINGS, 2005, 3614 : 100 - 109
  • [3] Data compression and local metrics for nearest neighbor classification
    Ricci, F
    Avesani, P
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1999, 21 (04) : 380 - 384
  • [4] Distance Metric Learning for Large Margin Nearest Neighbor Classification
    Weinberger, Kilian Q.
    Saul, Lawrence K.
    JOURNAL OF MACHINE LEARNING RESEARCH, 2009, 10 : 207 - 244
  • [5] Learning a Mahalanobis distance metric for data clustering and classification
    Xiang, Shiming
    Nie, Feiping
    Zhang, Changshui
    PATTERN RECOGNITION, 2008, 41 (12) : 3600 - 3612
  • [6] Spectral Clustering Approach with K-Nearest Neighbor and Weighted Mahalanobis Distance for Data Mining
    Yin, Lifeng
    Lv, Lei
    Wang, Dingyi
    Qu, Yingwei
    Chen, Huayue
    Deng, Wu
    ELECTRONICS, 2023, 12 (15)
  • [7] Nearest Neighbor Method Based on Local Distribution for Classification
    Mao, Chengsheng
    Hu, Bin
    Moore, Philip
    Su, Yun
    Wang, Manman
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PART I, 2015, 9077 : 239 - 250
  • [8] A Correlation-Based Distance Function for Nearest Neighbor Classification
    Rodriguez, Yanet
    De Baets, Bernard
    Garcia, Maria M.
    Morell, Carlos
    Grau, Ricardo
    PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS AND APPLICATIONS, PROCEEDINGS, 2008, 5197 : 284 - +
  • [9] HIERARCHICAL DISTANCE METRIC LEARNING FOR LARGE MARGIN NEAREST NEIGHBOR CLASSIFICATION
    Sun, Shiliang
    Chen, Qiaona
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2011, 25 (07) : 1073 - 1087
  • [10] Nearest Neighbor-Based Contrastive Learning for Hyperspectral and LiDAR Data Classification
    Wang, Meng
    Gao, Feng
    Dong, Junyu
    Li, Heng-Chao
    Du, Qian
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61