An Optimized k-NN Approach for Classification on Imbalanced Datasets with Missing Data

被引:8
|
作者
Ozan, Ezgi Can [1 ]
Riabchenko, Ekaterina [1 ]
Kiranyaz, Serkan [2 ]
Gabbouj, Moncef [1 ]
机构
[1] Tampere Univ Technol, Tampere, Finland
[2] Qatar Univ, Coll Engn, Dept Elect Engn, Doha, Qatar
来源
关键词
k-NN classifier; Missing data; Imbalanced datasets;
D O I
10.1007/978-3-319-46349-0_34
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we describe our solution for the machine learning prediction challenge in IDA 2016. For the given problem of 2-class classification on an imbalanced dataset with missing data, we first develop an imputation method based on k-NN to estimate the missing values. Then we define a tailored representation for the given problem as an optimization scheme, which consists of learned distance and voting weights for k-NN classification. The proposed solution performs better in terms of the given challenge metric compared to the traditional classification methods such as SVM, AdaBoost or Random Forests.
引用
收藏
页码:387 / 392
页数:6
相关论文
共 50 条
  • [1] Handling Imbalanced Dataset Using SVM and k-NN Approach
    Wah, Yap Bee
    Abd Rahman, Hezlin Aryani
    He, Haibo
    Bulgiba, Awang
    [J]. ADVANCES IN INDUSTRIAL AND APPLIED MATHEMATICS, 2016, 1750
  • [2] Fast k-NN classification for multichannel image data
    Warfield, S
    [J]. PATTERN RECOGNITION LETTERS, 1996, 17 (07) : 713 - 721
  • [3] K-Nearest Neighbor (K-NN) based Missing Data Imputation
    Murti, Della Murbarani Prawidya
    Wibawa, Aji Prasetya
    Akbar, Muhammad Iqbal
    Ianto, Utomo Puj
    [J]. 2019 5TH INTERNATIONAL CONFERENCE ON SCIENCE ININFORMATION TECHNOLOGY (ICSITECH): EMBRACING INDUSTRY 4.0 - TOWARDS INNOVATION IN CYBER PHYSICAL SYSTEM, 2019, : 83 - 88
  • [4] Feature projection k-NN classifier model for imbalanced and incomplete medical data
    Porwik, Piotr
    Orczyk, Tomasz
    Lewandowski, Marcin
    Cholewa, Marcin
    [J]. BIOCYBERNETICS AND BIOMEDICAL ENGINEERING, 2016, 36 (04) : 644 - 656
  • [5] Investigation of the Impact of Missing Value Imputation Methods on the k-NN Classification Accuracy
    Orczyk, Tomasz
    Porwik, Piotr
    [J]. COMPUTATIONAL COLLECTIVE INTELLIGENCE (ICCCI 2015), PT II, 2015, 9330 : 557 - 565
  • [6] An Empirical Analysis of Data Reduction Techniques for k-NN Classification
    Eleftheriadis, Stylianos
    Evangelidis, Georgios
    Ougiaroglou, Stefanos
    [J]. ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, PT IV, AIAI 2024, 2024, 714 : 83 - 97
  • [7] Anonymizing k-NN Classification on MapReduce
    Bazai, Sibghat Ullah
    Jang-Jaccard, Julian
    Wang, Ruili
    [J]. MOBILE NETWORKS AND MANAGEMENT (MONAMI 2017), 2018, 235 : 364 - 377
  • [8] An Approach to Speech Emotion Classification Using k-NN and SVMs
    Disne SIVALINGAM
    [J]. Instrumentation, 2021, (03) : 36 - 45
  • [9] Phoneme Classification and Lattice Rescoring Based on a k-NN Approach
    Golipour, Ladan
    O'Shaughnessy, Douglas
    [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 1954 - 1957
  • [10] Exact k-NN queries on clustered SVD datasets
    Thomasian, A
    Li, Y
    Zhang, LJ
    [J]. INFORMATION PROCESSING LETTERS, 2005, 94 (06) : 247 - 252