Using k-nearest neighbor and feature selection as an improvement to hierarchical clustering

Authors
Mylonas, P [1]
Wallace, M [1]
Kollias, S [1]
Affiliations
[1] Natl Tech Univ Athens, Sch Elect & Comp Engn, GR-15773 Zografos, Athens, Greece
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Clustering of data is a difficult problem that arises in many fields and applications. The challenge grows as the dimensionality of the input space increases and as feature scales differ from one another. Hierarchical clustering methods are more flexible than their partitioning counterparts, as they do not need the number of clusters as input. Still, plain hierarchical clustering does not provide a satisfactory framework for extracting meaningful results in such cases. Major drawbacks have to be tackled, such as the curse of dimensionality and initial error propagation, as well as complexity and data set size issues. In this paper we propose an unsupervised extension to hierarchical clustering by means of feature selection, in order to overcome the first drawback, thus increasing the robustness of the whole algorithm. The results of applying this clustering to a portion of the data set in question are then refined and extended to the whole data set through a classification step, using the k-nearest neighbor classification technique, in order to tackle the latter two problems. The performance of the proposed methodology is demonstrated through its application to a variety of well-known, publicly available data sets.
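The pipeline outlined in the abstract (select features, cluster a portion of the data hierarchically, then extend the labels to the whole data set with k-nearest neighbor) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the feature selection rule, the linkage, or the stopping criterion, so the variance-based filter, average linkage, distance `threshold`, and all function names below are assumptions made purely for illustration.

```python
import math
import random


def select_features(rows, m):
    """Stand-in (assumed) selection: keep the m features with the
    largest variance after min-max scaling each feature to [0, 1]."""
    scores = []
    for j in range(len(rows[0])):
        col = [r[j] for r in rows]
        lo, hi = min(col), max(col)
        span = (hi - lo) or 1.0  # constant feature: avoid division by zero
        scaled = [(v - lo) / span for v in col]
        mean = sum(scaled) / len(scaled)
        var = sum((v - mean) ** 2 for v in scaled) / len(scaled)
        scores.append((var, j))
    return sorted(j for _, j in sorted(scores, reverse=True)[:m])


def project(row, feats):
    """Restrict a row to the selected feature indices."""
    return [row[j] for j in feats]


def dist(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def agglomerate(points, threshold):
    """Average-linkage agglomerative clustering (assumed linkage).
    Merging stops once the closest pair of clusters is farther apart
    than `threshold`, so no cluster count is required as input."""
    clusters = [[i] for i in range(len(points))]

    def linkage(c1, c2):
        total = sum(dist(points[i], points[j]) for i in c1 for j in c2)
        return total / (len(c1) * len(c2))

    while len(clusters) > 1:
        d, a, b = min(
            (linkage(clusters[a], clusters[b]), a, b)
            for a in range(len(clusters))
            for b in range(a + 1, len(clusters))
        )
        if d > threshold:
            break
        clusters[a] = clusters[a] + clusters[b]
        del clusters[b]  # b > a, so index a is unaffected

    labels = [0] * len(points)
    for label, members in enumerate(clusters):
        for i in members:
            labels[i] = label
    return labels


def knn_label(query, points, labels, k):
    """Majority vote among the k nearest labelled points."""
    nearest = sorted(range(len(points)), key=lambda i: dist(query, points[i]))[:k]
    votes = {}
    for i in nearest:
        votes[labels[i]] = votes.get(labels[i], 0) + 1
    return max(votes, key=votes.get)


def cluster_then_classify(rows, sample_size, m, threshold, k, seed=0):
    """Cluster a random portion of the data, then label every row by k-NN."""
    rng = random.Random(seed)
    sample_idx = rng.sample(range(len(rows)), sample_size)
    feats = select_features([rows[i] for i in sample_idx], m)
    sample_pts = [project(rows[i], feats) for i in sample_idx]
    sample_labels = agglomerate(sample_pts, threshold)
    return [knn_label(project(r, feats), sample_pts, sample_labels, k) for r in rows]
```

Because the quadratic-or-worse clustering step runs only on the sample, its cost depends on `sample_size` rather than on the full data set; the k-NN pass over all rows is what extends the result. In this sketch distances are computed on unscaled features, so `threshold` must be chosen in the units of the selected features.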
Pages: 191-200
Number of pages: 10