k-Nearest Neighbour Using Ensemble Clustering Based on Feature Selection Approach to Learning Relational Data

被引:0
|
作者
Alfred, Rayner [1 ]
Shin, Kung Ke [1 ]
Sainin, Mohd Shamrie [2 ]
On, Chin Kim [1 ]
Pandiyan, Paulraj Murugesa [3 ]
Ibrahim, Ag Asri Ag [1 ]
机构
[1] Univ Malaysia Sabah, Fac Comp & Informat, Kota Kinabalu, Sabah, Malaysia
[2] Univ Utara Malaysia, Sch Comp, Changlun, Kedah, Malaysia
[3] Univ Malaysia Perlis, Sch Mechatron Engn, Arau, Perlis, Malaysia
关键词
Relational data mining; k-Nearest Neighbours; Classification; Ensembles; Feature selection; Genetic Algorithm; REDUCTION;
D O I
10.1007/978-3-319-49073-1_35
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Due to the growing amount of data generated and stored in relational databases, relational learning has attracted the interest of researchers in recent years. Many approaches have been developed in order to learn relational data. One of the approaches used to learn relational data is Dynamic Aggregation of Relational Attributes (DARA). The DARA algorithm is designed to summarize relational data with one-to-many relations. However, DARA suffers a major drawback when the cardinalities of attributes are very high because the size of the vector space representation depends on the number of unique values that exist for all attributes in the dataset. A feature selection process can be introduced to overcome this problem. These selected features can be further optimized to achieve a good classification result. Several clustering runs can be performed for different values of k to yield an ensemble of clustering results. This paper proposes a two-layered genetic algorithm-based feature selection in order to improve the classification performance of learning relational database using a k-NN ensemble classifier. The proposed method involves the task of omitting less relevant features but retaining the diversity of the classifiers so as to improve the performance of the k-NN ensemble. The result shows that the proposed k-NN ensemble is able to improve the performance of traditional k-NN classifiers.
引用
收藏
页码:322 / 331
页数:10
相关论文
共 50 条
  • [1] K-nearest neighbour-based feature selection using hyperspectral data
    Pal, Mahesh
    Charan, Teja B.
    Poriya, Akshay
    [J]. REMOTE SENSING LETTERS, 2021, 12 (02) : 128 - 137
  • [2] A multilevel k-nearest neighbour learning algorithm based on k-means clustering
    Ying, Xu
    [J]. 2007 International Symposium on Computer Science & Technology, Proceedings, 2007, : 250 - 253
  • [3] Using k-nearest neighbor and feature selection as an improvement to hierarchical clustering
    Mylonas, P
    Wallace, M
    Kollias, S
    [J]. METHODS AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2004, 3025 : 191 - 200
  • [4] A New Feature Selection Method Based on K-Nearest Neighbor Approach
    Wang, Xianchang
    Zhang, Lishi
    Ma, Yonggang
    [J]. PROCEEDINGS OF THE 2016 7TH INTERNATIONAL CONFERENCE ON EDUCATION, MANAGEMENT, COMPUTER AND MEDICINE (EMCM 2016), 2017, 59 : 657 - 660
  • [5] An evaluation of k-nearest neighbour imputation using Likert data
    Jönsson, P
    Wohlin, C
    [J]. 10TH INTERNATIONAL SYMPOSIUM ON SOFTWARE METRICS, PROCEEDINGS, 2004, : 108 - 118
  • [6] Improved AURA k-Nearest Neighbour approach
    Weeks, M
    Hodge, V
    O'Keefe, S
    Austin, J
    Lees, K
    [J]. ARTIFICIAL NEURAL NETS PROBLEM SOLVING METHODS, PT II, 2003, 2687 : 663 - 670
  • [7] Feature Selection and Classification of Microarray Data using MapReduce based ANOVA and K-Nearest Neighbor
    Kumar, Mukesh
    Rath, Nitish Kumar
    Swain, Amitav
    Rath, Santanu Kumar
    [J]. ELEVENTH INTERNATIONAL CONFERENCE ON COMMUNICATION NETWORKS, ICCN 2015/INDIA ELEVENTH INTERNATIONAL CONFERENCE ON DATA MINING AND WAREHOUSING, ICDMW 2015/NDIA ELEVENTH INTERNATIONAL CONFERENCE ON IMAGE AND SIGNAL PROCESSING, ICISP 2015, 2015, 54 : 301 - 310
  • [8] Feature extraction for the k-nearest neighbour classifier with genetic programming
    Bot, MCJ
    [J]. GENETIC PROGRAMMING, PROCEEDINGS, 2001, 2038 : 256 - 267
  • [9] K-nearest oracle for dynamic ensemble selection
    Ko, Albert Hung-Ren
    Sabourin, Robert
    Britto, Alceu de Souza, Jr.
    [J]. ICDAR 2007: NINTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2007, : 422 - +
  • [10] A proposed model based on k-nearest neighbour classifier with feature selection techniques to control and forecast plant disease
    Imran, Inas Ismael
    Ali, Rawaa Hamza
    Jameel, Shymaa Mohammed
    Jaleel, Refed Adnan
    [J]. INTERNATIONAL JOURNAL OF GRID AND UTILITY COMPUTING, 2024, 15 (3-4) : 306 - 313