An integrated approach for different attribute types in nearest neighbour classification

被引:1
|
作者
Liu, WZ
机构
来源
KNOWLEDGE ENGINEERING REVIEW | 1996年 / 11卷 / 03期
关键词
D O I
10.1017/S0269888900007906
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The basic nearest neighbour algorithm works by storing the training instances and classifying a new case by predicting that it has the same class as its nearest stored instance. To measure the distance between instances, some distance metric needs to be used. In situations when all attributes have numeric values, the conventional nearest neighbour method treats examples as points in feature spaces and uses Euclidean distance as the distance metric. In tasks with only nominal attributes, the simple ''over-lap'' metric is usually used. To handle classification tasks that have mixed types of attributes, the two different metrics are simply combined. Work by researchers in the machine learning field has shown that this approach performs poorly. This paper attempts to study a more recently developed distance metric and show that this metric is capable of measuring the importance of different attributes. With the use of discretisation for numeric-valued attributes, this method provides an integrated way in dealing with problem domains with mixtures of attribute types. Through detailed analyses, this paper tries to provide further insights into the understanding of nearest neighbour classification techniques and promote further use of this type of classification algorithm.
引用
收藏
页码:245 / 252
页数:8
相关论文
共 50 条
  • [1] A New Approach to Fuzzy-Rough Nearest Neighbour Classification
    Jensen, Richard
    Cornelis, Chris
    [J]. ROUGH SETS AND CURRENT TRENDS IN COMPUTING, PROCEEDINGS, 2008, 5306 : 310 - +
  • [2] Ontology-Aided Product Classification: A Nearest Neighbour Approach
    Abbott, Alastair A.
    Watson, Ian
    [J]. CASE-BASED REASONING RESEARCH AND DEVELOPMENT, ICCBR 2011, 2011, 6880 : 348 - 362
  • [3] Hybrid dynamic k-nearest-neighbour and distance and attribute weighted method for classification
    Wu, Jia
    Cai, Zhi-hua
    Ao, Shuang
    [J]. INTERNATIONAL JOURNAL OF COMPUTER APPLICATIONS IN TECHNOLOGY, 2012, 43 (04) : 378 - 384
  • [4] Skewness and Nearest Neighbour based Approach for Historical Document Classification
    Kavitha, A. S.
    Shivakumara, P.
    Kumar, G. Hemantha
    [J]. 2013 INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS AND NETWORK TECHNOLOGIES (CSNT 2013), 2013, : 602 - 606
  • [5] Nearest neighbour classification of otoneurological data
    Viikki, K
    Tapani, M
    Juhola, M
    Pyykkö, I
    [J]. HEALTH DATA IN THE INFORMATION SOCIETY, 2002, 90 : 450 - 454
  • [6] Nearest Neighbour Classification for Trajectory Data
    Sharma, Lokesh K.
    Vyas, Om Prakash
    Schieder, Simon
    Akasapu, Ajaya K.
    [J]. INFORMATION AND COMMUNICATION TECHNOLOGIES, 2010, 101 : 180 - +
  • [7] Nearest Neighbour Distance Matrix Classification
    Sainin, Mohd Shamrie
    Alfred, Rayner
    [J]. ADVANCED DATA MINING AND APPLICATIONS, ADMA 2010, PT I, 2010, 6440 : 114 - 124
  • [8] Nearest Neighbour Classification with Monotonicity Constraints
    Duivesteijn, Wouter
    Feelders, Ad
    [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PART I, PROCEEDINGS, 2008, 5211 : 301 - 316
  • [9] Spam classification using nearest neighbour techniques
    Trudgian, DC
    [J]. INTELLIGENT DAA ENGINEERING AND AUTOMATED LEARNING IDEAL 2004, PROCEEDINGS, 2004, 3177 : 578 - 585
  • [10] Hybridisation of Genetic Programming and Nearest Neighbour for Classification
    Al-Sahaf, Harith
    Song, Andy
    Zhang, Mengjie
    [J]. 2013 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2013, : 2650 - 2657