Improve the classifier accuracy for continuous attributes in biomedical datasets using a new discretization method

被引:15
|
作者
Madhu, G. [1 ]
Rajinikanth, T. V. [2 ]
Govardhan, A. [3 ]
机构
[1] VNRVJIET, Dept Informat Technol, Hyderabad 90, Andhra Pradesh, India
[2] SNIST, Dept Comp Sci & Engn, Hyderabad, Andhra Pradesh, India
[3] JNT Univ, Sch Informat Technol, Hyderabad 85, Andhra Pradesh, India
关键词
continuous attributes; classification; data mining; discretization; discrete values; CHI2; ALGORITHM;
D O I
10.1016/j.procs.2014.05.315
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In real-time data mining applications discrete values play vital role in knowledge representation as they are easy to handle and very close to knowledge level representation than continuous attributes. Discretization is a major step in data mining process where continuous attributes are transformed into discrete values. However, most of the classifications algorithms are require discrete values as the input. Even though some data mining algorithms directly contract with continuous attributes, the learning process yields low quality results. In this paper, we introduce a new discretization method based on standard deviation technique called 'z-score' for continuous attributes on biomedical datasets. We compare performance of the proposed algorithm with the state-of-the-art discretization techniques. The experiment results show the efficiency in terms of accuracy and also minimize the classifier confusion for decision making process. (C) 2014 Published by Elsevier B.V. Open access under CC BY-NC-ND license.
引用
收藏
页码:671 / 679
页数:9
相关论文
共 50 条
  • [1] Improvement of decision accuracy using discretization of continuous attributes
    Wu, QingXiang
    Bell, David
    McGinnity, Martin
    Prasad, Girijesh
    Qi, Guilin
    Huang, Xi
    FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, PROCEEDINGS, 2006, 4223 : 674 - 683
  • [2] A new method for discretization of continuous attributes based on VPRS
    Wei, Jin-Mao
    Wang, Guo-Ying
    Kong, Xiang-Ming
    Li, Shu-Jie
    Wang, Shu-Qin
    Liu, Da-You
    ROUGH SETS AND CURRENT TRENDS IN COMPUTING, PROCEEDINGS, 2006, 4259 : 183 - 190
  • [3] New method of discretization of continuous attributes in rough sets
    Miao, Duo-Qian
    Zidonghua Xuebao/Acta Automatica Sinica, 2001, 27 (03): : 296 - 302
  • [4] New method for discretization of continuous attributes in rough set theory
    Cong, Rong
    Wang, Xiukun
    Li, Kai
    Yang, Nanhai
    JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2010, 21 (02) : 250 - 253
  • [5] A Improved Method of Discretization of Continuous Attributes
    Chen, Shunling
    Tang, Ling
    Liu, Weijun
    Li, Yonghong
    2011 2ND INTERNATIONAL CONFERENCE ON CHALLENGES IN ENVIRONMENTAL SCIENCE AND COMPUTER ENGINEERING (CESCE 2011), VOL 11, PT A, 2011, 11 : 213 - 217
  • [6] FUSINTER: A method for discretization of continuous attributes
    Zighed, DA
    Rabaseda, S
    Rakotomalala, R
    INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 1998, 6 (03) : 307 - 326
  • [7] A dynamic method for discretization of continuous attributes
    Hwang, GJ
    Li, FM
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2002, 2002, 2412 : 506 - 511
  • [8] New method for discretization of continuous attributes in rough set theory
    Rong Cong1
    2.Education Technology Center
    3.The 92538 Unit of PLA
    Journal of Systems Engineering and Electronics, 2010, 21 (02) : 250 - 253
  • [9] Discretization Method of Continuous Attributes Based on Decision Attributes
    Sun, Yingjuan
    Ren, Zengqiang
    Zhou, Tong
    Zhai, Yandong
    Pu, Dongbing
    ARTIFICIAL INTELLIGENCE AND COMPUTATIONAL INTELLIGENCE, AICI 2010, PT II, 2010, 6320 : 367 - 373
  • [10] Khiops: A statistical discretization method of continuous attributes
    Boulle, M
    MACHINE LEARNING, 2004, 55 (01) : 53 - 69