Improve the classifier accuracy for continuous attributes in biomedical datasets using a new discretization method

被引:15
|
作者
Madhu, G. [1 ]
Rajinikanth, T. V. [2 ]
Govardhan, A. [3 ]
机构
[1] VNRVJIET, Dept Informat Technol, Hyderabad 90, Andhra Pradesh, India
[2] SNIST, Dept Comp Sci & Engn, Hyderabad, Andhra Pradesh, India
[3] JNT Univ, Sch Informat Technol, Hyderabad 85, Andhra Pradesh, India
关键词
continuous attributes; classification; data mining; discretization; discrete values; CHI2; ALGORITHM;
D O I
10.1016/j.procs.2014.05.315
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In real-time data mining applications discrete values play vital role in knowledge representation as they are easy to handle and very close to knowledge level representation than continuous attributes. Discretization is a major step in data mining process where continuous attributes are transformed into discrete values. However, most of the classifications algorithms are require discrete values as the input. Even though some data mining algorithms directly contract with continuous attributes, the learning process yields low quality results. In this paper, we introduce a new discretization method based on standard deviation technique called 'z-score' for continuous attributes on biomedical datasets. We compare performance of the proposed algorithm with the state-of-the-art discretization techniques. The experiment results show the efficiency in terms of accuracy and also minimize the classifier confusion for decision making process. (C) 2014 Published by Elsevier B.V. Open access under CC BY-NC-ND license.
引用
收藏
页码:671 / 679
页数:9
相关论文
共 50 条
  • [21] A new algorithm of discretization of decision dable's continuous attributes
    Sun, BQ
    Yang, J
    Chen, SL
    Zhao, M
    DYNAMICS OF CONTINUOUS DISCRETE AND IMPULSIVE SYSTEMS-SERIES B-APPLICATIONS & ALGORITHMS, 2005, 2 : 786 - 789
  • [22] Influence of the Discretization Method on the Integration Accuracy of Observers with Continuous Feedback
    Comanescu, Mihai
    2011 IEEE INTERNATIONAL SYMPOSIUM ON INDUSTRIAL ELECTRONICS (ISIE), 2011,
  • [23] Using Accuracy-Based Learning Classifier Systems for Imbalance Datasets
    Udomthanapong, Sornchai
    Tamee, Kreangsak
    Pinngern, Ouen
    ECTI-CON 2008: PROCEEDINGS OF THE 2008 5TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING/ELECTRONICS, COMPUTER, TELECOMMUNICATIONS AND INFORMATION TECHNOLOGY, VOLS 1 AND 2, 2008, : 21 - 24
  • [24] Statistical Discretization of Continuous Attributes Using Kolmogorov-Smirnov Test
    Abachi, Hadi Mohammadzadeh
    Hosseini, Saeid
    Maskouni, Mojtaba Amiri
    Kangavari, Mohammadreza
    Cheung, Ngai-Man
    DATABASES THEORY AND APPLICATIONS, ADC 2018, 2018, 10837 : 309 - 315
  • [25] Using dynamic attributes to improve customer segmentation accuracy
    Department of Management Science and Engineering, Harbin Institute of Technology, Mailbox 1223, 150001 Harbin, China
    WSEAS Trans. Inf. Sci. Appl., 2006, 4 (698-703):
  • [26] Using Gene Pair Combinations to Improve the Accuracy of the PAM Classifier
    Chopra, Pankaj
    Kang, Jaewoo
    Lee, Jinseung
    2009 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2009, : 174 - +
  • [27] Supervised discretization of continuous-valued attributes for classification using RACER algorithm
    Toulabinejad, Elaheh
    Mirsafaei, Mohammad
    Basiri, Alireza
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 244
  • [28] A Probabilistic Method to Predict Classifier Accuracy on Larger Datasets given Small Pilot Data
    Harvey, Ethan
    Chen, Wansu
    Kent, David M.
    Hughes, Michael C.
    MACHINE LEARNING FOR HEALTH, ML4H, VOL 225, 2023, 225 : 129 - 144
  • [29] A new discretization method of governing equations for high order accuracy
    Kim, Dehee
    Kwon, Jang Hyuk
    COMPUTATIONAL FLUID DYNAMICS 2004, PROCEEDINGS, 2006, : 429 - +
  • [30] Diabetes disease prediction system using HNB classifier based on discretization method
    Al-Hameli, Bassam Abdo
    Alsewari, AbdulRahman A.
    Basurra, Shadi S.
    Bhogal, Jagdev
    Ali, Mohammed A. H.
    JOURNAL OF INTEGRATIVE BIOINFORMATICS, 2023, 20 (01)