Making use of functional dependencies based on data to find better classification trees

Cited by: 0
Authors
Sug H. [1]
Affiliation
[1] Dept. of Computer Eng, Dongseo University, 47 Jurye-ro, Sasang-gu, Busan
Keywords
Artificial intelligence; Classification; Decision trees; Functional dependency; Information systems; Knowledge modelling; Machine learning; Preprocessing
DOI
10.46300/9106.2021.15.160
Abstract
In classification tasks, independence between conditional attributes is a precondition for successful data mining. Decision trees are among the most widely used machine learning algorithms because they are easy to understand. Because dependency between conditional attributes can produce more complex trees, it is important to supply conditional attributes that are independent of each other: decision trees, like other machine learning algorithms, require conditional attributes that are mutually independent and dependent only on the decision attributes. The standard statistical check for independence between attributes is the Chi-square test, but the test is effective only for categorical attributes. Its applicability is therefore limited, because most datasets used in data mining mix categorical and numerical attributes. To overcome this problem, a novel method for testing dependency between conditional attributes is suggested. The method is based on functional dependencies derived from the data and can be applied to any dataset irrespective of the data types of its attributes. Removing highly dependent conditional attributes allows better decision trees to be generated. Experiments performed to evaluate the method showed very good results. © 2021, North Atlantic University Union NAUN. All rights reserved.
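The abstract does not state how the dependency between two attributes is scored, so the following is only a minimal sketch of one common way to measure a data-based functional dependency, not the author's method. It assumes the strength of X -> Y is the fraction of rows that agree with the most frequent Y value for their X value; the names `fd_strength`, `dependent_pairs`, the 0.95 threshold, and the toy rows are all hypothetical.

```python
from collections import Counter, defaultdict

def fd_strength(rows, x, y):
    """Fraction of rows consistent with the functional dependency x -> y.

    For each distinct value of x, the most frequent y value is kept;
    rows with any other y value count as violations (an assumed measure).
    """
    groups = defaultdict(Counter)
    for row in rows:
        groups[row[x]][row[y]] += 1
    consistent = sum(max(counts.values()) for counts in groups.values())
    return consistent / len(rows)

def dependent_pairs(rows, attributes, threshold=0.95):
    """List (x, y, strength) for attribute pairs whose strength meets the threshold."""
    pairs = []
    for x in attributes:
        for y in attributes:
            if x != y:
                strength = fd_strength(rows, x, y)
                if strength >= threshold:
                    pairs.append((x, y, strength))
    return pairs

# Hypothetical toy data mixing categorical and numerical attributes.
rows = [
    {"zip": "47011", "city": "Busan", "temp": 21.5},
    {"zip": "47011", "city": "Busan", "temp": 19.0},
    {"zip": "04524", "city": "Seoul", "temp": 21.5},
    {"zip": "03154", "city": "Seoul", "temp": 18.2},
]

print(fd_strength(rows, "zip", "city"))   # zip determines city in every row
print(fd_strength(rows, "city", "zip"))   # city does not fully determine zip
print(dependent_pairs(rows, ["zip", "city"]))
```

Because the measure only compares values for equality, it works the same way for categorical and numerical columns. A numerical attribute with almost all distinct values would trivially "determine" every other attribute under this measure, so in practice such key-like columns would need separate handling, for example by discretizing them first.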
Pages: 1475-1485
Number of pages: 10