Making use of functional dependencies based on data to find better classification trees

Authors
Sug H. [1]
Affiliation
[1] Dept. of Computer Eng, Dongseo University, 47 Jurye-ro, Sasang-gu, Busan
Keywords
Artificial intelligence; Classification; Decision trees; Functional dependency; Information systems; Knowledge modelling; Machine learning; Preprocessing
DOI
10.46300/9106.2021.15.160
Abstract
In classification tasks, independence between conditional attributes is a precondition for successful data mining. Decision trees are among the most widely used machine learning algorithms because they are easy to understand. Because dependency between conditional attributes can produce more complex trees, it is important to supply conditional attributes that are independent of each other: for decision trees, as for other machine learning algorithms, conditional attributes should be mutually independent and dependent only on the decision attributes. The usual statistical method for checking independence between attributes is the chi-square test, but it is effective only for categorical attributes. Its applicability is therefore limited, because most data mining datasets mix categorical and numerical attributes. To overcome this problem, the paper suggests a novel method for testing dependency between conditional attributes, based on functional dependencies found in the data, that can be applied to any dataset irrespective of attribute data types. Removing highly dependent conditional attributes makes it possible to generate better decision trees. Experiments performed to evaluate the method showed very good results. © 2021, North Atlantic University Union NAUN. All rights reserved.
Pages: 1475-1485
Number of pages: 10
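The abstract does not state how the dependency between two conditional attributes is measured, so the following is only a minimal sketch under an assumed measure: the fraction of rows that are consistent with a functional dependency x -> y, taking the most frequent y value for each distinct x value as the "expected" one. The function names `fd_strength` and `dependent_pairs`, the 0.95 threshold, and the sample rows are all hypothetical and are not taken from the paper.

```python
from collections import Counter, defaultdict


def fd_strength(rows, x, y):
    """Fraction of rows consistent with the functional dependency x -> y.

    Assumed measure (not from the paper): for each distinct value of
    column x, keep the rows holding the most frequent value of column y.
    A result of 1.0 means x determines y exactly in this data.
    """
    if not rows:
        raise ValueError("rows must not be empty")
    groups = defaultdict(Counter)
    for row in rows:
        groups[row[x]][row[y]] += 1
    kept = sum(counts.most_common(1)[0][1] for counts in groups.values())
    return kept / len(rows)


def dependent_pairs(rows, attributes, threshold=0.95):
    """List (x, y, strength) for ordered attribute pairs at or above threshold."""
    pairs = []
    for x in attributes:
        for y in attributes:
            if x != y:
                strength = fd_strength(rows, x, y)
                if strength >= threshold:
                    pairs.append((x, y, strength))
    return pairs


# Hypothetical mixed-type data: one categorical and two numerical attributes.
rows = [
    {"city": "Busan", "area_code": 51, "age": 30},
    {"city": "Busan", "area_code": 51, "age": 41},
    {"city": "Seoul", "area_code": 2, "age": 30},
    {"city": "Seoul", "area_code": 2, "age": 25},
    {"city": "Daegu", "area_code": 53, "age": 41},
    {"city": "Daegu", "area_code": 53, "age": 30},
]

# city and area_code determine each other, so one of them is a removal candidate.
for x, y, strength in dependent_pairs(rows, ["city", "area_code", "age"]):
    print(f"{x} -> {y}: {strength:.2f}")
```

Because the measure only compares values for equality, it works for categorical and numerical columns alike. Note that a column whose values are all distinct trivially determines every other column under this measure, so continuous attributes would need extra handling (for example discretization); how the paper addresses this is not stated in the abstract.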