A hybrid evolutionary approach to construct optimal decision trees with large data sets

Cited by: 0
Authors
Patil, D. V. [1]
Bichkar, R. S. [2]
Affiliations
[1] SGGS Inst Engn & Tech, Nanded, MS, India
[2] SGGS Inst Engn & Tech, Dept Elect & Telecommun Engn, Nanded, MS, India
Keywords
large data sets; decision tree; genetic algorithm; genetically evolved decision tree; training set size; classification accuracy; comprehensibility
DOI
Not available
Chinese Library Classification number
T [Industrial Technology];
Subject classification code
08;
Abstract
Data mining environments produce large volumes of data. The knowledge this data contains can be used to improve an organization's decision-making process. When such large amounts of data are used for decision tree construction, the resulting trees are large and incomprehensible to human experts. Learning on high-volume data also becomes very slow, because it has to be done serially over the available large data sets. Our ultimate goal is to build smaller trees that are equally accurate, using randomly sampled data. We experimented with techniques based on incremental random sampling combined with genetic algorithms, which use global search to evolve decision trees and obtain a compact representation of a large data set. Experiments on some data sets showed that the proposed random sampling procedures with genetic algorithms yield decision trees that are smaller than those built by other methods while being equally accurate. The method combines optimization with comprehensibility and scalability. With this technique, problems such as slow execution and overloading of memory and processor on very large databases can be avoided.
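The abstract does not give the tree encoding, the fitness function, or the rule for growing the sample, so the following is only a minimal sketch of the general idea (incremental random sampling around a genetic search over trees), not the authors' implementation. Everything in it is an assumption made for illustration: the tuple-based tree encoding, the size penalty `alpha`, the subtree-grafting operators, the stopping tolerance `tol`, and all function names.

```python
import random

def random_tree(rng, n_features, labels, depth):
    """Random tree: ('leaf', label) or ('node', feature, threshold, left, right)."""
    if depth == 0 or rng.random() < 0.3:
        return ("leaf", rng.choice(labels))
    return ("node", rng.randrange(n_features), rng.random(),
            random_tree(rng, n_features, labels, depth - 1),
            random_tree(rng, n_features, labels, depth - 1))

def predict(tree, x):
    # Walk from the root to a leaf; features are assumed to be scaled to [0, 1].
    while tree[0] == "node":
        _, feature, threshold, left, right = tree
        tree = left if x[feature] <= threshold else right
    return tree[1]

def size(tree):
    return 1 if tree[0] == "leaf" else 1 + size(tree[3]) + size(tree[4])

def accuracy(tree, rows):
    return sum(predict(tree, x) == y for x, y in rows) / len(rows)

def fitness(tree, rows, alpha=0.005):
    # Assumed fitness: accuracy minus a penalty per node, to favour small trees.
    return accuracy(tree, rows) - alpha * size(tree)

def random_subtree(tree, rng):
    while tree[0] == "node" and rng.random() < 0.5:
        tree = tree[rng.choice((3, 4))]
    return tree

def graft(tree, donor, rng):
    """Return a copy of tree with one randomly chosen subtree replaced by donor."""
    if tree[0] == "leaf" or rng.random() < 0.3:
        return donor
    _, feature, threshold, left, right = tree
    if rng.random() < 0.5:
        return ("node", feature, threshold, graft(left, donor, rng), right)
    return ("node", feature, threshold, left, graft(right, donor, rng))

def evolve(rows, labels, n_features, rng, population=None,
           pop_size=40, generations=30):
    """Elitist GA; returns the final population sorted best-first."""
    pop = population or [random_tree(rng, n_features, labels, 3)
                         for _ in range(pop_size)]
    for _ in range(generations):
        pop = sorted(pop, key=lambda t: fitness(t, rows), reverse=True)
        elite = pop[: pop_size // 4]          # survivors are carried over unchanged
        children = []
        while len(elite) + len(children) < pop_size:
            a, b = rng.choice(elite), rng.choice(elite)
            child = graft(a, random_subtree(b, rng), rng)   # subtree crossover
            if rng.random() < 0.2:                          # subtree mutation
                child = graft(child, random_tree(rng, n_features, labels, 2), rng)
            children.append(child)
        pop = elite + children
    return sorted(pop, key=lambda t: fitness(t, rows), reverse=True)

def incremental_ga(data, labels, n_features, seed=0, start=50, step=50, tol=0.01):
    """Grow a random sample until the best tree's sample accuracy stops changing."""
    rng = random.Random(seed)
    shuffled = data[:]
    rng.shuffle(shuffled)                     # a prefix of a shuffle is a random sample
    n, pop, prev = start, None, -1.0
    while True:
        sample = shuffled[:n]
        pop = evolve(sample, labels, n_features, rng, pop)
        acc = accuracy(pop[0], sample)
        if abs(acc - prev) < tol or n >= len(shuffled):
            return pop[0], len(sample)
        prev, n = acc, min(n + step, len(shuffled))

if __name__ == "__main__":
    gen = random.Random(42)
    points = [[gen.random(), gen.random()] for _ in range(300)]
    data = [(x, int(x[0] > 0.5)) for x in points]   # toy rule on the first feature
    tree, used = incremental_ga(data, [0, 1], 2)
    print(size(tree), used, accuracy(tree, data))
```

In this sketch the population is carried over each time the sample grows, so earlier search effort is reused, and the size penalty is what pushes the search toward compact trees. A real experiment would also need a held-out test set, which is omitted here.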
Pages: 603 / +
Page count: 2
Related papers
50 in total
  • [1] Adjusting SVMs for large data sets using balanced decision trees
    Vatamanu, Cristina
    Gavrilut, Dragos Teodor
    Popoiu, George
    2018 20TH INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND NUMERIC ALGORITHMS FOR SCIENTIFIC COMPUTING (SYNASC 2018), 2019, : 223 - 229
  • [2] Using decision trees to construct optimal acoustic cues
    Robbe, S
    Bonneau, A
    Coste, S
    Laprie, Y
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 137 - 140
  • [3] Mixed decision trees: An evolutionary approach
    Kretowski, Marek
    Grzes, Marek
    DATA WAREHOUSING AND KNOWLEDGE DISCOVERY, PROCEEDINGS, 2006, 4081 : 260 - 269
  • [4] An Improved Error-Based Pruning Algorithm of Decision Trees on Large Data Sets
    Peng, Yi
    Lu, Yu-Tong
    Chen, Zhi-Guang
    2021 IEEE 6TH INTERNATIONAL CONFERENCE ON BIG DATA ANALYTICS (ICBDA 2021), 2021, : 33 - 37
  • [5] Using rough sets to construct sense type decision trees for text categorization
    Bleyberg, MZ
    Elumalai, A
    JOINT 9TH IFSA WORLD CONGRESS AND 20TH NAFIPS INTERNATIONAL CONFERENCE, PROCEEDINGS, VOLS. 1-5, 2001, : 19 - 24
  • [6] Credal Decision Trees to Classify Noisy Data Sets
    Mantas, Carlos J.
    Abellan, Joaquin
    HYBRID ARTIFICIAL INTELLIGENCE SYSTEMS, HAIS 2014, 2014, 8480 : 689 - 696
  • [7] Bagging Decision Trees on Data Sets with Classification Noise
    Abellan, Joaquin
    Masegosa, Andres R.
    FOUNDATIONS OF INFORMATION AND KNOWLEDGE SYSTEMS, PROCEEDINGS, 2010, 5956 : 248 - 265
  • [8] A BASIC PROGRAM TO CONSTRUCT EVOLUTIONARY TREES FROM RESTRICTION ENDONUCLEASE DATA
    GENTZBITTEL, L
    NICOLAS, P
    JOURNAL OF HEREDITY, 1989, 80 (03) : 254 - 254
  • [9] Global induction of oblique decision trees: An evolutionary approach
    Kretowski, M
    Grzes, M
    INTELLIGENT INFORMATION PROCESSING AND WEB MINING, PROCEEDINGS, 2005, : 309 - 318
  • [10] Building fast decision trees from large training sets
    Franco-Arcega, A.
    Carrasco-Ochoa, J. A.
    Sanchez-Diaz, G.
    Fco Martinez-Trinidad, J.
    INTELLIGENT DATA ANALYSIS, 2012, 16 (04) : 649 - 664