A Structural Sampling Technique for Better Decision Trees

被引:0
|
作者
Sug, Hyontai [1 ]
机构
[1] Dongseo Univ, Div Comp & Informat Engn, Pusan, South Korea
关键词
decision trees; sampling; CART; C4.5;
D O I
10.1109/ACIIDS.2009.24
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Since data mining problems contain a large amount of data, sampling is a necessity for the success of the task. Decision trees have been developed for prediction, and finding decision trees with smaller error rates has been a major task for their success. This paper suggests a structural sampling technique that is based on a generated decision tree, where the tree is generated based on fast and dirty tree generation algorithm. Experiments with several sample sizes and representative decision tree algorithms showed that the method is more effective with respect to decision tree size and error rate than conventional random sampling method especially for small sample size.
引用
收藏
页码:24 / 27
页数:4
相关论文
共 50 条
  • [1] Sampling methods in decision trees
    Mehrotra, KG
    Jeragh, M
    [J]. IC-AI'2000: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 1-III, 2000, : 1069 - 1075
  • [2] Generating better decision trees
    [J]. 1600, Morgan Kaufmann Publ Inc, San Mateo, CA, USA (01):
  • [3] FLUID SAMPLING . A BETTER TECHNIQUE
    JACOBY, RH
    TRACHT, JH
    [J]. HYDROCARBON PROCESSING, 1970, 49 (2P1): : 101 - &
  • [4] Build better diagnostic decision trees
    Assaf, T
    Dugan, JB
    [J]. IEEE INSTRUMENTATION & MEASUREMENT MAGAZINE, 2005, 8 (03) : 48 - 53
  • [5] Predicate selection for structural decision trees
    Ng, KS
    Lloyd, JW
    [J]. INDUCTIVE LOGIC PROGRAMMING, PROCEEDINGS, 2005, 3625 : 264 - 278
  • [6] A repetitive sampling plan using decision trees method
    Azam, Muhammad
    Aslam, Muhammad
    Niaki, Seyed Taghi Akhavan
    [J]. JOURNAL OF STATISTICS & MANAGEMENT SYSTEMS, 2020, 23 (04): : 789 - 807
  • [7] An anonymization technique using intersected decision trees
    Fletcher, Sam
    Islam, Md Zahidul
    [J]. JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2015, 27 (03) : 297 - 304
  • [8] APPLICATION OF DECISION THEORY TO SAMPLING IN A STRUCTURAL CONTEXT
    SHAH, HC
    BURNES, JA
    [J]. PROCEEDINGS OF THE INSTITUTION OF CIVIL ENGINEERS, 1969, 43 (MAY): : 79 - &
  • [9] A new sampling strategy for building decision trees from large databases
    Chauchat, JH
    Rakotomalala, R
    [J]. DATA ANALYSIS, CLASSIFICATION, AND RELATED METHODS, 2000, : 199 - 204
  • [10] Bigger Data Is Better for Molecular Diagnosis Tests Based on Decision Trees
    Floares, Alexandru G.
    Calin, George A.
    Manolache, Florin B.
    [J]. DATA MINING AND BIG DATA, DMBD 2016, 2016, 9714 : 288 - 295