Extending Sample Information for Small Data Set Prediction

被引:10
|
作者
Chen, Hung-Yuj [1 ]
Li, Der-Chiang [2 ]
Lin, Liang-Sian [3 ]
机构
[1] Natl Cheng Kung Univ, Dept Informat Management, Tainan, Taiwan
[2] Natl Cheng Kung Univ, Dept Ind & Informat Management, Tainan, Taiwan
[3] Ind Technol Res Inst, Informat & Commun Res Labs, Hsinchu, Taiwan
关键词
small dataset learning; extended data attributes; box-chart-based domain estimation; support vector regression; CLUSTER-ANALYSIS; PERFORMANCE; DIFFUSION; NETWORK;
D O I
10.1109/IIAI-AAI.2016.16
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This paper proposes a method that focuses on creating new data attributes by using fuzzy operations for solving small dataset learning problems. Using the idea of fuzzy rules, the membership value of antecedents in each rule can be extracted from the data point. Therefore, in this research, those membership values will be deemed as new data features and the data dimensionality will be extended. To test the effectiveness of the proposed method, the data set with new data features and the one with no special treatment will be utilized respectively to build predictive models. Paired t-test is carried out to see how effective the proposed method can improve the learning on the basis of small sample sets.
引用
收藏
页码:710 / 714
页数:5
相关论文
共 50 条
  • [31] Prediction of the Periglacial Debris Flow in Southeast Tibet Based on Imbalanced Small Sample Data
    Du, Jun
    Zhang, Hong-ya
    Hu, Kai-heng
    Wang, Lin
    Dong, Lin-yao
    WATER, 2023, 15 (02)
  • [32] Small area prediction of quantiles for zero-inflated data and an informative sample design
    Berg, Emily
    Lee, Danhyang
    STATISTICAL THEORY AND RELATED FIELDS, 2019, 3 (02) : 114 - 128
  • [33] Identification information analysis of sample train set subspace
    Fu, XiangFei
    Zhou, Jiliu
    Lang, FangNian
    PROCEEDINGS OF THE FOURTH INTERNATIONAL CONFERENCE ON IMAGE AND GRAPHICS, 2007, : 633 - +
  • [34] Extending Retrospective Encoding for Robust Recovery of the Multistatic Data Set
    Ali, Rehman
    Herickhoff, Carl D.
    Hyun, Dongwoon
    Dahl, Jeremy J.
    Bottenus, Nick
    IEEE TRANSACTIONS ON ULTRASONICS FERROELECTRICS AND FREQUENCY CONTROL, 2020, 67 (05) : 943 - 956
  • [35] Calligraphy Character Synthesis from Small Sample Set
    Yu, Kai
    Yuan, Zhenming
    EMERGENCE OF DIGITAL LIBRARIES - RESEARCH AND PRACTICES, 2014, 8839 : 379 - 384
  • [36] ISOTOPE RATIO MONITORING GCMS - EXTENDING THE RANGE FOR SMALL SAMPLE SIZES
    HILKERT, AW
    BRAND, WA
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 1995, 210 : 38 - GEOC
  • [37] Nutrigenomic Information in the openEHR Data Set
    Maranhao, Priscila Alves
    Bacelar-Silva, Gustavo Marisio
    Goncalves Ferreira, Duarte Nuno
    Calhau, Conceicao
    Vieira-Marques, Pedro
    Cruz-Correia, Ricardo Joao
    APPLIED CLINICAL INFORMATICS, 2018, 9 (01): : 221 - 231
  • [38] Extending electronic data interchange: Accounting information on the Internet
    Schneider, GP
    MANAGING INFORMATION TECHNOLOGY RESOURCES AND APPLICATIONS IN THE WORLD ECONOMY, 1997, : 431 - 433
  • [39] A genetic algorithm-based virtual sample generation technique to improve small data set learning
    Li, Der-Chiang
    Wen, I-Hsiang
    NEUROCOMPUTING, 2014, 143 : 222 - 230
  • [40] THE PREDICTION ERROR OF AUTOREGRESSIVE SMALL SAMPLE MODELS
    BROERSEN, PMT
    IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1990, 38 (05): : 858 - 860