Extending Sample Information for Small Data Set Prediction

Cited by: 10
Authors
Chen, Hung-Yu [1]
Li, Der-Chiang [2]
Lin, Liang-Sian [3]
Affiliations
[1] Natl Cheng Kung Univ, Dept Informat Management, Tainan, Taiwan
[2] Natl Cheng Kung Univ, Dept Ind & Informat Management, Tainan, Taiwan
[3] Ind Technol Res Inst, Informat & Commun Res Labs, Hsinchu, Taiwan
Keywords
small dataset learning; extended data attributes; box-chart-based domain estimation; support vector regression; CLUSTER-ANALYSIS; PERFORMANCE; DIFFUSION; NETWORK;
DOI
10.1109/IIAI-AAI.2016.16
Chinese Library Classification
TP39 [Computer applications]
Discipline classification codes
081203; 0835
Abstract
This paper proposes a method that creates new data attributes through fuzzy operations in order to solve small dataset learning problems. Following the idea of fuzzy rules, the membership value of the antecedents in each rule can be extracted from each data point. In this research, those membership values are treated as new data features, which extends the dimensionality of the data. To test the effectiveness of the proposed method, the data set with the new features and the data set with no special treatment are each used to build predictive models. A paired t-test is then carried out to assess how effectively the proposed method improves learning from small sample sets.
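The idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not state the shape of the membership function or how its domain is set, so the triangular function, the quartile-based (Tukey fence) bounds, the median as the peak, and all function names below are assumptions chosen for illustration. The paired t statistic is the standard textbook formula.

```python
import numpy as np

def box_plot_domain(x):
    """Estimate a value range from the quartiles (Tukey fences).
    Assumption: stands in for the paper's box-chart-based domain estimation."""
    q1, q3 = np.percentile(x, [25, 75])
    iqr = q3 - q1
    # Assumption: fall back to a unit spread when all quartiles coincide.
    spread = 1.5 * iqr if iqr > 0 else 1.0
    return q1 - spread, q3 + spread

def triangular_membership(x, lower, center, upper):
    """Triangular fuzzy membership: 1 at center, 0 at or beyond the bounds."""
    x = np.asarray(x, dtype=float)
    left = (x - lower) / (center - lower)
    right = (upper - x) / (upper - center)
    return np.clip(np.minimum(left, right), 0.0, 1.0)

def extend_attributes(X):
    """Append one membership-value column per original attribute."""
    X = np.asarray(X, dtype=float)
    new_cols = []
    for j in range(X.shape[1]):
        col = X[:, j]
        lower, upper = box_plot_domain(col)
        center = np.median(col)  # assumption: the median is the peak
        new_cols.append(triangular_membership(col, lower, center, upper))
    return np.hstack([X, np.column_stack(new_cols)])

def paired_t_statistic(errors_a, errors_b):
    """Paired t statistic for two matched sets of prediction errors."""
    d = np.asarray(errors_a, dtype=float) - np.asarray(errors_b, dtype=float)
    return d.mean() / (d.std(ddof=1) / np.sqrt(d.size))
```

In use, `extend_attributes(X)` would be applied to the small training set before fitting a regressor such as support vector regression, and `paired_t_statistic` would compare the matched errors of the models built with and without the extended attributes.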
Pages: 710-714
Page count: 5