Active learning sample selection - based on multicriteria

Authors
He, Zhonghai [1, 2]
Shen, Kun [1, 4]
Zhang, Xiaofang [3]
Affiliations
[1] Northeastern Univ Qinhuangdao, Sch Control Engn, Qinhuangdao, Peoples R China
[2] Hebei Key Lab Micronano Precis Opt Sensing & Meas, Qinhuangdao, Peoples R China
[3] Beijing Inst Technol, Sch Opt & Photon, Beijing, Peoples R China
[4] Northeastern Univ Qinhuangdao, Sch Control Engn, Qinhuangdao 066000, Peoples R China
Keywords
Multivariate calibration; multicriteria modeling; active learning; sample selection; CALIBRATION; REGRESSION; DENSITY; QUERY; SETS
DOI
10.1177/09670335231211618
Chinese Library Classification number
O69 [Applied Chemistry]
Discipline classification code
081704
Abstract
In multivariate calibration problems, model performance is affected significantly by the calibration samples used during model building. In recent years, active learning methods have become one of the best approaches to sample selection. However, most active learning methods select instances based only on prediction uncertainty or on distance in sample space, and these single-criterion methods tend to select undesired samples. In addition, sample density characterizes the spatial information carried by a sample, but few studies in quantitative analysis utilize sample density alone to select calibration samples. Considering these issues, this paper proposes an active learning sample selection method (DIDAL), based on the k-means clustering algorithm, which combines the three criteria of diversity, informativeness and sample density. The most representative sample is iteratively selected for addition to the calibration set for modeling and estimating the chemical concentration of analytes. Soybean meal and soy sauce samples were analyzed by DIDAL and compared with existing sample selection methods. The prediction results show that the DIDAL algorithm significantly outperforms several existing algorithms and is close to the performance of full-sample modeling. A model with high prediction accuracy can be constructed by selecting only a few samples using the DIDAL method.
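The idea of scoring candidates on several criteria at once can be illustrated with a minimal sketch. This is not the DIDAL algorithm: the abstract does not say how the three criteria are computed or combined, and the k-means step is omitted here. The function names (`select_next`, `min_max`, `euclidean`), the equal-weight sum, the k-nearest-neighbour density estimate, and the caller-supplied `uncertainty` scores are all assumptions made for illustration.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def min_max(values):
    """Rescale values to [0, 1]; a constant list maps to all ones."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [1.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


def select_next(pool, selected, uncertainty, k=2):
    """Return the index of the unselected sample with the highest combined score.

    Hypothetical scoring (an assumption, not taken from the paper):
      diversity       = distance to the nearest already-selected sample
      informativeness = caller-supplied prediction uncertainty
      density         = inverse mean distance to the k nearest pool neighbours
    Each criterion is min-max scaled and the three are summed with equal weight.
    """
    if not selected:
        raise ValueError("at least one sample must already be selected")
    candidates = [i for i in range(len(pool)) if i not in selected]

    diversity = [
        min(euclidean(pool[i], pool[j]) for j in selected) for i in candidates
    ]
    informativeness = [uncertainty[i] for i in candidates]

    density = []
    for i in candidates:
        dists = sorted(
            euclidean(pool[i], pool[j]) for j in range(len(pool)) if j != i
        )
        density.append(1.0 / (1e-12 + sum(dists[:k]) / k))

    scores = [
        d + u + r
        for d, u, r in zip(
            min_max(diversity), min_max(informativeness), min_max(density)
        )
    ]
    best = max(range(len(candidates)), key=scores.__getitem__)
    return candidates[best]


# Toy pool: three points near the origin, one distant outlier, one in between.
pool = [[0.0, 0.0], [0.1, 0.0], [0.0, 0.1], [5.0, 5.0], [1.0, 1.0]]
uncertainty = [0.0, 0.1, 0.1, 0.2, 0.9]  # e.g. spread of ensemble predictions
chosen = select_next(pool, selected={0}, uncertainty=uncertainty, k=2)
```

In an iterative setting the chosen index would be added to `selected`, the model refitted, and the uncertainty scores recomputed before the next call. The equal weighting is the simplest possible choice; a single-criterion selector corresponds to keeping only one of the three terms.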
Pages: 289-297
Page count: 9