Active learning sample selection - based on multicriteria

被引:0
|
作者
He, Zhonghai [1 ,2 ]
Shen, Kun [1 ,4 ]
Zhang, Xiaofang [3 ]
机构
[1] Northeastern Univ Qinhuangdao, Sch Control Engn, Qinhuangdao, Peoples R China
[2] Hebei Key Lab Micronano Precis Opt Sensing & Meas, Qinhuangdao, Peoples R China
[3] Beijing Inst Technol, Sch Opt & Photon, Beijing, Peoples R China
[4] Northeastern Univ Qinhuangdao, Sch Control Engn, Qinhuangdao 066000, Peoples R China
关键词
Multivariate calibration; multicriteria modeling; active learning; sample selection; CALIBRATION; REGRESSION; DENSITY; QUERY; SETS;
D O I
10.1177/09670335231211618
中图分类号
O69 [应用化学];
学科分类号
081704 ;
摘要
In multivariate calibration problems, model performance is affected significantly by the calibration samples used during model building. In recent years, active learning methods have become one of the best methods for sample selection. However, most active learning methods only select instances from prediction uncertainty or sample space distance, and these single-criteria methods tend to select undesired samples. In addition, sample density characterizes the spatial information carried by the sample, but few studies in quantitative analysis utilize sample density alone to select calibration samples. Considering these issues, based on the k-means clustering algorithm, this paper proposes an active learning sample selection method (DIDAL), which combines the three criteria of diversity, informativeness and sample density. The most representative sample is iteratively selected for - addition to the calibration set for modeling and estimating the chemical concentration of analytes. Soybean meal and soy sauce samples were analyzed by DIDAL and compared with existing sample selection methods. The prediction results show that the DIDAL algorithm significantly outperforms several existing algorithms and is close to the performance of full-sample modeling. A model with high prediction accuracy can be constructed by selecting only a few samples using the DIDAL method.
引用
收藏
页码:289 / 297
页数:9
相关论文
共 50 条
  • [1] Sample Selection based Active Learning for Imbalanced Data
    Chairi, Ikram
    Alaoui, Souad
    Lyhyaoui, Abdelouahid
    10TH INTERNATIONAL CONFERENCE ON SIGNAL-IMAGE TECHNOLOGY AND INTERNET-BASED SYSTEMS SITIS 2014, 2014, : 645 - 651
  • [2] Active partial label learning based on adaptive sample selection
    Yan Li
    Chang Liu
    Suyun Zhao
    Qiang Hua
    International Journal of Machine Learning and Cybernetics, 2022, 13 : 1603 - 1617
  • [3] Active partial label learning based on adaptive sample selection
    Li, Yan
    Liu, Chang
    Zhao, Suyun
    Hua, Qiang
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2022, 13 (06) : 1603 - 1617
  • [4] ALBS: An Active Learning Framework Based on Syncretic Sample Selection Strategy
    Pan, Longfei
    Wang, Xiaojun
    ICMLC 2019: 2019 11TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND COMPUTING, 2019, : 533 - 538
  • [5] Graph Node Based Interpretability Guided Sample Selection for Active Learning
    Mahapatra, Dwarikanath
    Poellinger, Alexander
    Reyes, Mauricio
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2023, 42 (03) : 661 - 673
  • [6] Performance enhancement-based active learning sample selection method
    He, Zhonghai
    Song, Shijie
    Shen, Kun
    Zhang, Xiaofang
    JOURNAL OF CHEMOMETRICS, 2022, 36 (03)
  • [7] A serial sample selection framework for active learning
    Li, Chengchao
    Zhao, Pengpeng
    Wu, Jian
    Xu, Haihui
    Cui, Zhiming
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2014, 8933 : 435 - 446
  • [8] A NEW METHOD FOR SAMPLE SELECTION IN ACTIVE LEARNING
    Chen, Wei
    Liu, Gang
    Guo, Jun
    Guo, Yu-Jing
    PROCEEDINGS OF 2009 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-6, 2009, : 2270 - 2274
  • [9] A Serial Sample Selection Framework for Active Learning
    Li, Chengchao
    Zhao, Pengpeng
    Wu, Jian
    Xu, Haihui
    Cui, Zhiming
    ADVANCED DATA MINING AND APPLICATIONS, ADMA 2014, 2014, 8933 : 435 - 446
  • [10] UNSUPERVISED SAMPLE SELECTION FOR ACTIVE LEARNING WITH QUADRATIC PROGRAMMING
    Wang, Yunbin
    Song, Na
    Wang, Shiping
    Journal of Applied and Numerical Optimization, 2024, 6 (03): : 339 - 350