An Variable Selection Method of the Significance Multivariate Correlation Competitive Population Analysis for Near-Infrared Spectroscopy in Chemical Modeling

Cited by: 8
Authors
Wang, Yuxi [1]
Jia, Zhenhong [1]
Yang, Jie [2]
Affiliations
[1] Xinjiang Univ, Coll Informat Sci & Engn, Urumqi 830046, Peoples R China
[2] Shanghai Jiao Tong Univ, Inst Image Proc & Pattern Recognit, Shanghai 200240, Peoples R China
Funding
U.S. National Science Foundation;
Keywords
Spectrochemical analysis; variable selection; the significant multivariate correlation; weighted bootstrap sampling; model population analysis; Monte Carlo sampling; analytical techniques; partial least squares method; PARTIAL LEAST-SQUARES; REGRESSION; SHRINKAGE; CALIBRATION; PROJECTION; STRATEGY; SPACE; OPTIMIZATION; PERSPECTIVE; WAVELENGTHS;
DOI
10.1109/ACCESS.2019.2954115
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Subject classification code
0812;
Abstract
The high dimensionality of spectral datasets makes it difficult to select the optimal subset of variables. This paper presents a new method for variable selection called the significant multivariate competitive population analysis (SMCPA), which combines ideas from significant multivariate correlation (SMC) and model population analysis, and employs weighted bootstrap sampling (WBS) and exponential decline function (EDF) competition methods. In this study, the values of the SMC distributions are used as an index for evaluating the importance of each wavelength. Then, based on the importance level of each wavelength, SMCPA sequentially selects N subsets of spectral wavelengths through N Monte Carlo sampling runs in an iterative and competitive procedure. In each sampling run, a fixed proportion of the samples is used to build a partial least-squares (PLS) calibration model, and SMC is then performed to obtain the score and threshold values. Next, based on the SMC scores, the key variables are selected in two steps: compulsory selection by the exponential decline function and competitive selection by adaptive weighted sampling. Finally, cross-validation (CV) is applied to select the subset with the lowest root mean square error. The method is tested on three NIR spectral datasets and compared against three high-performance variable selection methods. The experimental results show that the proposed algorithm has the highest efficiency and the best selection effect of the methods compared, and can usually locate the optimal combination of key wavelength variables in a dataset. The PLS models built on the selected variables also give the best evaluation results.
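The two-step selection described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not state the form of the exponential decline function, so the sketch assumes the form commonly used in competitive adaptive reweighted sampling (ratio 1 at the first run, 2/p at the last run for p variables). The function names `edf_ratio` and `select_variables` and the importance scores are hypothetical stand-ins; a real run would compute SMC scores from a fitted PLS model.

```python
import math
import random


def edf_ratio(i, n_runs, n_vars):
    """Fraction of variables retained at run i (1-based).

    Assumed CARS-style form r_i = a * exp(-k * i), chosen so that
    r_1 = 1 and r_{n_runs} = 2 / n_vars. Not taken from the abstract.
    """
    a = (n_vars / 2) ** (1 / (n_runs - 1))
    k = math.log(n_vars / 2) / (n_runs - 1)
    return a * math.exp(-k * i)


def select_variables(scores, n_keep, rng):
    """Compulsory EDF cut followed by weighted bootstrap sampling.

    scores: dict mapping variable index -> positive importance score
            (hypothetical stand-in for SMC scores).
    """
    n_keep = max(1, min(n_keep, len(scores)))
    # Compulsory step: keep only the n_keep highest-scoring variables.
    ranked = sorted(scores, key=scores.get, reverse=True)[:n_keep]
    # Competitive step: draw n_keep times with replacement, with
    # probability proportional to score; repeated draws collapse.
    weights = [scores[v] for v in ranked]
    drawn = rng.choices(ranked, weights=weights, k=n_keep)
    return sorted(set(drawn))


# Usage with made-up scores for 100 wavelengths and 50 sampling runs.
rng = random.Random(0)
n_vars, n_runs = 100, 50
scores = {j: 1.0 + (j % 7) for j in range(n_vars)}
n_keep = round(edf_ratio(10, n_runs, n_vars) * n_vars)
kept = select_variables(scores, n_keep, rng)
```

Because the competitive step samples with replacement, the surviving subset is usually smaller than `n_keep`, which is what lets high-scoring wavelengths crowd out weaker ones over successive runs.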
Pages: 167195-167209
Page count: 15