A Variable Selection Method of the Significance Multivariate Correlation Competitive Population Analysis for Near-Infrared Spectroscopy in Chemical Modeling

Cited by: 8
Authors
Wang, Yuxi [1 ]
Jia, Zhenhong [1 ]
Yang, Jie [2 ]
Affiliations
[1] Xinjiang Univ, Coll Informat Sci & Engn, Urumqi 830046, Peoples R China
[2] Shanghai Jiao Tong Univ, Inst Image Proc & Pattern Recognit, Shanghai 200240, Peoples R China
Funding
U.S. National Science Foundation;
Keywords
Spectrochemical analysis; variable selection; significant multivariate correlation; weighted bootstrap sampling; model population analysis; Monte Carlo sampling; analytical techniques; partial least squares method; PARTIAL LEAST-SQUARES; REGRESSION; SHRINKAGE; CALIBRATION; PROJECTION; STRATEGY; SPACE; OPTIMIZATION; PERSPECTIVE; WAVELENGTHS;
DOI
10.1109/ACCESS.2019.2954115
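Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812 ;
Abstract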
The high dimensionality of spectral datasets makes it difficult to select the optimal subset of variables. This paper presents a new variable selection method, significant multivariate competitive population analysis (SMCPA), which combines the ideas of significant multivariate correlation (SMC) and model population analysis and employs weighted bootstrap sampling (WBS) and exponential decline function (EDF) competition methods. In this study, the values of the SMC distributions are used as an index for evaluating the importance of each wavelength. Then, based on the importance level of each wavelength, SMCPA sequentially selects N subsets of spectral wavelengths through N Monte Carlo sampling runs in an iterative and competitive procedure. In each sampling run, a fixed ratio of samples is used to build a partial least-squares (PLS) calibration model, and SMC is then performed to obtain the score and threshold values. Next, based on the SMC scores, the key variables are selected in two steps: compulsory selection by the exponential decline function and competitive selection by adaptive weighted sampling. Finally, cross-validation (CV) is applied to select the subset with the lowest root mean square error. The method is tested on three NIR spectral datasets and compared against three high-performance variable selection methods. The experimental results show that, among the compared methods, the proposed algorithm has the highest efficiency and the best selection effect, and can usually locate the optimal combination of key wavelength variables in a dataset. Its evaluation result after PLS modeling is also the best.
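The two-step selection described in the abstract (compulsory EDF selection followed by competitive weighted sampling) can be illustrated with a minimal sketch. The abstract gives no formulas, so several pieces here are assumptions: the EDF is written in the form commonly used by CARS-style methods, `edf_ratio` and `select_step` are hypothetical helper names, and the importance scores are random placeholders standing in for SMC scores from a fitted PLS model. It is not the authors' implementation.

```python
import numpy as np


def edf_ratio(i, n_runs, n_vars):
    """Fraction of variables kept at sampling run i (1-based).

    Assumed CARS-style exponential decline: r_i = a * exp(-k * i),
    with a and k chosen so that r_1 = 1 and r_N = 2 / n_vars.
    """
    a = (n_vars / 2.0) ** (1.0 / (n_runs - 1))
    k = np.log(n_vars / 2.0) / (n_runs - 1)
    return a * np.exp(-k * i)


def select_step(scores, i, n_runs, rng):
    """One reduction step on non-negative importance scores.

    Step 1 (compulsory): keep the top-scoring variables, with the count
    set by the EDF ratio.
    Step 2 (competitive): weighted bootstrap sampling with replacement
    among the survivors; variables that are never drawn are dropped.
    Returns sorted indices into `scores`.
    """
    n_vars = scores.size
    n_keep = int(round(edf_ratio(i, n_runs, n_vars) * n_vars))
    n_keep = min(max(n_keep, 2), n_vars)
    survivors = np.argsort(scores)[::-1][:n_keep]

    weights = np.clip(scores[survivors], 0.0, None)
    if weights.sum() == 0.0:
        probs = np.full(n_keep, 1.0 / n_keep)  # fall back to uniform
    else:
        probs = weights / weights.sum()
    drawn = rng.choice(survivors, size=n_keep, replace=True, p=probs)
    return np.unique(drawn)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    placeholder_scores = rng.random(200)  # stand-in for SMC scores
    kept = select_step(placeholder_scores, i=25, n_runs=50, rng=rng)
    print(kept.size, "variables retained at run 25")
```

In a full procedure along the lines the abstract describes, each of the N runs would refit a PLS model on a random subset of samples, recompute the scores on the currently retained wavelengths, apply a step like `select_step`, and record the cross-validated error of the resulting subset so that the best subset can be chosen at the end.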
Pages: 167195-167209
Page count: 15