Application of random forest regression to spectral multivariate calibration

被引:29
|
作者
Ghasemi, Jahan B. [1 ]
Tavakoli, Hossein [1 ]
机构
[1] KN Toosi Univ Technol, Fac Sci, Dept Chem, Tehran, Iran
关键词
SIMULTANEOUS SPECTROPHOTOMETRIC DETERMINATION; CLASSIFICATION; SPECTROSCOPY; UNCERTAINTY; PARAMETERS; TREES; QSAR; TOOL;
D O I
10.1039/c3ay26338j
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
The performance of the random forest (RF) algorithm on the spectroscopic data was studied and compared by bootstrap aggregating of classification and regression trees (bagging CART), partial least squares (PLS) and nonlinear support vector machine (SVM) algorithms. The performances of these algorithms were investigated on four real data sets; these data sets were: (1) UV-Visible spectra of two cardiovascular drugs (hydrochlorothiazide and valsartan); (2) visible spectra of copper, cobalt and nickel complexes with 4-(2-pyridylazo) resorcinol (PAR) as chromogenic reagent; (3) near infrared spectra of corn samples, and (4) near infrared diffuse transmission spectra of pharmaceutical tablets. Results indicate that besides its comparable accuracy and mathematical simplicity, it is computationally fast and robust to noise. Therefore, RF is a useful tool for regression studies and has potential for modeling linear and nonlinear multivariate calibration.
引用
收藏
页码:1863 / 1871
页数:9
相关论文
共 50 条
  • [1] MULTIVARIATE CALIBRATION WITH SUPPORT VECTOR REGRESSION BASED ON RANDOM PROJECTION
    Chen, Shouyin
    Peng, Jiangtao
    [J]. INTERNATIONAL JOURNAL OF WAVELETS MULTIRESOLUTION AND INFORMATION PROCESSING, 2012, 10 (06)
  • [2] Multivariate Regression with Calibration
    Liu, Han
    Wang, Lie
    Zhao, Tuo
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014), 2014, 27
  • [3] Assessment of pile drivability using random forest regression and multivariate adaptive regression splines
    Zhang, Wengang
    Wu, Chongzhi
    Li, Yongqin
    Wang, Lin
    Samui, P.
    [J]. GEORISK-ASSESSMENT AND MANAGEMENT OF RISK FOR ENGINEERED SYSTEMS AND GEOHAZARDS, 2021, 15 (01) : 27 - 40
  • [4] Multivariate calibration of spectral data using dual-domain regression analysis
    Tan, HW
    Brown, SD
    [J]. ANALYTICA CHIMICA ACTA, 2003, 490 (1-2) : 291 - 301
  • [5] MCCV stacked regression for model combination and fast spectral interval selection in multivariate calibration
    Xu, Lu
    Jiang, Jlan-Hui
    Zhou, Yan-Ping
    Wu, Hai-Long
    Shen, Guo-Li
    Yu, Ru-Qin
    [J]. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2007, 87 (02) : 226 - 230
  • [6] Multivariate calibration with robust signal regression
    Li, Bin
    Marx, Brian D.
    Chakraborty, Somsubhra
    Weindorf, David C.
    [J]. STATISTICAL MODELLING, 2019, 19 (05) : 524 - 544
  • [7] An efficient method of wavelength interval selection based on random frog for multivariate spectral calibration
    Yun, Yong-Huan
    Li, Hong-Dong
    Wood, Leslie R. E.
    Fan, Wei
    Wang, Jia-Jun
    Cao, Dong-Sheng
    Xu, Qing-Song
    Liang, Yi-Zeng
    [J]. SPECTROCHIMICA ACTA PART A-MOLECULAR AND BIOMOLECULAR SPECTROSCOPY, 2013, 111 : 31 - 36
  • [8] Weighted random-effects regression models with application to interlaboratory calibration
    Gibbons, RD
    Bhaumik, DK
    [J]. TECHNOMETRICS, 2001, 43 (02) : 192 - 198
  • [9] A Novel Method for Disease Prediction: Hybrid of Random Forest and Multivariate Adaptive Regression Splines
    Yao, Dengju
    Yang, Jing
    Zhan, Xiaojuan
    [J]. JOURNAL OF COMPUTERS, 2013, 8 (01) : 170 - 177
  • [10] Machine Learning Predictive Models for Pile Drivability: An Evaluation of Random Forest Regression and Multivariate Adaptive Regression Splines
    Zhang, Wengang
    Wu, Chongzhi
    [J]. INFORMATION TECHNOLOGY IN GEO-ENGINEERING, 2020, : 243 - 255