Analysing spectroscopy data using two-step group penalized partial least squares regression

被引:2
|
作者
Chang, Le [1 ]
Wang, Jiali [2 ]
Woodgate, William [3 ,4 ]
机构
[1] Australia Natl Univ, Coll Business & Econ, Res Sch Finance Acturial Studies & Stat, Canberra, ACT, Australia
[2] Commonwealth Sci & Ind Res Org, Data61, Canberra, ACT, Australia
[3] Commonwealth Sci & Ind Res Org, Land & Water, Canberra, ACT, Australia
[4] Univ Queensland, Sch Earth & Environm Sci, Brisbane, Qld, Australia
基金
澳大利亚研究理事会;
关键词
Dimension reduction; Group lasso; Partial least squares regression; Reflectance spectrum; Spectroscopy; PRINCIPAL COMPONENT; GROUP LASSO; CLASSIFICATION; INDEX;
D O I
10.1007/s10651-021-00496-2
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
A statistical challenge to analyse hyperspectral data is the multicollinearity between spectral bands. Partial least squares (PLS) has been extensively used as a dimensionality reduction technique through constructing lower dimensional latent variables from the spectral bands that correlate with the response variables. However, it does not take into account the grouping structure of the full spectrum where spectral subsets may exhibit distinct relationships with the response variables. We propose a two-step group penalized PLS regression approach by performing a PLS regression on each group of predictors identified from a clustering approach in the first step. In the second step, a group penalty is imposed on the latent components to select the group with the highest predictive power. Our proposed method demonstrated a superior prediction performance, higher R-squared value and faster computation time over other PLS variations when applied to simulations and a real-world observational data set. Interpretations of the model performance are illustrated using the real-world data example of leaf spectra to indirectly quantify leaf traits. The method is implemented in an R package called "groupPLS", which is accessible from github.com/jialiwang1211/groupPLS.
引用
收藏
页码:445 / 467
页数:23
相关论文
共 50 条
  • [41] Partial least squares Cox regression for genome-wide data
    Ståle Nygård
    Ørnulf Borgan
    Ole Christian Lingjærde
    Hege Leite Størvold
    Lifetime Data Analysis, 2008, 14 : 179 - 195
  • [42] Data preprocessing and partial least squares regression analysis for reagentless determination of hemoglobin concentrations using conventional and total transmission spectroscopy
    Kim, YJ
    Kim, S
    Kim, JW
    Yoon, G
    JOURNAL OF BIOMEDICAL OPTICS, 2001, 6 (02) : 177 - 182
  • [43] Brightness-normalized Partial Least Squares Regression for hyperspectral data
    Feilhauer, Hannes
    Asner, Gregory P.
    Martin, Roberta E.
    Schmidtlein, Sebastian
    JOURNAL OF QUANTITATIVE SPECTROSCOPY & RADIATIVE TRANSFER, 2010, 111 (12-13): : 1947 - 1957
  • [44] Data Analysis of Roadway Attributes through Partial Least Squares Regression
    Li, Weiguo
    Zhang, Hanjie
    Du, Xiaoping
    Qian, Kun
    Li, Cuiying
    2010 2ND IEEE INTERNATIONAL CONFERENCE ON INFORMATION AND FINANCIAL ENGINEERING (ICIFE), 2010, : 466 - 468
  • [45] Application of partial least squares regression in data analysis of mining subsidence
    FENG Zun-de~(1
    2. Xuzhou Normal University
    Transactions of Nonferrous Metals Society of China, 2005, (S1) : 156 - 158
  • [46] Missing values estimation in microarray data with partial least squares regression
    Yang, Kun
    Li, Jianzhong
    Wang, Chaokun
    COMPUTATIONAL SCIENCE - ICCS 2006, PT 2, PROCEEDINGS, 2006, 3992 : 662 - 669
  • [47] A Two-Step Optimization of the Centro-Hermitian Form in Direct Data Domain Least Squares Approach
    Yilmaz, Muhittin
    Yilmazer, Nuri
    Bhumkar, Sunmeel
    Liu, Hongjiang
    IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2010), 2010,
  • [48] Boosted regression trees, multivariate adaptive regression splines and their two-step combinations with multiple linear regression or partial least squares to predict blood-brain barrier passage: A case study
    Deconinck, E.
    Zhang, M. H.
    Petitet, F.
    Dubus, E.
    Ijjaali, I.
    Coomans, D.
    Heyden, Y. Vander
    ANALYTICA CHIMICA ACTA, 2008, 609 (01) : 13 - 23
  • [49] Quantitative analysis of mixed hydrofluoric and nitric acids using Raman spectroscopy with partial least squares regression
    Kang, Gumin
    Lee, Kwangchil
    Park, Haesung
    Lee, Jinho
    Jung, Youngjean
    Kim, Kyoungsik
    Son, Boongho
    Park, Hyoungkuk
    TALANTA, 2010, 81 (4-5) : 1413 - 1417
  • [50] Direct determination of leather dyes by visible reflectance spectroscopy using partial least-squares regression
    Blanco, M
    Canals, T
    Coello, J
    Gené, J
    Iturriaga, H
    Maspoch, S
    ANALYTICA CHIMICA ACTA, 2000, 419 (02) : 209 - 214