ENSEMBLE REGRESSION COEFFICIENT ANALYSIS FOR APPLICATION TO NEAR-INFRARED SPECTROSCOPY
被引:3
|
作者:
Zheng, Kaiyi
论文数: 0引用数: 0
h-index: 0
机构:E China Univ Sci & Technol, Shanghai Key Lab Funct Mat Chem, Shanghai 200237, Peoples R China
Zheng, Kaiyi
Hu, Huilian
论文数: 0引用数: 0
h-index: 0
机构:E China Univ Sci & Technol, Shanghai Key Lab Funct Mat Chem, Shanghai 200237, Peoples R China
Hu, Huilian
Tong, Peijin
论文数: 0引用数: 0
h-index: 0
机构:E China Univ Sci & Technol, Shanghai Key Lab Funct Mat Chem, Shanghai 200237, Peoples R China
Tong, Peijin
Du, Yiping
论文数: 0引用数: 0
h-index: 0
机构:
E China Univ Sci & Technol, Shanghai Key Lab Funct Mat Chem, Shanghai 200237, Peoples R China
E China Univ Sci & Technol, Res Ctr Anal & Test, Shanghai 200237, Peoples R ChinaE China Univ Sci & Technol, Shanghai Key Lab Funct Mat Chem, Shanghai 200237, Peoples R China
Du, Yiping
[1
,2
]
机构:
[1] E China Univ Sci & Technol, Shanghai Key Lab Funct Mat Chem, Shanghai 200237, Peoples R China
[2] E China Univ Sci & Technol, Res Ctr Anal & Test, Shanghai 200237, Peoples R China
A new variable selection method called ensemble regression coefficient analysis is reported on the basis of model population analysis. In order to construct ensemble regression coefficients, many subsets of variables are randomly selected to calibrate corresponding partial least square models. Based on ensemble theory, the mean of regression coefficients of the models is set as the ensemble regression coefficient. Subsequently, the absolute value of the ensemble regression coefficient can be applied as an informative vector for variable selection. The performance of ensemble regression coefficient analysis was assessed by four near infrared datasets: two simulated datasets, one wheat dataset, and one tobacco dataset. The results showed that this approach can select important variables to obtain fewer errors compared with regression coefficient analysis and Monte Carlo uninformative variable elimination.