Linear dimensionality reduction for classification via a sequential Bayes error minimisation with an application to flow meter diagnostics

被引:20
|
作者
Gyamfi, Kojo Sarfo [1 ]
Brusey, James [1 ]
Hunt, Andrew [1 ]
Gaura, Elena [1 ]
机构
[1] Coventry Univ, Fac Engn & Comp, Coventry CV1 5FB, W Midlands, England
关键词
Linear dimensionality reduction; LDA; Heteroscedasticity; Bayes error; Flow meter diagnostics; DISCRIMINANT-ANALYSIS; DIRECT LDA; SYSTEM; BHATTACHARYYA;
D O I
10.1016/j.eswa.2017.09.010
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Supervised linear dimensionality reduction (LDR) performed prior to classification often improves the accuracy of classification by reducing overfitting and removing multicollinearity. If a Bayes classifier is to be used, then reduction to a dimensionality of K-1 is necessary and sufficient to preserve the classification information in the original feature space for the IC-class problem. However, most of the existing algorithms provide no optimal dimensionality to which to reduce the data, thus classification information can be lost in the reduced space if K-1 dimensions are used. In this paper, we present a novel LDR technique to reduce the dimensionality of the original data to K-1, such that it is well-primed for Bayesian classification. This is done by sequentially constructing linear classifiers that minimise the Bayes error via a gradient descent procedure, under an assumption of normality. We experimentally validate the proposed algorithm on 10 UCI datasets. Our algorithm is shown to be superior in terms of the classification accuracy when compared to existing algorithms including LDR based on Fisher's criterion and the Chernoff criterion. The applicability of our algorithm is then demonstrated by employing it in diagnosing the health states of 2 ultrasonic flow meters. As with the UCI datasets, the proposed algorithm is found to have superior performance to the existing algorithms, achieving classification accuracies of 99.4% and 97.5% on the two flow meters. Such high classification accuracies on the flow meters promise significant cost benefits in oil and gas operations. (C) 2017 Elsevier Ltd. All rights reserved.
引用
收藏
页码:252 / 262
页数:11
相关论文
共 4 条