Outlier modeling for spectral data reduction

Cited by: 7
Authors
Agahian, Farnaz [1 ]
Funt, Brian [1 ]
Affiliations
[1] Simon Fraser Univ, Sch Comp Sci, Burnaby, BC V5A 1S6, Canada
Keywords
PRINCIPAL COMPONENT ANALYSIS; IMAGE COMPRESSION; REFLECTANCE SPECTRA;
DOI
10.1364/JOSAA.31.001445
Chinese Library Classification (CLC) number
O43 [Optics];
Discipline classification code
070207 ; 0803 ;
Abstract
The spectra in spectral reflectance datasets tend to be quite correlated and therefore they can be represented more compactly using standard techniques such as principal components analysis (PCA) as part of a lossy compression strategy. However, the presence of outlier spectra can often increase the overall error of the reconstructed spectra. This paper introduces a new outlier modeling (OM) method that detects, clusters, and separately models outliers with their own set of basis vectors. Outliers are defined in terms of the robust Mahalanobis distance using the fast minimum covariance determinant algorithm as a robust estimator of the multivariate mean and covariance from which it is computed. After removing the outliers from the main dataset, the performance of PCA on the remaining data improves significantly; however, since outlier spectra are a part of the image, they cannot simply be ignored. The solution is to cluster the outliers into a small number of clusters and then model each cluster separately using its own cluster-specific PCA-derived bases. Tests show that OM leads to lower spectral reconstruction errors of reflectance spectra in terms of both normalized RMS and goodness of fit. © 2014 Optical Society of America
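The pipeline described in the abstract (robust outlier detection, clustering of the outliers, and a separate PCA basis per group) can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes scikit-learn's `MinCovDet` as the fast minimum covariance determinant estimator, a chi-square quantile as the distance cutoff, and k-means as the clustering step; the abstract names none of the last two, and the function names `fit_reconstruct` and `outlier_modeling` and all parameter defaults are hypothetical.

```python
import numpy as np
from scipy.stats import chi2
from sklearn.cluster import KMeans
from sklearn.covariance import MinCovDet
from sklearn.decomposition import PCA


def fit_reconstruct(X, n_components):
    """Fit PCA on X alone and return X reconstructed from its leading components."""
    k = min(n_components, X.shape[0], X.shape[1])
    pca = PCA(n_components=k).fit(X)
    return pca.inverse_transform(pca.transform(X))


def outlier_modeling(X, n_components=3, n_clusters=2, quantile=0.975, seed=0):
    """Reconstruct the rows of X (one spectrum per row) with separate bases.

    Returns the reconstructed array and a boolean mask marking the rows
    that were treated as outliers.
    """
    # Robust mean and covariance via the minimum covariance determinant estimator.
    mcd = MinCovDet(random_state=seed).fit(X)
    # mahalanobis() returns squared robust distances.
    d2 = mcd.mahalanobis(X)
    # Assumed cutoff: chi-square quantile with one degree of freedom per dimension.
    is_outlier = d2 > chi2.ppf(quantile, df=X.shape[1])

    X_hat = np.empty_like(X, dtype=float)
    # Main dataset: one PCA basis fitted without the outliers.
    X_hat[~is_outlier] = fit_reconstruct(X[~is_outlier], n_components)

    outliers = X[is_outlier]
    if len(outliers) >= n_clusters:
        # Assumed clustering step: k-means on the outlier spectra.
        km = KMeans(n_clusters=n_clusters, n_init=10, random_state=seed)
        labels = km.fit_predict(outliers)
        rec = np.empty_like(outliers, dtype=float)
        for c in range(n_clusters):
            # Each cluster gets its own PCA-derived basis.
            rec[labels == c] = fit_reconstruct(outliers[labels == c], n_components)
        X_hat[is_outlier] = rec
    else:
        # Too few outliers to cluster: keep them unchanged in this sketch.
        X_hat[is_outlier] = outliers
    return X_hat, is_outlier
```

Comparing the reconstruction error of `outlier_modeling(X)` with that of `fit_reconstruct(X, n_components)` on the same data gives a rough feel for the effect the abstract reports. Note that this sketch ignores the storage cost of the extra bases, which a real compression scheme would have to account for.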
Pages: 1445-1452
Page count: 8