Mixture model based multivariate statistical analysis of multiply censored environmental data

Cited by: 22
Authors
He, Jianxun [1]
Affiliations
[1] Lakehead Univ, Dept Civil Engn, Thunder Bay, ON P7B 5E1, Canada
Funding
Natural Sciences and Engineering Research Council of Canada
Keywords
Water quality; Gaussian mixture model; Maximum likelihood estimation; Censored data; Detection limit; WATER-QUALITY CONCENTRATIONS; MAXIMUM-LIKELIHOOD;
DOI
10.1016/j.advwatres.2013.05.001
Chinese Library Classification number
TV21 [Water resources survey and water conservancy planning]
Discipline classification code
081501
Abstract
Environmental data are commonly constrained by a detection limit (DL) because of the limitations of experimental apparatus. In particular, because of changes in experimental units or assay methods, the observed data are often censored at more than one DL. Measurements below the DLs are typically replaced by an arbitrary value, such as zero, half the DL, or the DL itself, for convenience of analysis. However, this substitution approach is widely considered unreliable and prone to bias. In contrast, maximum likelihood estimation (MLE) methods for censored data have been developed that offer better performance and statistical justification. Existing MLE methods, however, seldom address the multivariate context of censored environmental data, especially for water quality. This paper proposes using a mixture model to flexibly approximate the underlying distribution of the observed data, owing to its good approximation capability and generation mechanism. The study focuses mainly on the Gaussian mixture model (GMM). To cope with censored data with multiple DLs, an expectation-maximization (EM) algorithm in a multivariate setting is developed. The proposed statistical analysis approach is verified on both simulated data and real water quality data. (C) 2013 Elsevier Ltd. All rights reserved.
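To make the contrast between substitution and likelihood-based estimation concrete, the following is a minimal sketch only, and not the paper's method: it fits a single univariate Gaussian (rather than a multivariate Gaussian mixture) to left-censored data with two detection limits using an EM iteration. The function names, the simulated values, and the two DLs (0.5 and 1.0) are all illustrative assumptions.

```python
import math
import random


def norm_pdf(z):
    """Standard normal density."""
    return math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)


def norm_cdf(z):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def simulate_censored(n=2000, mu=1.0, sigma=1.0, dls=(0.5, 1.0), seed=0):
    """Hypothetical data: the first half is censored at dls[0], the rest at dls[1].

    For a censored record the stored value is the DL that applied to it.
    """
    rng = random.Random(seed)
    values, censored = [], []
    for i in range(n):
        dl = dls[0] if i < n // 2 else dls[1]
        x = rng.gauss(mu, sigma)
        if x < dl:
            values.append(dl)
            censored.append(True)
        else:
            values.append(x)
            censored.append(False)
    return values, censored


def substitution_mean(values, censored):
    """Naive estimate: replace every non-detect with half of its DL."""
    total = sum(0.5 * v if c else v for v, c in zip(values, censored))
    return total / len(values)


def censored_normal_em(values, censored, n_iter=200):
    """EM for a single Gaussian with left-censored observations.

    E-step: first and second moments of a normal truncated above at the DL.
    M-step: the usual mean and variance updates using those expected moments.
    """
    n = len(values)
    # Initialise from the data with non-detects set equal to their DL.
    mu = sum(values) / n
    var = sum((v - mu) ** 2 for v in values) / n
    for _ in range(n_iter):
        sigma = math.sqrt(var)
        s1 = 0.0  # running sum of E[x]
        s2 = 0.0  # running sum of E[x^2]
        for v, c in zip(values, censored):
            if c:
                alpha = (v - mu) / sigma
                lam = norm_pdf(alpha) / max(norm_cdf(alpha), 1e-300)
                m1 = mu - sigma * lam
                m2 = var * (1.0 - alpha * lam - lam * lam) + m1 * m1
            else:
                m1 = v
                m2 = v * v
            s1 += m1
            s2 += m2
        mu = s1 / n
        var = max(s2 / n - mu * mu, 1e-12)
    return mu, math.sqrt(var)


if __name__ == "__main__":
    vals, cens = simulate_censored()
    print("half-DL substitution mean:", substitution_mean(vals, cens))
    print("censored EM (mean, sd):   ", censored_normal_em(vals, cens))
```

In this simulated setting the half-DL substitution tends to pull the mean away from the true value of 1.0, whereas the EM estimate accounts for the probability mass below each DL. Extending the idea to several correlated variables and several mixture components, as the abstract describes, requires truncated multivariate normal moments and is beyond this sketch.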
Pages: 15-24
Page count: 10
Related papers
50 in total
  • [1] STATISTICAL-INFERENCE FROM MULTIPLY CENSORED ENVIRONMENTAL DATA
    ELSHAARAWI, AH
    NADERI, A
    [J]. ENVIRONMENTAL MONITORING AND ASSESSMENT, 1991, 17 (2-3) : 339 - 347
  • [2] Multivariate statistical analysis of environmental data
    Brzezinska, Justyna
    Rybicka, Aneta
    Pelka, Marcin
    [J]. 12TH PROFESSOR ALEKSANDER ZELIAS INTERNATIONAL CONFERENCE ON MODELLING AND FORECASTING OF SOCIO-ECONOMIC PHENOMENA, 2018, 1 : 40 - 49
  • [3] A Competing Risks Model With Multiply Censored Reliability Data Under Multivariate Weibull Distributions
    Fan, Tsai-Hung
    Wang, Yi-Fu
    Ju, She-Kai
    [J]. IEEE TRANSACTIONS ON RELIABILITY, 2019, 68 (02) : 462 - 475
  • [4] Multivariate statistical analysis of environmental monitoring data
    Ross, DL
    [J]. GROUND WATER, 1997, 35 (06) : 1050 - 1057
  • [5] A doubly multivariate model for statistical analysis of spatio-temporal environmental data
    Dutilleul, P
    PinelAlloul, B
    [J]. ENVIRONMETRICS, 1996, 7 (06) : 551 - 565
  • [6] ANALYSIS OF MULTIPLY CENSORED RUN-IN DATA
    ELERATH, J
    [J]. PROCEEDINGS ANNUAL RELIABILITY AND MAINTAINABILITY SYMPOSIUM, 1990, (SYM): : 501 - 506
  • [7] A factor mixture analysis model for multivariate binary data
    Cagnone, Silvia
    Viroli, Cinzia
    [J]. STATISTICAL MODELLING, 2012, 12 (03) : 257 - 277
  • [8] Statistical inference for the Burr model based on progressively censored data
    Mousa, MAMA
    Jaheen, ZF
    [J]. COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2002, 43 (10-11) : 1441 - 1449
  • [9] A METHOD FOR GRAPHICAL ANALYSIS OF MULTIPLY CENSORED LIFE DATA
    NELSON, W
    [J]. TECHNOMETRICS, 1969, 11 (01) : 218 - &
  • [10] MEMO: multi-experiment mixture model analysis of censored data
    Geissen, Eva-Maria
    Hasenauer, Jan
    Heinrich, Stephanie
    Hauf, Silke
    Theis, Fabian J.
    Radde, Nicole E.
    [J]. BIOINFORMATICS, 2016, 32 (16) : 2464 - 2472