An R package for identification of outliers in environmental time series data

被引:7
|
作者
Campulova, Martina [1 ]
Campula, Roman [2 ]
Holesovsky, Jan [3 ]
机构
[1] Mendel Univ Brno, Fac Econ, Dept Stat & Operat Res, Zemedelska 1, Brno 61300, Czech Republic
[2] Transport Res Ctr, CDV, Lisenska 33a, Brno 63600, Czech Republic
[3] Brno Univ Technol, Inst Math & Descript Geometry, Fac Civil Engn, Veveri 95, Brno 60200, Czech Republic
关键词
Outlier; Data validation; Kernel regression; Environmental data; R package; CHANGE-POINT ANALYSIS; BINARY SEGMENTATION; MULTIPLE; ESTIMATOR;
D O I
10.1016/j.envsoft.2022.105435
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Environmental data often include outliers that may significantly affect further modelling and data analysis. Although a number of outlier detection methods have been proposed, their use is usually complicated by the assumption of the distribution or model of the analyzed data. However, environmental variables are quite often influenced by many different factors and their distribution is difficult to estimate. The envoutliers package has been developed to provide users with a choice of recently presented, semi-parametric outlier detection methods that do not impose requirements on the distribution of the original data. This paper briefly describes the methodology as well as its implementation in the package. The application is illustrated on real data examples.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] svars: An R Package for Data-Driven Identification in Multivariate Time Series Analysis
    Lange, Alexander
    Dalheimer, Bernhard
    Herwartz, Helmut
    Maxand, Simone
    JOURNAL OF STATISTICAL SOFTWARE, 2021, 97 (05): : 1 - 34
  • [2] OUTLIERS IN TIME SERIES DATA
    Deneshkumar, V.
    Kannan, K. Senthamarai
    INTERNATIONAL JOURNAL OF AGRICULTURAL AND STATISTICAL SCIENCES, 2011, 7 (02): : 685 - 691
  • [3] ENVIRONMENTAL DATA MANAGEMENT - IDENTIFICATION OF OUTLIERS
    MARSDEN, JR
    PINGRY, DE
    WHINSTON, AB
    JOURNAL OF ENVIRONMENTAL ECONOMICS AND MANAGEMENT, 1976, 3 (02) : 154 - 163
  • [4] Identification and Correction of Outliers in Wind Farm Time Series Power Data
    Ye, Xi
    Lu, Zongxiang
    Qiao, Ying
    Min, Yong
    O'Malley, Mark
    IEEE TRANSACTIONS ON POWER SYSTEMS, 2016, 31 (06) : 4197 - 4205
  • [5] tsrobprep - an R package for robust preprocessing of time series data
    Narajewski, Michal
    Kley-Holsteg, Jens
    Ziel, Florian
    SOFTWAREX, 2021, 16
  • [6] Nonparametric algorithm for identification of outliers in environmental data
    Campulova, Martina
    Michalek, Jaroslav
    Mikuska, Pavel
    Bokal, Drago
    JOURNAL OF CHEMOMETRICS, 2018, 32 (05)
  • [7] Outliers detect methods for time series data
    Liang, T. X.
    Cao, C. X.
    JOURNAL OF DISCRETE MATHEMATICAL SCIENCES & CRYPTOGRAPHY, 2018, 21 (04): : 927 - 936
  • [8] DETECTING OUTLIERS IN TIME-SERIES DATA
    CHERNICK, MR
    DOWNING, DJ
    PIKE, DH
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1982, 77 (380) : 743 - 747
  • [9] Outliers Mining in Time Series Data Sets
    Zheng Binxiang
    Journal of Systems Engineering and Electronics, 2002, (01) : 93 - 97
  • [10] Outliers in financial time series data: Outliers, margin debt, and economic recession
    Lee, Kangbok
    Jeong, Yeasung
    Joo, Sunghoon
    Yoon, Yeo Song
    Han, Sumin
    Baik, Hyeoncheol
    MACHINE LEARNING WITH APPLICATIONS, 2022, 10