Outlier detection in astronomical data

被引:6
|
作者
Zhang, YX [1 ]
Luo, A [1 ]
Zhao, YH [1 ]
机构
[1] Chinese Acad Sci, Natl Astron Observ, Beijing 100864, Peoples R China
关键词
outlier-data mining-data mining applications-algorithms-exceptions;
D O I
10.1117/12.550998
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
Astronomical data sets have experienced an unprecedented and continuing growth in the volume, quality, and complexity over the past few years, driven by the advances in telescope, detector, and computer technology. Like many other fields, astronomy has become a very data rich science. Information content measured in multiple Terabytes, and even larger, multi Petabyte data sets are on the horizon. To cope with this data flood, Virtual Observatory (VO) federates data archives and services representing a new information infrastructure for astronomy of the 21st century and provides the platform to science discovery. Data mining promises to both make the scientific utilization of these data sets more effective and more complete, and to open completely new avenues of astronomical research. Technological problems range from the issues of database design and federation, to data mining and advanced visualization, leading to a new toolkit for astronomical research. This is similar to challenges encountered in other data intensive fields today. Outlier detection is of great importance. as one of four knowledge discovery tasks. The identification of outliers can often lead to the discovery of truly unexpected knowledge in various fields. Especially in astronomy, the great interest of astronomers is to discover unusual, rare or unknown types of astronomical objects or phenomena. The outlier detection approaches in large datasets correctly meet the need of astronomers. In this paper we provide an overview of some techniques for automated identification of outliers in multivariate data. Outliers often provide useful information. Their identification is important not only for improving the analysis but also for indicating anomalies which may require further investigation. The technique may be used in the process of data preprocessing and also be used for preselecting special object candidates.
引用
收藏
页码:521 / 529
页数:9
相关论文
共 50 条
  • [1] Outlier Detection based on Transformations for Astronomical Time Series
    Romero, Mauricio
    Estevez, Pablo A.
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [2] Outlier detection in interval data
    A. Pedro Duarte Silva
    Peter Filzmoser
    Paula Brito
    Advances in Data Analysis and Classification, 2018, 12 : 785 - 822
  • [3] Outlier detection for skewed data
    Hubert, Mia
    Van der Veeken, Stephan
    JOURNAL OF CHEMOMETRICS, 2008, 22 (3-4) : 235 - 246
  • [4] Outlier detection in skewed data
    Meropi, Pavlidou
    Bikos, Christoforos
    George, Zioutas
    SIMULATION MODELLING PRACTICE AND THEORY, 2018, 87 : 191 - 209
  • [5] Outlier detection in interval data
    Duarte Silva, A. Pedro
    Filzmoser, Peter
    Brito, Paula
    ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2018, 12 (03) : 785 - 822
  • [6] Outlier detection in transactional data
    Dash, Manoranjan
    Lie, Ng Wil
    INTELLIGENT DATA ANALYSIS, 2010, 14 (03) : 283 - 298
  • [7] ENSEMBLE LEARNING METHOD FOR OUTLIER DETECTION AND ITS APPLICATION TO ASTRONOMICAL LIGHT CURVES
    Nun, Isadora
    Protopapas, Pavlos
    Sim, Brandon
    Chen, Wesley
    ASTRONOMICAL JOURNAL, 2016, 152 (03):
  • [8] Outlier Detection Algorithms in Data Mining
    Xi, Jingke
    2008 INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY APPLICATION, VOL I, PROCEEDINGS, 2008, : 94 - 97
  • [9] Unsupervised outlier detection in multidimensional data
    Atiq ur Rehman
    Samir Brahim Belhaouari
    Journal of Big Data, 8
  • [10] Outlier detection for multivariate categorical data
    Puig, Xavier
    Ginebra, Josep
    QUALITY AND RELIABILITY ENGINEERING INTERNATIONAL, 2018, 34 (07) : 1400 - 1412