Outlier detection in astronomical data

被引:6
|
作者
Zhang, YX [1 ]
Luo, A [1 ]
Zhao, YH [1 ]
机构
[1] Chinese Acad Sci, Natl Astron Observ, Beijing 100864, Peoples R China
关键词
outlier-data mining-data mining applications-algorithms-exceptions;
D O I
10.1117/12.550998
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
Astronomical data sets have experienced an unprecedented and continuing growth in the volume, quality, and complexity over the past few years, driven by the advances in telescope, detector, and computer technology. Like many other fields, astronomy has become a very data rich science. Information content measured in multiple Terabytes, and even larger, multi Petabyte data sets are on the horizon. To cope with this data flood, Virtual Observatory (VO) federates data archives and services representing a new information infrastructure for astronomy of the 21st century and provides the platform to science discovery. Data mining promises to both make the scientific utilization of these data sets more effective and more complete, and to open completely new avenues of astronomical research. Technological problems range from the issues of database design and federation, to data mining and advanced visualization, leading to a new toolkit for astronomical research. This is similar to challenges encountered in other data intensive fields today. Outlier detection is of great importance. as one of four knowledge discovery tasks. The identification of outliers can often lead to the discovery of truly unexpected knowledge in various fields. Especially in astronomy, the great interest of astronomers is to discover unusual, rare or unknown types of astronomical objects or phenomena. The outlier detection approaches in large datasets correctly meet the need of astronomers. In this paper we provide an overview of some techniques for automated identification of outliers in multivariate data. Outliers often provide useful information. Their identification is important not only for improving the analysis but also for indicating anomalies which may require further investigation. The technique may be used in the process of data preprocessing and also be used for preselecting special object candidates.
引用
收藏
页码:521 / 529
页数:9
相关论文
共 50 条
  • [21] Universal outlier detection for PIV data
    Westerweel, J
    Scarano, F
    EXPERIMENTS IN FLUIDS, 2005, 39 (06) : 1096 - 1100
  • [22] Outlier Detection for Temporal Data: A Survey
    Gupta, Manish
    Gao, Jing
    Aggarwal, Charu C.
    Han, Jiawei
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014, 26 (09) : 2250 - 2267
  • [23] Outlier Detection Based on the Data Structure
    Guo, Feng
    Shi, Canghong
    Li, Xiaojie
    He, Jia
    Wu, Xi
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [24] Using data images for outlier detection
    Marchette, DJ
    Solka, JL
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2003, 43 (04) : 541 - 552
  • [25] Outlier detection in process plant data
    Chen, J.
    Bandoni, A.
    Romagnoli, J.A.
    Computers and Chemical Engineering, 1998, 22 (4 /5): : 641 - 646
  • [26] Universal outlier detection for PIV data
    Jerry Westerweel
    Fulvio Scarano
    Experiments in Fluids, 2005, 39 : 1096 - 1100
  • [27] Unsupervised outlier detection in multidimensional data
    Ur Rehman, Atiq
    Belhaouari, Samir Brahim
    JOURNAL OF BIG DATA, 2021, 8 (01)
  • [28] Outlier detection for high dimensional data
    Aggarwal, CC
    Yu, PS
    SIGMOD RECORD, 2001, 30 (02) : 37 - 46
  • [29] The Influence of Data Preparation on Outlier Detection in Driveability Data
    Ramsauer A.
    Baumann P.M.
    Lex C.
    SN Computer Science, 2021, 2 (3)
  • [30] Outlier detection algorithms in data mining systems
    Petrovskiy, MI
    PROGRAMMING AND COMPUTER SOFTWARE, 2003, 29 (04) : 228 - 237