Outlier detection in astronomical data

被引:6
|
作者
Zhang, YX [1 ]
Luo, A [1 ]
Zhao, YH [1 ]
机构
[1] Chinese Acad Sci, Natl Astron Observ, Beijing 100864, Peoples R China
关键词
outlier-data mining-data mining applications-algorithms-exceptions;
D O I
10.1117/12.550998
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
Astronomical data sets have experienced an unprecedented and continuing growth in the volume, quality, and complexity over the past few years, driven by the advances in telescope, detector, and computer technology. Like many other fields, astronomy has become a very data rich science. Information content measured in multiple Terabytes, and even larger, multi Petabyte data sets are on the horizon. To cope with this data flood, Virtual Observatory (VO) federates data archives and services representing a new information infrastructure for astronomy of the 21st century and provides the platform to science discovery. Data mining promises to both make the scientific utilization of these data sets more effective and more complete, and to open completely new avenues of astronomical research. Technological problems range from the issues of database design and federation, to data mining and advanced visualization, leading to a new toolkit for astronomical research. This is similar to challenges encountered in other data intensive fields today. Outlier detection is of great importance. as one of four knowledge discovery tasks. The identification of outliers can often lead to the discovery of truly unexpected knowledge in various fields. Especially in astronomy, the great interest of astronomers is to discover unusual, rare or unknown types of astronomical objects or phenomena. The outlier detection approaches in large datasets correctly meet the need of astronomers. In this paper we provide an overview of some techniques for automated identification of outliers in multivariate data. Outliers often provide useful information. Their identification is important not only for improving the analysis but also for indicating anomalies which may require further investigation. The technique may be used in the process of data preprocessing and also be used for preselecting special object candidates.
引用
收藏
页码:521 / 529
页数:9
相关论文
共 50 条
  • [41] Outlier Detection in Cellular Network Data Exploration
    Multanen, Mikko
    Raivio, Kimmo
    Lehtimaki, Pasi
    2008 22ND INTERNATIONAL WORKSHOPS ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS, VOLS 1-3, 2008, : 1323 - 1328
  • [42] Discussion of Outlier Detection Methods of Purchasing Data
    Kono, Katsuya
    Yamamoto, Yoshiro
    2016 14TH INTERNATIONAL CONFERENCE ON ICT AND KNOWLEDGE ENGINEERING (ICT&KE), 2016, : 12 - 18
  • [43] Outlier and anomaly pattern detection on data streams
    Park, Cheong Hee
    JOURNAL OF SUPERCOMPUTING, 2019, 75 (09): : 6118 - 6128
  • [44] Outlier Detection Algorithms in Data Mining Systems
    M. I. Petrovskiy
    Programming and Computer Software, 2003, 29 : 228 - 237
  • [45] Attribute Outlier Detection over Data Streams
    Cao, Hui
    Zhou, Yongluan
    Shou, Lidan
    Chen, Gang
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PT II, PROCEEDINGS, 2010, 5982 : 216 - +
  • [46] Differentially Private Outlier Detection in Correlated Data
    Degue, Kwassi H.
    Gopalakrishnan, Karthik
    Li, Max Z.
    Balakrishnan, Hamsa
    2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021, : 2735 - 2742
  • [47] Outlier Detection in Streaming Data A research Perspective
    Chugh, Neeraj
    Chugh, Mitali
    Agarwal, Alok
    2014 INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND GRID COMPUTING (PDGC), 2014, : 429 - 432
  • [48] Outlier detection in chemical data by fractal analysis
    Cramer, JA
    Shah, SS
    Battaglia, TM
    Banerji, SN
    Obando, LA
    Booksh, KS
    JOURNAL OF CHEMOMETRICS, 2004, 18 (7-8) : 317 - 326
  • [49] Outlier detection from multiple data sources
    Ma, Yang
    Zhao, Xujun
    Zhang, Chaowei
    Zhang, Jifu
    Qin, Xiao
    INFORMATION SCIENCES, 2021, 580 : 819 - 837
  • [50] Robust transformations and outlier detection with autocorrelated data
    Cerioli, A
    Riani, M
    FROM DATA AND INFORMATION ANALYSIS TO KNOWLEDGE ENGINEERING, 2006, : 262 - +