GPZ: non-stationary sparse Gaussian processes for heteroscedastic uncertainty estimation in photometric redshifts

被引:85
|
作者
Almosallam, Ibrahim A. [1 ,2 ]
Jarvis, Matt J. [3 ,4 ]
Roberts, Stephen J. [2 ]
机构
[1] King Abdulaziz City Sci & Technol, Riyadh 1142, Saudi Arabia
[2] Parks Rd, Oxford OX1 3PJ, England
[3] Oxford Astrophys, Dept Phys, Keble Rd, Oxford OX1 3RH, England
[4] Univ Western Cape, Dept Phys, ZA-7535 Bellville, South Africa
关键词
methods: data analysis; galaxies: distances and redshifts; PROCESS REGRESSION; PREDICTION; SDSS;
D O I
10.1093/mnras/stw1618
中图分类号
P1 [天文学];
学科分类号
0704 ;
摘要
The next generation of cosmology experiments will be required to use photometric redshifts rather than spectroscopic redshifts. Obtaining accurate and well-characterized photometric redshift distributions is therefore critical for Euclid, the Large Synoptic Survey Telescope and the Square Kilometre Array. However, determining accurate variance predictions alongside single point estimates is crucial, as they can be used to optimize the sample of galaxies for the specific experiment (e.g. weak lensing, baryon acoustic oscillations, supernovae), trading off between completeness and reliability in the galaxy sample. The various sources of uncertainty in measurements of the photometry and redshifts put a lower bound on the accuracy that any model can hope to achieve. The intrinsic uncertainty associated with estimates is often non-uniform and input-dependent, commonly known in statistics as heteroscedastic noise. However, existing approaches are susceptible to outliers and do not take into account variance induced by non-uniform data density and in most cases require manual tuning of many parameters. In this paper, we present a Bayesian machine learning approach that jointly optimizes the model with respect to both the predictive mean and variance we refer to as Gaussian processes for photometric redshifts (GPZ). The predictive variance of the model takes into account both the variance due to data density and photometric noise. Using the Sloan Digital Sky Survey (SDSS) DR12 data, we show that our approach substantially outperforms other machine learning methods for photo-z estimation and their associated variance, such as TPZ and ANNZ2. We provide a MATLAB and PYTHON implementations that are available to download at https://github.com/OxfordML/GPz.
引用
收藏
页码:726 / 739
页数:14
相关论文
共 50 条
  • [31] Predicting non-stationary processes
    Ryabko, Daniil
    Hutter, Marcus
    APPLIED MATHEMATICS LETTERS, 2008, 21 (05) : 477 - 482
  • [32] Surveillance of non-stationary processes
    Taras Lazariv
    Wolfgang Schmid
    AStA Advances in Statistical Analysis, 2019, 103 : 305 - 331
  • [33] Harmonics estimation of power signals in presence of non-Gaussian and non-stationary noise
    Yadav, Pravir
    Piri, Jayanta
    Sengupta, Aparajita
    Sengupta, Mainak
    INTERNATIONAL JOURNAL OF CONTROL, 2025,
  • [34] Hierarchical Sparse Estimation of Non-Stationary Channel for Uplink Massive MIMO Systems
    Tan, Chongyang
    Cai, Donghong
    Fang, Fang
    Shan, Jiahao
    Xu, Yanqing
    Ding, Zhiguo
    Fan, Pingzhi
    IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 6640 - 6645
  • [35] Non-stationary Sparse Fading Channel Estimation for Next Generation Mobile Systems
    Dehgan, Saadat
    Ghobadi, Changiz
    Nourinia, Javad
    Yang, Jie
    Gui, Guan
    Mostafapour, Ehsan
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2018, 12 (03): : 1047 - 1062
  • [36] Non-stationary noise estimation using dictionary learning and Gaussian mixture models
    Hughes, James M.
    Rockmore, Daniel N.
    Wang, Yang
    IMAGE PROCESSING: ALGORITHMS AND SYSTEMS XII, 2014, 9019
  • [37] A sparse Gaussian process framework for photometric redshift estimation
    Almosallam, Ibrahim A.
    Lindsay, Sam N.
    Jarvis, Matt J.
    Roberts, Stephen J.
    MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2016, 455 (03) : 2387 - 2401
  • [38] Current Derivative Estimation of Non-stationary Processes Based on Metrical Information
    Kochegurova, Elena
    Gorokhova, Ekaterina
    COMPUTATIONAL COLLECTIVE INTELLIGENCE (ICCCI 2015), PT II, 2015, 9330 : 512 - 519
  • [39] USE OF NON-STATIONARY STOCHASTIC PROCESSES IN ORE RESERVE RELIABILITY ESTIMATION
    BORGMAN, LE
    CANADIAN MINING AND METALLURGICAL BULLETIN, 1970, 63 (696): : 441 - &
  • [40] Maximum likelihood estimation for nearly non-stationary stable autoregressive processes
    Zhang, Rong-Mao
    Chan, Ngai Hang
    JOURNAL OF TIME SERIES ANALYSIS, 2012, 33 (04) : 542 - 553