Data-driven templates with dictionary learning and sparse representations for photometric redshift estimation

被引:0
|
作者
Frontera-Pons, J. [1 ]
Sureau, F. [2 ]
Bobin, J. [1 ]
Kilbinger, M. [1 ]
机构
[1] Univ Paris Saclay, AIM, CEA, Univ Paris Cite,CNR, F-91191 Gif Sur Yvette, France
[2] Univ Paris Saclay, BioMaps, CEA, CNRS,Inserm,SHFJ, F-91400 Orsay, France
关键词
Photometry; Distances and redshift; Data analysis; CFHTLENS; COSMOS;
D O I
10.1016/j.ascom.2023.100735
中图分类号
P1 [天文学];
学科分类号
0704 ;
摘要
The quality of photometric redshift estimates is fundamental for cosmological applications, as the constraints on the cosmological parameters obtained by the photometric surveys strongly rely on their precision and accuracy. In order to obtain reliable redshift estimates, a large number of studies have proposed different algorithms based on SED template fitting and machine learning methodologies. While template fitting strategies do not need spectroscopic data for training, they also provide consistent estimates in most scenarios. On the other hand, machine learning-based approaches succeed at building a mapping function between the input space (composed by magnitudes and other physical parameters) and the target photometric redshift space. These empirical methods lead to more accurate results as long as the spectroscopic ground truth properly represents the actual galaxy population distribution. Thus, combining template fitting and machine learning techniques would allow to leverage the advantages of both methodologies yielding hybrid methods. The goal is to provide a good estimation accuracy while maximizing the photometric range coverage. This is particularly important in the high -z regime, where the spectroscopic training sample is rarely available. We propose in this article a new method to derive data-driven templates from simulated spectra using dictionary learning. As these representations are obtained directly from the data, they allow to capture the physical properties of the galaxies better than theoretical templates. Inspired by hybrid algorithms, the dictionary is built hierarchically and the features of the different types of galaxies are separated. Once the dictionary has been created, the data-driven templates are artificially redshifted and the observed spectra are sparsely decomposed on this dictionary. The value providing the minimum reconstruction error is selected as the photometric redshift estimate. This new technique, astride template fitting and machine learning, builds representations for the galaxy spectra through unsupervised learning and computes the photometric redshifts by & chi;2 minimization. The performance of the algorithm has been evaluated on realistic galaxy photometric simulations as well as on real data from the Canada-France-Hawaii Telescope Lensing Survey. & COPY; 2023 Elsevier B.V. All rights reserved.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] Data-Driven Sparse System Identification
    Fattahi, Salar
    Sojoudi, Somayeh
    2018 56TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2018, : 462 - 469
  • [32] Seismic data denoising based on data-driven tight frame dictionary learning method
    ZHENG Jialiang
    WANG Deli
    ZHANG Liang
    Global Geology, 2020, 23 (04) : 241 - 246
  • [33] Machine Learning for Photometric Redshift Estimation of Quasars with Different Samples
    Zhang, Yanxia
    Jin, Xin
    Zhang, Jingyi
    Zhao, Yongheng
    2020 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2020, : 294 - 297
  • [34] Photometric redshift estimation on SDSS data using Random Forests
    Carliles, Samuel
    Budavari, Tamas
    Heinis, Sebastien
    Priebe, Carey
    Szalay, Alexander
    ASTRONOMICAL DATA ANALYSIS SOFTWARE AND SYSTEMS XVII, 2008, 394 : 521 - +
  • [35] Data-Driven Covariance Estimation
    Rogers, John T., II
    Ball, John E.
    Gurbuz, Ali C.
    2022 IEEE INTERNATIONAL SYMPOSIUM ON PHASED ARRAY SYSTEMS & TECHNOLOGY (PAST), 2022,
  • [36] SMALLbox - An Evaluation Framework for Sparse Representations and Dictionary Learning Algorithms
    Damnjanovic, Ivan
    Davies, Matthew E. P.
    Plumbley, Mark D.
    LATENT VARIABLE ANALYSIS AND SIGNAL SEPARATION, 2010, 6365 : 418 - 425
  • [37] Data-Driven Topology Estimation
    Weng, Yang
    Faloutsos, Christos
    Ilic, Marija D.
    2014 IEEE INTERNATIONAL CONFERENCE ON SMART GRID COMMUNICATIONS (SMARTGRIDCOMM), 2014, : 560 - 565
  • [38] Dictionary learning based on sparse representations for resting-state functional MRI data analysis
    Nguyen, Hien M.
    2017 INTERNATIONAL CONFERENCE ON RECENT ADVANCES IN SIGNAL PROCESSING, TELECOMMUNICATIONS & COMPUTING (SIGTELCOM), 2017, : 7 - 10
  • [39] DSFedCon: Dynamic Sparse Federated Contrastive Learning for Data-Driven Intelligent Systems
    Li, Zhengming
    Chen, Jiahui
    Zhang, Peifeng
    Huang, Huiwu
    Li, Guanbin
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (02) : 3343 - 3355
  • [40] Physics constrained learning for data-driven inverse modeling from sparse observations
    Xu, Kailai
    Darve, Eric
    JOURNAL OF COMPUTATIONAL PHYSICS, 2022, 453