Data-driven templates with dictionary learning and sparse representations for photometric redshift estimation

Authors
Frontera-Pons, J. [1]
Sureau, F. [2]
Bobin, J. [1]
Kilbinger, M. [1]
Affiliations
[1] Univ Paris Saclay, AIM, CEA, Univ Paris Cite, CNR, F-91191 Gif Sur Yvette, France
[2] Univ Paris Saclay, BioMaps, CEA, CNRS, Inserm, SHFJ, F-91400 Orsay, France
Keywords
Photometry; Distances and redshift; Data analysis; CFHTLENS; COSMOS
DOI
10.1016/j.ascom.2023.100735
Chinese Library Classification
P1 [Astronomy]
Discipline classification code
0704
Abstract
The quality of photometric redshift estimates is fundamental for cosmological applications, because the constraints on cosmological parameters obtained from photometric surveys rely strongly on their precision and accuracy. To obtain reliable redshift estimates, a large number of studies have proposed algorithms based on SED template fitting and on machine learning. Template fitting strategies do not need spectroscopic data for training and provide consistent estimates in most scenarios. Machine learning-based approaches, on the other hand, build a mapping function between the input space (composed of magnitudes and other physical parameters) and the target photometric redshift space. These empirical methods lead to more accurate results as long as the spectroscopic ground truth properly represents the actual galaxy population. Combining template fitting and machine learning techniques in hybrid methods would therefore make it possible to leverage the advantages of both. The goal is to provide good estimation accuracy while maximizing the photometric range coverage. This is particularly important in the high-z regime, where a spectroscopic training sample is rarely available. In this article we propose a new method to derive data-driven templates from simulated spectra using dictionary learning. Because these representations are obtained directly from the data, they capture the physical properties of the galaxies better than theoretical templates. Inspired by hybrid algorithms, the dictionary is built hierarchically and the features of the different types of galaxies are separated. Once the dictionary has been created, the data-driven templates are artificially redshifted and the observed spectra are sparsely decomposed on this dictionary. The redshift value providing the minimum reconstruction error is selected as the photometric redshift estimate. This new technique, which straddles template fitting and machine learning, builds representations for the galaxy spectra through unsupervised learning and computes the photometric redshifts by χ² minimization. The performance of the algorithm has been evaluated on realistic galaxy photometric simulations as well as on real data from the Canada-France-Hawaii Telescope Lensing Survey.
© 2023 Elsevier B.V. All rights reserved.
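The estimation step described in the abstract (redshift the templates, sparsely decompose the observation, keep the redshift with the smallest reconstruction error) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and not the authors' implementation: the three hand-made toy templates stand in for a learned dictionary, the sparse solver is a plain orthogonal matching pursuit (the abstract does not name one), the helper names `redshift_templates`, `omp` and `estimate_redshift` are hypothetical, and filter integration for broadband photometry is omitted.

```python
import numpy as np

def redshift_templates(rest_wave, templates, obs_wave, z):
    """Move rest-frame templates to redshift z and resample them on obs_wave."""
    shifted_wave = rest_wave * (1.0 + z)  # lambda_obs = lambda_rest * (1 + z)
    columns = [np.interp(obs_wave, shifted_wave, t, left=0.0, right=0.0)
               for t in templates]
    return np.stack(columns, axis=1)  # shape (n_obs, n_templates)

def omp(D, y, n_nonzero):
    """Orthogonal matching pursuit (assumed solver): sparse code of y on D."""
    norms = np.linalg.norm(D, axis=0)
    norms[norms == 0.0] = 1.0  # guard against empty columns
    residual = y.copy()
    support = []
    for _ in range(n_nonzero):
        corr = np.abs(D.T @ residual) / norms
        if support:
            corr[support] = -1.0  # never pick the same atom twice
        support.append(int(np.argmax(corr)))
        # Refit all selected atoms jointly by least squares.
        sol, *_ = np.linalg.lstsq(D[:, support], y, rcond=None)
        residual = y - D[:, support] @ sol
    coef = np.zeros(D.shape[1])
    coef[support] = sol
    return coef, residual

def estimate_redshift(rest_wave, templates, obs_wave, flux, sigma, z_grid,
                      n_nonzero=2):
    """Return the grid redshift with the smallest chi^2, and the chi^2 curve."""
    chi2 = np.empty(len(z_grid))
    for i, z in enumerate(z_grid):
        # Dividing by sigma turns the squared residual into a chi^2.
        D = redshift_templates(rest_wave, templates, obs_wave, z) / sigma[:, None]
        _, residual = omp(D, flux / sigma, n_nonzero)
        chi2[i] = residual @ residual
    return z_grid[int(np.argmin(chi2))], chi2

# Toy rest-frame templates (hypothetical, not a learned dictionary).
rest_wave = np.linspace(1000.0, 12000.0, 4000)

def line(center, width=15.0):
    return np.exp(-0.5 * ((rest_wave - center) / width) ** 2)

templates = np.stack([
    (rest_wave / 5000.0) ** 1.5 * np.where(rest_wave < 4000.0, 0.4, 1.0),
    (rest_wave / 5000.0) ** -1.0 + 5.0 * line(3727.0) + 5.0 * line(5007.0),
    np.ones_like(rest_wave) + 3.0 * line(6563.0),
])

# Noise-free synthetic observation built from the second template at z = 0.6.
obs_wave = np.linspace(3500.0, 9500.0, 300)
sigma = np.full(obs_wave.size, 0.05)
z_true = 0.6
flux = 2.0 * redshift_templates(rest_wave, templates, obs_wave, z_true)[:, 1]

z_grid = np.linspace(0.0, 1.5, 151)
z_hat, chi2 = estimate_redshift(rest_wave, templates, obs_wave, flux, sigma, z_grid)
print(f"estimated z = {z_hat:.2f}")
```

The grid search keeps the example transparent: each trial redshift gets its own sparse fit, and the χ² curve can be inspected for secondary minima. With real data the templates would come from dictionary learning and the fluxes would be noisy, so the minimum would be less sharp than in this noise-free toy case.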
Pages: 12