Data-driven templates with dictionary learning and sparse representations for photometric redshift estimation

被引：0

作者：

Frontera-Pons, J. ^{[1
]}

Sureau, F. ^{[2
]}

Bobin, J. ^{[1
]}

Kilbinger, M. ^{[1
]}

机构：

[1] Univ Paris Saclay, AIM, CEA, Univ Paris Cite,CNR, F-91191 Gif Sur Yvette, France

[2] Univ Paris Saclay, BioMaps, CEA, CNRS,Inserm,SHFJ, F-91400 Orsay, France

来源：

ASTRONOMY AND COMPUTING | 2023年 / 44卷

关键词：

Photometry; Distances and redshift; Data analysis; CFHTLENS; COSMOS;

D O I：

10.1016/j.ascom.2023.100735

中图分类号：

P1 [天文学];

学科分类号：

0704 ;

摘要：

The quality of photometric redshift estimates is fundamental for cosmological applications, as the constraints on the cosmological parameters obtained by the photometric surveys strongly rely on their precision and accuracy. In order to obtain reliable redshift estimates, a large number of studies have proposed different algorithms based on SED template fitting and machine learning methodologies. While template fitting strategies do not need spectroscopic data for training, they also provide consistent estimates in most scenarios. On the other hand, machine learning-based approaches succeed at building a mapping function between the input space (composed by magnitudes and other physical parameters) and the target photometric redshift space. These empirical methods lead to more accurate results as long as the spectroscopic ground truth properly represents the actual galaxy population distribution. Thus, combining template fitting and machine learning techniques would allow to leverage the advantages of both methodologies yielding hybrid methods. The goal is to provide a good estimation accuracy while maximizing the photometric range coverage. This is particularly important in the high -z regime, where the spectroscopic training sample is rarely available. We propose in this article a new method to derive data-driven templates from simulated spectra using dictionary learning. As these representations are obtained directly from the data, they allow to capture the physical properties of the galaxies better than theoretical templates. Inspired by hybrid algorithms, the dictionary is built hierarchically and the features of the different types of galaxies are separated. Once the dictionary has been created, the data-driven templates are artificially redshifted and the observed spectra are sparsely decomposed on this dictionary. The value providing the minimum reconstruction error is selected as the photometric redshift estimate. This new technique, astride template fitting and machine learning, builds representations for the galaxy spectra through unsupervised learning and computes the photometric redshifts by & chi;2 minimization. The performance of the algorithm has been evaluated on realistic galaxy photometric simulations as well as on real data from the Canada-France-Hawaii Telescope Lensing Survey. & COPY; 2023 Elsevier B.V. All rights reserved.

引用

页数：12

共 50 条

[41] Data-driven geotechnical site recognition using machine learning and sparse representation
Guan, Zheng
Wang, Yu
Phoon, Kok-Kwang
ENGINEERING GEOLOGY, 2025, 346
[42] Data-Driven Site Characterization for Benchmark Examples Using Sparse Bayesian Learning
Ching, Jianye
GEO-RISK 2023: INNOVATION IN DATA AND ANALYSIS METHODS, 2023, 345 : 438 - 445
[43] Data-Driven Compressive Sampling and Learning Sparse Coding for Hyperspectral Image Classification
Yang, Shuyuan
Jin, HongHong
Wang, Min
Ren, Yu
Jiao, Licheng
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2014, 11 (02) : 479 - 483
[44] Data-Driven Design of Biselective Templates for Intergrowth Zeolites
Schwalbe-Koda, Daniel
Corma, Avelino
Roman-Leshkov, Yuriy
Moliner, Manuel
Gomez-Bombarelli, Rafael
JOURNAL OF PHYSICAL CHEMISTRY LETTERS, 2021, 12 (43): : 10689 - 10694
[45] Learning continuous and data-driven molecular descriptors by translating equivalent chemical representations
Winter, Robin
Montanari, Floriane
Noe, Frank
Clevert, Djork-Arne
CHEMICAL SCIENCE, 2019, 10 (06) : 1692 - 1701
[46] Data-driven learning of 3-point correlation functions as microstructure representations
Cheng, Sheng
Jiao, Yang
Ren, Yi
ACTA MATERIALIA, 2022, 229
[47] Data-driven sparse partial least squares
Lorenzo, Hadrien
Cloarec, Olivier
Thiebaut, Rodolphe
Saracco, Jerome
STATISTICAL ANALYSIS AND DATA MINING, 2022, 15 (02) : 264 - 282
[48] Data-Driven Human Modeling by Sparse Representation
Wu, Yiu-Bun
Liu, Bin
Liu, Xiuping
Wang, Charlie C. L.
COMPUTER-AIDED DESIGN, 2020, 128
[49] Direct data-driven design of sparse controllers
Formentin, Simone
Karimi, Alireza
2013 AMERICAN CONTROL CONFERENCE (ACC), 2013, : 3099 - 3104
[50] A novel weighted sparse classification framework with extended discriminative dictionary for data-driven bearing fault diagnosis
Cui, Lingli
Jiang, Zhichao
Liu, Dongdong
Zhen, Dong
MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2025, 222

← 1 2 3 4 5 →