Fast and Scalable Gaussian Process Modeling with Applications to Astronomical Time Series

被引:600
|
作者
Foreman-Mackey, Daniel [1 ,2 ]
Agol, Eric [1 ]
Ambikasaran, Sivaram [3 ]
Angus, Ruth [4 ]
机构
[1] Univ Washington, Dept Astron, Seattle, WA 98195 USA
[2] Flatiron Inst, Ctr Computat Astrophys, 162 5th Ave,6th Floor, New York, NY 10010 USA
[3] Indian Inst Sci, Dept Computat & Data Sci, Bangalore, Karnataka, India
[4] Columbia Univ, Dept Astron, 550 W 120th St, New York, NY 10027 USA
来源
ASTRONOMICAL JOURNAL | 2017年 / 154卷 / 06期
基金
美国国家科学基金会;
关键词
asteroseismology; methods: data analysis; methods: statistical; planetary systems; stars: rotation; PROCESS FRAMEWORK; LIGHT CURVES; KEPLER; VARIABILITY; EFFICIENT; ASTEROSEISMOLOGY; OSCILLATIONS; ALGORITHM; BINARIES; SPECTRA;
D O I
10.3847/1538-3881/aa9332
中图分类号
P1 [天文学];
学科分类号
0704 ;
摘要
The growing field of large-scale time domain astronomy requires methods for probabilistic data analysis that are computationally tractable, even with large data sets. Gaussian processes (GPs) are a popular class of models used for this purpose, but since the computational cost scales, in general, as the cube of the number of data points, their application has been limited to small data sets. In this paper, we present a novel method for GPs modeling in one dimension where the computational requirements scale linearly with the size of the data set. We demonstrate the method by applying it to simulated and real astronomical time series data sets. These demonstrations are examples of probabilistic inference of stellar rotation periods, asteroseismic oscillation spectra, and transiting planet parameters. The method exploits structure in the problem when the covariance function is expressed as a mixture of complex exponentials, without requiring evenly spaced observations or uniform noise. This form of covariance arises naturally when the process is a mixture of stochastically driven damped harmonic oscillators-providing a physical motivation for and interpretation of this choice-but we also demonstrate that it can be a useful effective model in some other cases. We present a mathematical description of the method and compare it to existing scalable GP methods. The method is fast and interpretable, with a range of potential applications within astronomical data analysis and beyond. We provide well-tested and documented open-source implementations of this method in C++, Python, and Julia.
引用
收藏
页数:21
相关论文
共 50 条
  • [1] Gaussian Process Regression for Astronomical Time Series
    Aigrain, Suzanne
    Foreman-Mackey, Daniel
    [J]. ANNUAL REVIEW OF ASTRONOMY AND ASTROPHYSICS, 2023, 61 : 329 - 371
  • [2] TOWARDS SCALABLE GAUSSIAN PROCESS MODELING
    Pandita, Piyush
    Kristensen, Jesper
    Wang, Liping
    [J]. PROCEEDINGS OF THE ASME INTERNATIONAL DESIGN ENGINEERING TECHNICAL CONFERENCES AND COMPUTERS AND INFORMATION IN ENGINEERING CONFERENCE, 2019, VOL 2B, 2020,
  • [3] Modeling of Multivariate Time Series Using Variable Selection and Gaussian Process
    Ren Weijie
    Han Min
    [J]. 2014 33RD CHINESE CONTROL CONFERENCE (CCC), 2014, : 5071 - 5074
  • [4] Robust and Scalable Gaussian Process Regression and Its Applications
    Lu, Yifan
    Ma, Jiayi
    Fang, Leyuan
    Tian, Xin
    Jiang, Junjun
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 21950 - 21959
  • [5] A scalable end-to-end Gaussian process adapter for irregularly sampled time series classification
    Li, Steven Cheng-Xian
    Marlin, Benjamin
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
  • [6] Modeling astronomical time series with stochastic differential equations
    Vio, R.
    Andreani, P.
    Madsen, H.
    Mannella, R.
    Rebusco, P.
    [J]. STATISTICAL CHALLENGES IN MODERN ASTRONOMY IV, 2007, 371 : 439 - +
  • [7] Sailfish: A Fast Bayesian Change Point Detection Framework with Gaussian Process for Time Series
    Du, Haizhou
    Zheng, Yang
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT III, 2022, 13531 : 740 - 751
  • [8] Techniques for Fast and Scalable Time Series Traffic Generation
    Kuang, Jilong
    Waddington, Daniel G.
    Lin, Changhui
    [J]. PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2015, : 105 - 114
  • [9] Scalable Fully Bayesian Gaussian Process Modeling and Calibration With Adaptive Sequential Monte Carlo for Industrial Applications
    Pandita, Piyush
    Tsilifis, Panagiotis
    Ghosh, Sayan
    Wang, Liping
    [J]. JOURNAL OF MECHANICAL DESIGN, 2021, 143 (07)
  • [10] Gaussian process for nonstationary time series prediction
    Brahim-Belhouari, S
    Bermak, A
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2004, 47 (04) : 705 - 712