Scalable Variational Gaussian Processes for Crowdsourcing: Glitch Detection in LIGO

被引:0
|
作者
Morales-Alvarez, Pablo [1 ]
Ruiz, Pablo [2 ]
Coughlin, Scott [3 ,4 ]
Molina, Rafael [1 ]
Katsaggelos, Aggelos K. [2 ]
机构
[1] Univ Granada, Dept Comp Sci & Artificial Intelligence, Granada 18010, Spain
[2] Northwestern Univ, Dept Elect Engn & Comp Sci, Evanston, IL 60208 USA
[3] Northwestern Univ, Ctr Interdisciplinary Explorat & Res Astrophys CI, Evanston, IL 60208 USA
[4] Cardiff Univ, Dept Phys & Astron, Cardiff CF10 3AT, Wales
基金
美国国家科学基金会;
关键词
Crowdsourcing; Training; Probabilistic logic; Gaussian processes; Machine learning; Uncertainty; Bayes methods; citizen science; laser interferometer gravitational waves observatory; sparse Gaussian processes; scalability; uncertainty quantification; deep learning; CITIZEN SCIENCE; CLASSIFICATION;
D O I
10.1109/TPAMI.2020.3025390
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the last years, crowdsourcing is transforming the way classification training sets are obtained. Instead of relying on a single expert annotator, crowdsourcing shares the labelling effort among a large number of collaborators. For instance, this is being applied in the laureate laser interferometer gravitational waves observatory (LIGO), in order to detect glitches which might hinder the identification of true gravitational-waves. The crowdsourcing scenario poses new challenging difficulties, as it has to deal with different opinions from a heterogeneous group of annotators with unknown degrees of expertise. Probabilistic methods, such as Gaussian processes (GP), have proven successful in modeling this setting. However, GPs do not scale up well to large data sets, which hampers their broad adoption in real-world problems (in particular LIGO). This has led to the very recent introduction of deep learning based crowdsourcing methods, which have become the state-of-the-art for this type of problems. However, the accurate uncertainty quantification provided by GPs has been partially sacrificed. This is an important aspect for astrophysicists in LIGO, since a glitch detection system should provide very accurate probability distributions of its predictions. In this work, we first leverage a standard sparse GP approximation (SVGP) to develop a GP-based crowdsourcing method that factorizes into mini-batches. This makes it able to cope with previously-prohibitive data sets. This first approach, which we refer to as scalable variational Gaussian processes for crowdsourcing (SVGPCR), brings back GP-based methods to a state-of-the-art level, and excels at uncertainty quantification. SVGPCR is shown to outperform deep learning based methods and previous probabilistic ones when applied to the LIGO data. Its behavior and main properties are carefully analyzed in a controlled experiment based on the MNIST data set. Moreover, recent GP inference techniques are also adapted to crowdsourcing and evaluated experimentally.
引用
收藏
页码:1534 / 1551
页数:18
相关论文
共 50 条
  • [41] Scalable Gaussian Processes for Characterizing Multidimensional Change Surfaces
    Herlands, William
    Wilson, Andrew
    Nickisch, Hannes
    Flaxman, Seth
    Neill, Daniel
    van Panhuis, Wilbert
    Xing, Eric
    [J]. ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 51, 2016, 51 : 1013 - 1021
  • [42] Recursive Estimation of Dynamic RSS Fields Based on Crowdsourcing and Gaussian Processes
    Santos, Irene
    Jose Murillo-Fuentes, Juan
    Djuric, Petar M.
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2019, 67 (05) : 1152 - 1162
  • [43] Learning spatial patterns with variational Gaussian processes: Regression
    Goncalves, Italo Gomes
    Guadagnin, Felipe
    Cordova, Diogo Peixoto
    [J]. COMPUTERS & GEOSCIENCES, 2022, 161
  • [44] A variational hardcut EM algorithm for the mixtures of Gaussian processes
    Tao Li
    Jinwen Ma
    [J]. Science China Information Sciences, 2023, 66
  • [45] A variational hardcut EM algorithm for the mixtures of Gaussian processes
    Li, Tao
    Ma, Jinwen
    [J]. SCIENCE CHINA-INFORMATION SCIENCES, 2023, 66 (03)
  • [46] A variational hardcut EM algorithm for the mixtures of Gaussian processes
    Tao LI
    Jinwen MA
    [J]. Science China(Information Sciences), 2023, 66 (03) : 291 - 292
  • [47] A stochastic variational framework for Recurrent Gaussian Processes models
    Mattos, Cesar Lincoln C.
    Barreto, Guilherme A.
    [J]. NEURAL NETWORKS, 2019, 112 : 54 - 72
  • [48] Doubly Stochastic Variational Inference for Deep Gaussian Processes
    Salimbeni, Hugh
    Deisenroth, Marc Peter
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [49] Variational Bayesian Multiple Instance Learning with Gaussian Processes
    Haussmann, Manuel
    Hamprecht, Fred A.
    Kandemir, Melih
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 810 - 819
  • [50] Variational Inference for Gaussian Process Modulated Poisson Processes
    Lloyd, Chris
    Gunter, Tom
    Osborne, Michael A.
    Roberts, Stephen J.
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 37, 2015, 37 : 1814 - 1822