Hierarchical Gaussian Mixture based Task Generative Model for Robust Meta-Learning

被引:0
|
作者
Zhang, Yizhou [1 ]
Ni, Jingchao [2 ]
Cheng, Wei [3 ]
Chen, Zhengzhang [3 ]
Tong, Liang [4 ]
Chen, Haifeng [3 ]
Liu, Yan [1 ]
机构
[1] Univ Southern Calif, Los Angeles, CA 90007 USA
[2] AWS AI Labs, Seattle, WA USA
[3] NEC Labs Amer, Irving, TX USA
[4] Stellar Cyber Inc, Seoul, South Korea
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Meta-learning enables quick adaptation of machine learning models to new tasks with limited data. While tasks could come from varying distributions in reality, most of the existing meta-learning methods consider both training and testing tasks as from the same uni-component distribution, overlooking two critical needs of a practical solution: (1) the various sources of tasks may compose a multi-component mixture distribution, and (2) novel tasks may come from a distribution that is unseen during meta-training. In this paper, we demonstrate these two challenges can be solved jointly by modeling the density of task instances. We develop a metatraining framework underlain by a novel Hierarchical Gaussian Mixture based Task Generative Model (HTGM). HTGM extends the widely used empirical process of sampling tasks to a theoretical model, which learns task embeddings, fits the mixture distribution of tasks, and enables density-based scoring of novel tasks. The framework is agnostic to the encoder and scales well with large backbone networks. The model parameters are learned end-to-end by maximum likelihood estimation via an Expectation-Maximization (EM) algorithm. Extensive experiments on benchmark datasets indicate the effectiveness of our method for both sample classification and novel task detection.
引用
收藏
页数:24
相关论文
共 50 条
  • [21] Regularizing Neural Networks with Meta-Learning Generative Models
    Yamaguchi, Shin'ya
    Chijiwa, Daiki
    Kanai, Sekitoshi
    Kumagai, Atsutoshi
    Kashima, Hisashi
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [22] A Voice Morphing Model Based on the Gaussian Mixture Model and Generative Topographic Mapping
    Rassam, Murad A.
    Almekhlafi, Rasha
    Alosaily, Eman
    Hassan, Haneen
    Hassan, Reem
    Saeed, Eman
    Alqershi, Elham
    EMERGING TRENDS IN INTELLIGENT COMPUTING AND INFORMATICS: DATA SCIENCE, INTELLIGENT INFORMATION SYSTEMS AND SMART COMPUTING, 2020, 1073 : 396 - 406
  • [23] Hierarchical Bayes based Adaptive Sparsity in Gaussian Mixture Model
    Wang, Binghui
    Lin, Chuang
    Fan, Xin
    Jiang, Ning
    Farina, Dario
    PATTERN RECOGNITION LETTERS, 2014, 49 : 238 - 247
  • [24] Incremental Learning of Skills in a Task-Parameterized Gaussian Mixture Model
    Hoyos, Jose
    Prieto, Flavio
    Alenya, Guillem
    Torras, Carme
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2016, 82 (01) : 81 - 99
  • [25] Incremental Learning of Skills in a Task-Parameterized Gaussian Mixture Model
    Jose Hoyos
    Flavio Prieto
    Guillem Alenyà
    Carme Torras
    Journal of Intelligent & Robotic Systems, 2016, 82 : 81 - 99
  • [26] TASK2VEC: Task Embedding for Meta-Learning
    Achille, Alessandro
    Lam, Michael
    Tewari, Rahul
    Ravichandran, Avinash
    Maji, Subhransu
    Fowlkes, Charless
    Soatto, Stefano
    Perona, Pietro
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 6439 - 6448
  • [27] Set-based Meta-Interpolation for Few-Task Meta-Learning
    Lee, Seanie
    Andreis, Bruno
    Kawaguchi, Kenji
    Lee, Juho
    Hwang, Sung Ju
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [28] Learning Task-Distribution Reward Shaping with Meta-Learning
    Zou, Haosheng
    Ren, Tongzheng
    Yan, Dong
    Su, Hang
    Zhu, Jun
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 11210 - 11218
  • [29] Task Agnostic Meta-Learning for Few-Shot Learning
    Jamal, Muhammad Abdullah
    Qi, Guo-Jun
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 11711 - 11719
  • [30] Incremental Object Classification Using Hierarchical Generative Gaussian Mixture and Topology Based Feature Representation
    Jeong, Sungmoon
    Lee, Minho
    2011 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2011, : 925 - 932