Hierarchical Gaussian Mixture based Task Generative Model for Robust Meta-Learning

被引：0

作者：

Zhang, Yizhou ^{[1
]}

Ni, Jingchao ^{[2
]}

Cheng, Wei ^{[3
]}

Chen, Zhengzhang ^{[3
]}

Tong, Liang ^{[4
]}

Chen, Haifeng ^{[3
]}

Liu, Yan ^{[1
]}

机构：

[1] Univ Southern Calif, Los Angeles, CA 90007 USA

[2] AWS AI Labs, Seattle, WA USA

[3] NEC Labs Amer, Irving, TX USA

[4] Stellar Cyber Inc, Seoul, South Korea

来源：

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023) | 2023年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Meta-learning enables quick adaptation of machine learning models to new tasks with limited data. While tasks could come from varying distributions in reality, most of the existing meta-learning methods consider both training and testing tasks as from the same uni-component distribution, overlooking two critical needs of a practical solution: (1) the various sources of tasks may compose a multi-component mixture distribution, and (2) novel tasks may come from a distribution that is unseen during meta-training. In this paper, we demonstrate these two challenges can be solved jointly by modeling the density of task instances. We develop a metatraining framework underlain by a novel Hierarchical Gaussian Mixture based Task Generative Model (HTGM). HTGM extends the widely used empirical process of sampling tasks to a theoretical model, which learns task embeddings, fits the mixture distribution of tasks, and enables density-based scoring of novel tasks. The framework is agnostic to the encoder and scales well with large backbone networks. The model parameters are learned end-to-end by maximum likelihood estimation via an Expectation-Maximization (EM) algorithm. Extensive experiments on benchmark datasets indicate the effectiveness of our method for both sample classification and novel task detection.

引用

页数：24

共 50 条

[21] Regularizing Neural Networks with Meta-Learning Generative Models
Yamaguchi, Shin'ya
Chijiwa, Daiki
Kanai, Sekitoshi
Kumagai, Atsutoshi
Kashima, Hisashi
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[22] A Voice Morphing Model Based on the Gaussian Mixture Model and Generative Topographic Mapping
Rassam, Murad A.
Almekhlafi, Rasha
Alosaily, Eman
Hassan, Haneen
Hassan, Reem
Saeed, Eman
Alqershi, Elham
EMERGING TRENDS IN INTELLIGENT COMPUTING AND INFORMATICS: DATA SCIENCE, INTELLIGENT INFORMATION SYSTEMS AND SMART COMPUTING, 2020, 1073 : 396 - 406
[23] Hierarchical Bayes based Adaptive Sparsity in Gaussian Mixture Model
Wang, Binghui
Lin, Chuang
Fan, Xin
Jiang, Ning
Farina, Dario
PATTERN RECOGNITION LETTERS, 2014, 49 : 238 - 247
[24] Incremental Learning of Skills in a Task-Parameterized Gaussian Mixture Model
Hoyos, Jose
Prieto, Flavio
Alenya, Guillem
Torras, Carme
JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2016, 82 (01) : 81 - 99
[25] Incremental Learning of Skills in a Task-Parameterized Gaussian Mixture Model
Jose Hoyos
Flavio Prieto
Guillem Alenyà
Carme Torras
Journal of Intelligent & Robotic Systems, 2016, 82 : 81 - 99
[26] TASK2VEC: Task Embedding for Meta-Learning
Achille, Alessandro
Lam, Michael
Tewari, Rahul
Ravichandran, Avinash
Maji, Subhransu
Fowlkes, Charless
Soatto, Stefano
Perona, Pietro
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 6439 - 6448
[27] Set-based Meta-Interpolation for Few-Task Meta-Learning
Lee, Seanie
Andreis, Bruno
Kawaguchi, Kenji
Lee, Juho
Hwang, Sung Ju
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
[28] Learning Task-Distribution Reward Shaping with Meta-Learning
Zou, Haosheng
Ren, Tongzheng
Yan, Dong
Su, Hang
Zhu, Jun
THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 11210 - 11218
[29] Task Agnostic Meta-Learning for Few-Shot Learning
Jamal, Muhammad Abdullah
Qi, Guo-Jun
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 11711 - 11719
[30] Incremental Object Classification Using Hierarchical Generative Gaussian Mixture and Topology Based Feature Representation
Jeong, Sungmoon
Lee, Minho
2011 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2011, : 925 - 932

← 1 2 3 4 5 →