Robot Concept Acquisition Based on Interaction Between Probabilistic and Deep Generative Models

被引:1
|
作者
Kuniyasu, Ryo [1 ]
Nakamura, Tomoaki [1 ]
Taniguchi, Tadahiro [2 ]
Nagai, Takayuki [3 ,4 ]
机构
[1] Univ ElectroCommunicat, Dept Mech Engn & Intelligent Syst, Tokyo, Japan
[2] Ritsumeikan Univ, Coll Informat Sci & Engn, Shiga, Japan
[3] Osaka Univ, Grad Sch Engn Sci, Dept Syst Innovat, Osaka, Japan
[4] Univ ElectroCommunicat, Artificial Intelligence EXplorat Res Ctr, Tokyo, Japan
来源
关键词
concept formation; symbol emergence in robotics; probabilistic generative model; deep generative model; unsupervised learning; representation learning; cross-modal inference;
D O I
10.3389/fcomp.2021.618069
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
We propose a method for multimodal concept formation. In this method, unsupervised multimodal clustering and cross-modal inference, as well as unsupervised representation learning, can be performed by integrating the multimodal latent Dirichlet allocation (MLDA)-based concept formation and variational autoencoder (VAE)-based feature extraction. Multimodal clustering, representation learning, and cross-modal inference are critical for robots to form multimodal concepts from sensory data. Various models have been proposed for concept formation. However, in previous studies, features were extracted using manually designed or pre-trained feature extractors and representation learning was not performed simultaneously. Moreover, the generative probabilities of the features extracted from the sensory data could be predicted, but the sensory data could not be predicted in the cross-modal inference. Therefore, a method that can perform clustering, feature learning, and cross-modal inference among multimodal sensory data is required for concept formation. To realize such a method, we extend the VAE to the multinomial VAE (MNVAE), the latent variables of which follow a multinomial distribution, and construct a model that integrates the MNVAE and MLDA. In the experiments, the multimodal information of the images and words acquired by a robot was classified using the integrated model. The results demonstrated that the integrated model can classify the multimodal information as accurately as the previous model despite the feature extractor learning in an unsupervised manner, suitable image features for clustering can be learned, and cross-modal inference from the words to images is possible.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] On Memorization in Probabilistic Deep Generative Models
    van den Burg, Gerrit J. J.
    Williams, Christopher K. I.
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [2] Probabilistic forecasting using deep generative models
    Alessandro Fanfarillo
    Behrooz Roozitalab
    Weiming Hu
    Guido Cervone
    [J]. GeoInformatica, 2021, 25 : 127 - 147
  • [3] Probabilistic forecasting using deep generative models
    Fanfarillo, Alessandro
    Roozitalab, Behrooz
    Hu, Weiming
    Cervone, Guido
    [J]. GEOINFORMATICA, 2021, 25 (01) : 127 - 147
  • [4] Probabilistic Typology: Deep Generative Models of Vowel Inventories
    Cotterell, Ryan
    Eisner, Jason
    [J]. PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017, : 1182 - 1192
  • [5] Human-Robot Interaction in Concept Acquisition: a computational model
    de Greeff, Joachim
    Delaunay, Frederic
    Belpaeme, Tony
    [J]. 2009 IEEE 8TH INTERNATIONAL CONFERENCE ON DEVELOPMENT AND LEARNING, 2009, : 168 - 173
  • [6] Problems using deep generative models for probabilistic audio source separation
    Frank, Maurice
    Ilse, Maximilian
    [J]. NEURIPS WORKSHOPS, 2020, 2020, 137 : 53 - 59
  • [7] Toward Discriminating and Synthesizing Motion Traces Using Deep Probabilistic Generative Models
    Zhou, Fan
    Liu, Xin
    Zhang, Kunpeng
    Trajcevski, Goce
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (06) : 2401 - 2414
  • [8] Coupling Deep Discriminative and Generative Models for Reactive Robot Planning in Human-Robot Collaboration
    Oshin, Olusegun
    Bernal, Edgar A.
    Nair, Binu M.
    Ding, Jerry
    Varma, Richa
    Osborne, Richard W.
    Tunstel, Eddie
    Stramandinoli, Francesca
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), 2019, : 1869 - 1874
  • [9] Musical Similarity and Commonness Estimation based on Probabilistic Generative Models
    Nakano, Tomoyasu
    Yoshii, Kazuyoshi
    Goto, Masataka
    [J]. 2015 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2015, : 197 - 204
  • [10] Probabilistic models for robot-based object segmentation
    Beale, Daniel
    Iravani, Pejman
    Hall, Peter
    [J]. ROBOTICS AND AUTONOMOUS SYSTEMS, 2011, 59 (12) : 1080 - 1089