Multimodal Integration Learning of Object Manipulation Behaviors using Deep Neural Networks

被引:0
|
作者
Noda, Kuniaki [1 ]
Arie, Hiroaki [1 ]
Suga, Yuki [1 ]
Ogata, Testuya [1 ]
机构
[1] Waseda Univ, Grad Sch Fundamental Sci & Engn, Tokyo, Japan
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a novel computational approach for modeling and generating multiple object manipulation behaviors by a humanoid robot. The contribution of this paper is that deep learning methods are applied not only for multimodal sensor fusion but also for sensory-motor coordination. More specifically, a time-delay deep neural network is applied for modeling multiple behavior patterns represented with multi-dimensional visuomotor temporal sequences. By using the efficient training performance of Hessian-free optimization, the proposed mechanism successfully models six different object manipulation behaviors in a single network. The generalization capability of the learning mechanism enables the acquired model to perform the functions of cross-modal memory retrieval and temporal sequence prediction. The experimental results show that the motion patterns for object manipulation behaviors are successfully generated from the corresponding image sequence, and vice versa. Moreover, the temporal sequence prediction enables the robot to interactively switch multiple behaviors in accordance with changes in the displayed objects.
引用
收藏
页码:1728 / 1733
页数:6
相关论文
共 50 条
  • [1] Multimodal integration learning of robot behavior using deep neural networks
    Noda, Kuniaki
    Arie, Hiroaki
    Suga, Yuki
    Ogata, Tetsuya
    [J]. ROBOTICS AND AUTONOMOUS SYSTEMS, 2014, 62 (06) : 721 - 736
  • [2] 3-D Deformable Object Manipulation Using Deep Neural Networks
    Hu, Zhe
    Han, Tao
    Sun, Peigen
    Pan, Jia
    Manocha, Dinesh
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2019, 4 (04) : 4255 - 4261
  • [3] Multimodal Deep Neural Networks Based Ensemble Learning for X-Ray Object Recognition
    Kong, Quan
    Akira, Naoto
    Tong, Bin
    Watanabe, Yuki
    Matsubara, Daisuke
    Murakami, Tomokazu
    [J]. COMPUTER VISION - ACCV 2018 WORKSHOPS, 2019, 11367 : 523 - 538
  • [4] Object Detection Using Deep Neural Networks
    Shah, Malay
    Kapdi, Rupal
    [J]. 2017 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICICCS), 2017, : 787 - 790
  • [5] Multimodal data integration for oncology in the era of deep neural networks: a review
    Waqas, Asim
    Tripathi, Aakash
    Ramachandran, Ravi P.
    Stewart, Paul A.
    Rasool, Ghulam
    [J]. FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2024, 7
  • [6] Multimodal Emotion Recognition Using Deep Neural Networks
    Tang, Hao
    Liu, Wei
    Zheng, Wei-Long
    Lu, Bao-Liang
    [J]. NEURAL INFORMATION PROCESSING (ICONIP 2017), PT IV, 2017, 10637 : 811 - 819
  • [7] Multimodal Sentiment Analysis Using Deep Neural Networks
    Abburi, Harika
    Prasath, Rajendra
    Shrivastava, Manish
    Gangashetty, Suryakanth V.
    [J]. MINING INTELLIGENCE AND KNOWLEDGE EXPLORATION (MIKE 2016), 2017, 10089 : 58 - 65
  • [8] RGB-D OBJECT RECOGNITION WITH MULTIMODAL DEEP CONVOLUTIONAL NEURAL NETWORKS
    Rahman, Mohammad Muntasir
    Tan, Yanhao
    Xue, Jian
    Lu, Ke
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2017, : 991 - 996
  • [9] Transfer Learning for Object Detection using State-of-the-Art Deep Neural Networks
    Talukdar, J.
    Gupta, S.
    Rajpura, P. S.
    Hegde, R. S.
    [J]. 2018 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND INTEGRATED NETWORKS (SPIN), 2018, : 78 - 83
  • [10] The Effectiveness of Using a Pretrained Deep Learning Neural Networks for Object Classification in Underwater Video
    Szymak, Piotr
    Piskur, Pawel
    Naus, Krzysztof
    [J]. REMOTE SENSING, 2020, 12 (18)