Construction of Latent Descriptor Space and Inference Model of Hand-Object Interactions

Cited by: 3
Authors
Matsuo, Tadashi [1]
Shimada, Nobutaka [1]
Affiliation
[1] Ritsumeikan Univ, Coll Informat Sci & Engn, Kusatsu 5258577, Japan
Keywords
feature extraction; unsupervised machine learning; object classification; sparse representation; L(1)
DOI
10.1587/transinf.2016EDP7410
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Appearance-based generic object recognition is a challenging problem because all possible appearances of objects cannot be registered, especially as new objects are produced every day. Object functions, however, have a comparatively small number of prototypes. Therefore, function-based classification of new objects could be a valuable tool for generic object recognition. Object functions are closely related to hand-object interactions during handling of a functional object; i.e., how the hand approaches the object, which parts of the object contact the hand, and the shape of the hand during interaction. Hand-object interactions are therefore helpful for modeling object functions. However, it is difficult to assign discrete labels to interactions because object shapes and grasping hand postures intrinsically vary continuously. To describe these interactions, we propose an interaction descriptor space, which is acquired from unlabeled appearances of human hand-object interactions. By using interaction descriptors, we can numerically describe the relation between an object's appearance and its possible interaction with the hand. The model infers the quantitative state of the interaction from the object image alone. It also identifies the parts of objects designed for hand interactions, such as grips and handles. We demonstrate that the proposed method can generate, without supervision, interaction descriptors that form clusters corresponding to interaction types. We also demonstrate that the model can infer possible hand-object interactions.
Pages: 1350-1359
Number of pages: 10
Related papers
50 in total
  • [31] The effects of explicit and implicit information on modulation of corticospinal excitability during hand-object interactions
    Rens, Guy
    Davare, Marco
    van Polanen, Vonne
    NEUROPSYCHOLOGIA, 2022, 177
  • [32] D-Grasp: Physically Plausible Dynamic Grasp Synthesis for Hand-Object Interactions
    Christen, Sammy
    Kocabas, Muhammed
    Aksan, Emre
    Hwangbo, Jemin
    Song, Jie
    Hilliges, Otmar
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 20545 - 20554
  • [33] Towards Force Sensing from Vision: Observing Hand-Object Interactions to Infer Manipulation Forces
    Tu-Hoa Pham
    Kheddar, Abderrahmane
    Qammaz, Ammar
    Argyros, Antonis A.
    2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 2810 - 2819
  • [34] Fine-Grained Egocentric Hand-Object Segmentation: Dataset, Model, and Applications
    Zhang, Lingzhi
    Zhou, Shenghao
    Stent, Simon
    Shi, Jianbo
    COMPUTER VISION, ECCV 2022, PT XXIX, 2022, 13689 : 127 - 145
  • [35] Semi-Supervised 3D Hand-Object Poses Estimation with Interactions in Time
    Liu, Shaowei
    Jiang, Hanwen
    Xu, Jiarui
    Liu, Sifei
    Wang, Xiaolong
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 14682 - 14692
  • [36] Resolving hand-object occlusion for mixed reality with joint deep learning and model optimization
    Feng, Qi
    Shum, Hubert P. H.
    Morishima, Shigeo
    COMPUTER ANIMATION AND VIRTUAL WORLDS, 2020, 31 (4-5)
  • [37] HandO: a hybrid 3D hand-object reconstruction model for unknown objects
    Yu, Hang
    Cheang, Chilam
    Fu, Yanwei
    Xue, Xiangyang
    MULTIMEDIA SYSTEMS, 2022, 28 (05) : 1845 - 1859
  • [38] HOPE-Net: A Graph-based Model for Hand-Object Pose Estimation
    Doosti, Bardia
    Naha, Shujon
    Mirbagheri, Majid
    Crandall, David J.
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 6607 - 6616
  • [39] Left inferior parietal representations for skilled hand-object interactions: Evidence from stroke and corticobasal degeneration
    Buxbaum, Laurel J.
    Kyle, Kathleen
    Grossman, Murray
    Coslett, H. Branch
    CORTEX, 2007, 43 (03) : 411 - 423
  • [40] H+O: Unified Egocentric Recognition of 3D Hand-Object Poses and Interactions
    Tekin, Bugra
    Bogo, Federica
    Pollefeys, Marc
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 4506 - 4515