Learning shared embedding representation of motion and text using contrastive learning

被引:0
|
作者
Junpei Horie
Wataru Noguchi
Hiroyuki Iizuka
Masahito Yamamoto
机构
[1] Hokkaido University,Graduate School of Information Science and Technology
[2] Hokkaido University,Faculty of Information Science and Technology
[3] Hokkaido University,Center for Human Nature, Artificial Intelligence, and Neuroscience
来源
关键词
Multi-modal learning; Contrastive learning; Skeleton-based action recognition; Motion retrieval;
D O I
暂无
中图分类号
学科分类号
摘要
Multimodal learning of motion and text tries to find the correspondence between skeletal time-series data acquired by motion capture and the text that describes the motion. In this field, good associations can realize both motion-to-text and text-to-motion applications. However, the previous methods failed to associate motion with text, taking into account details of descriptions, for example, whether to move the left or right arm. In this paper, we propose a motion-text contrastive learning method for making correspondences between motion and text in a shared embedding space. We showed that our model outperforms the previous studies in the task of action recognition. We also qualitatively show that, by using a pre-trained text encoder, our model can perform motion retrieval with detailed correspondences between motion and text.
引用
收藏
页码:148 / 157
页数:9
相关论文
共 50 条
  • [31] CONHyperKGE: Using Contrastive Learning in Hyperbolic Space for Knowledge Graph Embedding
    Gao, Mandeng
    Tian, Shengwei
    Yu, Long
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2024, 38 (04)
  • [32] Contrastive Coincidental Correctness Representation Learning
    Li, Maojin
    Lei, Yan
    Xie, Huan
    Wang, Jiaguo
    Liu, Chunyan
    Deng, Zhengxiong
    2023 IEEE 34TH INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING, ISSRE, 2023, : 252 - 263
  • [33] Learning Contrastive Representation for Semantic Correspondence
    Xiao, Taihong
    Liu, Sifei
    De Mello, Shalini
    Yu, Zhiding
    Kautz, Jan
    Yang, Ming-Hsuan
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2022, 130 (05) : 1293 - 1309
  • [34] Kalman contrastive unsupervised representation learning
    Yekta, Mohammad Mahdi Jahani
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [35] Partitioning Image Representation in Contrastive Learning
    Lee, Hyunsub
    Choi, Heeyoul
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 2864 - 2870
  • [36] Representation-enhanced APT Detection Using Contrastive Learning
    Zhou, Fengxi
    Chang, Baoming
    Wen, Yu
    Meng, Dan
    2023 IEEE 22ND INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS, TRUSTCOM, BIGDATASE, CSE, EUC, ISCI 2023, 2024, : 1 - 9
  • [37] Spatiotemporal Contrastive Video Representation Learning
    Qian, Rui
    Meng, Tianjian
    Gong, Boqing
    Yang, Ming-Hsuan
    Wang, Huisheng
    Belongie, Serge
    Cui, Yin
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 6960 - 6970
  • [38] Geometric Multimodal Contrastive Representation Learning
    Poklukar, Petra
    Vasco, Miguel
    Yin, Hang
    Melo, Francisco S.
    Paiva, Ana
    Kragic, Danica
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [39] Contrastive Representation Learning for Gaze Estimation
    Jindal, Swati
    Manduchi, Roberto
    GAZE MEETS MACHINE LEARNING WORKSHOP, VOL 210, 2022, 210 : 37 - +
  • [40] Contrastive Representation Learning for Electroencephalogram Classification
    Mohsenvand, Mostafa 'Neo'
    Izadi, Mohammad Rasool
    Maes, Pattie
    MACHINE LEARNING FOR HEALTH, VOL 136, 2020, 136 : 238 - 253