Learning shared embedding representation of motion and text using contrastive learning

被引:0
|
作者
Junpei Horie
Wataru Noguchi
Hiroyuki Iizuka
Masahito Yamamoto
机构
[1] Hokkaido University,Graduate School of Information Science and Technology
[2] Hokkaido University,Faculty of Information Science and Technology
[3] Hokkaido University,Center for Human Nature, Artificial Intelligence, and Neuroscience
来源
关键词
Multi-modal learning; Contrastive learning; Skeleton-based action recognition; Motion retrieval;
D O I
暂无
中图分类号
学科分类号
摘要
Multimodal learning of motion and text tries to find the correspondence between skeletal time-series data acquired by motion capture and the text that describes the motion. In this field, good associations can realize both motion-to-text and text-to-motion applications. However, the previous methods failed to associate motion with text, taking into account details of descriptions, for example, whether to move the left or right arm. In this paper, we propose a motion-text contrastive learning method for making correspondences between motion and text in a shared embedding space. We showed that our model outperforms the previous studies in the task of action recognition. We also qualitatively show that, by using a pre-trained text encoder, our model can perform motion retrieval with detailed correspondences between motion and text.
引用
收藏
页码:148 / 157
页数:9
相关论文
共 50 条
  • [21] Table representation learning using heterogeneous graph embedding
    Tchuitcheu, Willy Carlos
    Lu, Tan
    Dooms, Ann
    PATTERN RECOGNITION, 2024, 156
  • [22] LoCo: Local Contrastive Representation Learning
    Xiong, Yuwen
    Ren, Mengye
    Urtasun, Raquel
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [23] Learning Contrastive Representation for Semantic Correspondence
    Taihong Xiao
    Sifei Liu
    Shalini De Mello
    Zhiding Yu
    Jan Kautz
    Ming-Hsuan Yang
    International Journal of Computer Vision, 2022, 130 : 1293 - 1309
  • [24] Contrastive Representation Learning: A Framework and Review
    Le-Khac, Phuc H.
    Healy, Graham
    Smeaton, Alan F.
    IEEE ACCESS, 2020, 8 : 193907 - 193934
  • [25] CLSEP: Contrastive learning of sentence embedding with prompt
    Wang, Qian
    Zhang, Weiqi
    Lei, Tianyi
    Cao, Yu
    Peng, Dezhong
    Wang, Xu
    KNOWLEDGE-BASED SYSTEMS, 2023, 266
  • [26] Learning From Crowds With Contrastive Representation
    Yang, Hang
    Li, Xunbo
    Pedrycz, Witold
    IEEE ACCESS, 2023, 11 : 40182 - 40191
  • [27] Contrastive representation learning on dynamic networks
    Jiao, Pengfei
    Chen, Hongjiang
    Tang, Huijun
    Bao, Qing
    Zhang, Long
    Zhao, Zhidong
    Wu, Huaming
    NEURAL NETWORKS, 2024, 174
  • [28] Multilingual Representation Distillation with Contrastive Learning
    Tan, Weiting
    Heffernan, Kevin
    Schwenk, Holger
    Koehn, Philipp
    17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 1477 - 1490
  • [29] Self-Supervised Facial Motion Representation Learning via Contrastive Subclips
    Sun, Zheng
    Torrie, Shad A.
    Sumsion, Andrew W.
    Lee, Dah-Jye
    ELECTRONICS, 2023, 12 (06)
  • [30] Graph Contrastive Learning on Complementary Embedding for Recommendation
    Liu, Meishan
    Jian, Meng
    Shi, Ge
    Xiang, Ye
    Wu, Lifang
    PROCEEDINGS OF THE 2023 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2023, 2023, : 576 - 580