Informative shape representations for human action recognition

被引:0
|
作者
Wang, Liang [1 ]
Suter, David [1 ]
机构
[1] Monash Univ, ARC Ctr Percept & Intelligent Machines Complex En, Clayton, Vic 3800, Australia
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Shape and kinematics are two important cues in human movement analysis. Due to real difficulties in extracting kinematics from videos accurately, this paper proposes to address the problem of human action recognition by spatiotemporal shape analysis. Without explicit feature tracking and complex probabilistic modeling of human movements, we directly convert an associated sequence of human silhouettes derived from videos into two types of computationally efficient representations, i.e., average motion energy and mean motion shape, to characterize actions. Supervised pattern classification techniques using various distance measures are used for recognition. The encouraging experimental results are obtained on a recent dataset including 10 different actions from 9 subjects.
引用
收藏
页码:1266 / +
页数:2
相关论文
共 50 条
  • [31] The structure of object-shape representations in visual recognition
    Leek, EC
    Reppa, I
    Arguin, M
    PERCEPTION, 2003, 32 : 120 - 120
  • [32] Disentangling representations of shape and action components in the tool network
    Wang, Xiaoying
    Zhuang, Tonghe
    Shen, Jiasi
    Bi, Yanchao
    NEUROPSYCHOLOGIA, 2018, 117 : 199 - 210
  • [33] LEARNING DISCRIMINATIVE ACTION AND CONTEXT REPRESENTATIONS FOR ACTION RECOGNITION IN STILL IMAGES
    Xin, Miao
    Zhang, Hong
    Yuan, Ding
    Sun, Mingui
    2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2017, : 757 - 762
  • [34] Informative representations of unstructured environments
    Kumar, S
    Guivant, J
    Durrant-Whyte, H
    2004 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1- 5, PROCEEDINGS, 2004, : 212 - 217
  • [35] Layered representations for human activity recognition
    Oliver, N
    Horvitz, E
    Garg, A
    FOURTH IEEE INTERNATIONAL CONFERENCE ON MULTIMODAL INTERFACES, PROCEEDINGS, 2002, : 3 - 8
  • [36] Learning sparse representations for view-independent human action recognition based on fuzzy distances
    Iosifidis, Alexandros
    Tefas, Anastasios
    Pitas, Ioannis
    NEUROCOMPUTING, 2013, 121 : 344 - 353
  • [37] Learning Discriminative Representations for Skeleton Based Action Recognition
    Zhou, Huanyu
    Liu, Qingjie
    Wang, Yunhong
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 10608 - 10617
  • [38] Invariant recognition drives neural representations of action sequences
    Tacchetti A.
    Isik L.
    Poggio T.
    Tacchetti, Andrea (atacchet@mit.edu), 1600, Public Library of Science (13):
  • [39] Complete Video-Level Representations for Action Recognition
    Li, Min
    Bai, Ruwen
    Meng, Bo
    Ren, Junxing
    Jiang, Miao
    Yang, Yang
    Li, Linghan
    Du, Hong
    IEEE ACCESS, 2021, 9 : 92134 - 92142
  • [40] Deep Set Conditioned Latent Representations for Action Recognition
    Singh, Akash
    de Schepper, Tom
    Mets, Kevin
    Hellinckx, Peter
    Oramas, Jose
    Latre, Steven
    PROCEEDINGS OF THE 17TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 5, 2022, : 456 - 466