Pipelining Localized Semantic Features for Fine-Grained Action Recognition

Cited by: 0
Authors
Zhou, Yang [1]
Ni, Bingbing [2]
Yan, Shuicheng [3]
Moulin, Pierre [4]
Tian, Qi [1]
Affiliations
[1] Univ Texas San Antonio, San Antonio, TX 78249 USA
[2] Adv Digital Sci Ctr, Singapore, Singapore
[3] Natl Univ Singapore, Singapore, Singapore
[4] Univ Illinois, Urbana, IL USA
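Source
Funding
U.S. National Science Foundation
Keywords
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract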
In fine-grained action (object manipulation) recognition, it is important to encode object semantic (contextual) information, i.e., which object is being manipulated and how it is being operated. However, previous methods for action recognition often represent the semantic information in a global and coarse way and therefore cannot cope with fine-grained actions. In this work, we propose a representation and classification pipeline which seamlessly incorporates localized semantic information into every processing step for fine-grained action recognition. In the feature extraction stage, we explore the geometric information between local motion features and the surrounding objects. In the feature encoding stage, we develop a semantic-grouped locality-constrained linear coding (SG-LLC) method that captures the joint distributions between motion and object-in-use information. Finally, we propose a semantic-aware multiple kernel learning framework (SA-MKL) by utilizing the empirical joint distribution between action and object type for more discriminative action classification. Extensive experiments are performed on the large-scale and difficult fine-grained MPII cooking action dataset. The results show that by effectively accumulating localized semantic information into the action representation and classification pipeline, we significantly improve the fine-grained action classification performance over the existing methods.
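The encoding stage mentioned in the abstract builds on locality-constrained linear coding (LLC). The sketch below shows the standard approximated LLC step (reconstructing a descriptor from its k nearest codebook entries under a sum-to-one constraint), followed by one possible reading of "semantic grouping": pooling the codes separately per object label. The grouping scheme, the function names `llc_encode` and `grouped_pooling`, and all parameter values are assumptions made for illustration; the record does not give the details of SG-LLC, so this is not the authors' method.

```python
import numpy as np


def llc_encode(x, codebook, k=5, reg=1e-4):
    """Approximated LLC code of one descriptor x (d,) over a codebook (M, d)."""
    # Pick the k codebook entries closest to x (the locality constraint).
    dists = np.sum((codebook - x) ** 2, axis=1)
    idx = np.argsort(dists)[:k]
    # Shift the neighbours so that x sits at the origin.
    z = codebook[idx] - x
    # Local covariance, regularised for numerical stability.
    C = z @ z.T
    C += reg * max(np.trace(C), 1e-12) * np.eye(k)
    # Solve the constrained least-squares problem and enforce sum(w) == 1.
    w = np.linalg.solve(C, np.ones(k))
    w /= w.sum()
    code = np.zeros(codebook.shape[0])
    code[idx] = w
    return code


def grouped_pooling(descriptors, object_ids, codebook, n_objects, k=5):
    """Hypothetical grouping: max-pool LLC codes separately per object label.

    Pooling starts from zeros, so negative coefficients are clipped to zero.
    """
    pooled = np.zeros((n_objects, codebook.shape[0]))
    for x, g in zip(descriptors, object_ids):
        pooled[g] = np.maximum(pooled[g], llc_encode(x, codebook, k))
    # Concatenate the per-object histograms into one video-level vector.
    return pooled.ravel()
```

Here `descriptors` would hold local motion features and `object_ids` the index of the object assumed to be nearest each feature; the resulting vector has `n_objects * M` entries and could be passed to any kernel classifier.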
Pages: 481-496
Page count: 16