Pipelining Localized Semantic Features for Fine-Grained Action Recognition

Cited by: 0
Authors
Zhou, Yang [1]
Ni, Bingbing [2]
Yan, Shuicheng [3]
Moulin, Pierre [4]
Tian, Qi [1]
Affiliations
[1] Univ Texas San Antonio, San Antonio, TX 78249 USA
[2] Adv Digital Sci Ctr, Singapore, Singapore
[3] Natl Univ Singapore, Singapore, Singapore
[4] Univ Illinois, Urbana, IL USA
Funding
National Science Foundation (USA)
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
In fine-grained action (object manipulation) recognition, it is important to encode object semantic (contextual) information, i.e., which object is being manipulated and how it is being operated. However, previous methods for action recognition often represent the semantic information in a global and coarse way and therefore cannot cope with fine-grained actions. In this work, we propose a representation and classification pipeline which seamlessly incorporates localized semantic information into every processing step for fine-grained action recognition. In the feature extraction stage, we explore the geometric information between local motion features and the surrounding objects. In the feature encoding stage, we develop a semantic-grouped locality-constrained linear coding (SG-LLC) method that captures the joint distributions between motion and object-in-use information. Finally, we propose a semantic-aware multiple kernel learning framework (SA-MKL) by utilizing the empirical joint distribution between action and object type for more discriminative action classification. Extensive experiments are performed on the large-scale and difficult fine-grained MPII cooking action dataset. The results show that by effectively accumulating localized semantic information into the action representation and classification pipeline, we significantly improve the fine-grained action classification performance over the existing methods.
Pages: 481-496
Page count: 16