Discriminative Segment Focus Network for Fine-grained Video Action Recognition

被引:0
|
作者
Sun, Baoli [1 ]
Ye, Xinchen [2 ]
Yan, Tiantian [3 ]
Wang, Zhihui [2 ]
Li, Haojie [4 ]
Wang, Zhiyong [5 ]
机构
[1] Dalian Univ Technol, Dalian, Liaoning, Peoples R China
[2] Dalian Univ Technol, DUT RU Int Sch Informat Sci & Engn, Dalian, Liaoning, Peoples R China
[3] Dalian Univ, Natl & Local Joint Engn Lab Comp Aided Design, Dalian, Liaoning, Peoples R China
[4] Shandong Univ Sci & Technol, Coll Comp Sci & Engn, Qingdao, Shandong, Peoples R China
[5] Univ Sydney, Sch Informat Technol, Sydney, NSW, Australia
关键词
Fine-grained action recognition; discriminative segment; correlation;
D O I
10.1145/3654671
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Fine-grained video action recognition aims at identifying minor and discriminative variations among fine categories of actions. While many recent action recognition methods have been proposed to better model spatio-temporal representations, how to model the interactions among discriminative atomic actions to effectively characterize inter-class and intra-class variations has been neglected, which is vital for understanding fine-grained actions. In this work, we devise a Discriminative Segment Focus Network (DSFNet) to mine the discriminability of segment correlations and localize discriminative action-relevant segments for fine-grained video action recognition. Firstly, we propose a hierarchic correlation reasoning (HCR) module which explicitly establishes correlations between different segments at multiple temporal scales and enhances each segment by exploiting the correlations with other segments. Secondly, a discriminative segment focus (DSF) module is devised to localize the most action-relevant segments fromthe enhanced representations of HCR by enforcing the consistency between the discriminability and the classification confidence of a given segment with a consistency constraint. Finally, these localized segment representations are combined with the global action representation of the whole video for boosting final recognition. Extensive experimental results on two fine-grained action recognition datasets, i.e., FineGym and Diving48, and two action recognition datasets, i.e., Kinetics400 and Something-Something, demonstrate the effectiveness of our approach compared with the state-of-the-art methods.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] Convolutional transformer network for fine-grained action recognition
    Ma, Yujun
    Wang, Ruili
    Zong, Ming
    Ji, Wanting
    Wang, Yi
    Ye, Baoliu
    [J]. NEUROCOMPUTING, 2024, 569
  • [2] Periodic-Aware Network for Fine-Grained Action Recognition
    Luo, Senzi
    Xiao, Jiayin
    Li, Dong
    Jian, Muwei
    [J]. PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT VIII, 2024, 14432 : 105 - 117
  • [3] Discriminative semantic region selection for fine-grained recognition
    Zhang, Chunjie
    Wang, Da-Han
    Li, Haisheng
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2021, 77
  • [4] Exploring Coarse-to-Fine Action Token Localization and Interaction for Fine-grained Video Action Recognition
    Sun, Baoli
    Ye, Xinchen
    Wang, Zhihui
    Li, Haojie
    Wang, Zhiyong
    [J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 5070 - 5078
  • [5] Fine-Grained Action Recognition Based on Temporal Pyramid Excitation Network
    Zhou, Xuan
    Yi, Jianping
    [J]. INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2023, 37 (02): : 2103 - 2116
  • [6] Fine-Grained Crowdsourcing for Fine-Grained Recognition
    Jia Deng
    Krause, Jonathan
    Li Fei-Fei
    [J]. 2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 580 - 587
  • [7] TaiChi: A Fine-Grained Action Recognition Dataset
    Sun, Shan
    Wang, Feng
    Liang, Qi
    He, Liang
    [J]. PROCEEDINGS OF THE 2017 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR'17), 2017, : 434 - 438
  • [8] Multiresolution Discriminative Mixup Network for Fine-Grained Visual Categorization
    Xu, Kunran
    Lai, Rui
    Gu, Lin
    Li, Yishi
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (07) : 3488 - 3500
  • [9] A Fine-Grained Vehicle Behavior Recognition Framework: Struct Segment Temporal Convolutional Network
    Yan, Guozhi
    Liu, Kai
    Hu, Junbo
    Jin, Feiyu
    Zhang, Hao
    [J]. 2023 IEEE 26TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS, ITSC, 2023, : 3404 - 3410
  • [10] Discriminative Feature Mining and Enhancement Network for Low-Resolution Fine-Grained Image Recognition
    Yan, Tiantian
    Li, Haojie
    Sun, Baoli
    Wang, Zhihui
    Luo, Zhongxuan
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (08) : 5319 - 5330