Dynamic Spatio-Temporal Specialization Learning for Fine-Grained Action Recognition

被引：14

作者：

Li, Tianjiao ^{[1
]}

Foo, Lin Geng ^{[1
]}

Ke, Qiuhong ^{[2
]}

Rahmani, Hossein ^{[3
]}

Wang, Anran ^{[4
]}

Wang, Jinghua ^{[5
]}

Liu, Jun ^{[1
]}

机构：

[1] Singapore Univ Technol & Design, ISTD Pillar, Singapore, Singapore

[2] Monash Univ, Dept Data Sci & AI, Melbourne, Vic, Australia

[3] Univ Lancaster, Sch Comp & Commun, Lancaster, England

[4] ByteDance, Beijing, Peoples R China

[5] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin, Peoples R China

来源：

COMPUTER VISION - ECCV 2022, PT IV | 2022年 / 13664卷

基金：

新加坡国家研究基金会;

关键词：

Action recognition; Fine-grained; Dynamic neural networks; HUMAN NEURAL SYSTEM; FACE; REPRESENTATIONS; IDENTITY;

D O I：

10.1007/978-3-031-19772-7_23

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The goal of fine-grained action recognition is to successfully discriminate between action categories with subtle differences. To tackle this, we derive inspiration from the human visual system which contains specialized regions in the brain that are dedicated towards handling specific tasks. We design a novel Dynamic Spatio-Temporal Specialization (DSTS) module, which consists of specialized neurons that are only activated for a subset of samples that are highly similar. During training, the loss forces the specialized neurons to learn discriminative fine-grained differences to distinguish between these similar samples, improving fine-grained recognition. Moreover, a spatio-temporal specialization method further optimizes the architectures of the specialized neurons to capture either more spatial or temporal fine-grained information, to better tackle the large range of spatio-temporal variations in the videos. Lastly, we design an Upstream-Downstream Learning algorithm to optimize our model's dynamic decisions during training, improving the performance of our DSTS module. We obtain state-of-the-art performance on two widely-used fine-grained action recognition datasets.

引用

页码：386 / 403

页数：18

共 50 条

[31] Fine-Grained Crowdsourcing for Fine-Grained Recognition
Jia Deng
Krause, Jonathan
Li Fei-Fei
2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 580 - 587
[32] JOINT LEARNING ON THE HIERARCHY REPRESENTATION FOR FINE-GRAINED HUMAN ACTION RECOGNITION
Leong, Mei Chee
Tan, Hui Li
Zhang, Haosong
Li, Liyuan
Lin, Feng
Lim, Joo Hwee
2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 1059 - 1063
[33] Fine-grained skeleton action recognition with pairwise motion salience learning
Li H.
Tu Z.
Xie W.
Zhang J.
Scientia Sinica Informationis, 2023, 53 (12) : 2440 - 2457
[34] TaiChi: A Fine-Grained Action Recognition Dataset
Sun, Shan
Wang, Feng
Liang, Qi
He, Liang
PROCEEDINGS OF THE 2017 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR'17), 2017, : 434 - 438
[35] CTM: Cross-time temporal module for fine-grained action recognition
Qian, Huifang
Zhang, Jialun
Yi, Jianping
Shi, Zhenyu
Zhang, Yimin
COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 244
[36] Learning Dynamic Spatio-Temporal Relations for Human Activity Recognition
Liu, Zhenyu
Yao, Yaqiang
Liu, Yan
Zhu, Yuening
Tao, Zhenchao
Wang, Lei
Feng, Yuhong
IEEE ACCESS, 2020, 8 : 130340 - 130352
[37] A Spatio-temporal Feature Learning Approach for Dynamic Scene Recognition
Ullah, Ihsan
Petrosino, Alfredo
PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2017, 2017, 10597 : 591 - 598
[38] Fine-grained Temporal Contrastive Learning for Weakly-supervised Temporal Action Localization
Gao, Junyu
Chen, Mengyuan
Xu, Changsheng
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 19967 - 19977
[39] A Spatio-temporal Graph Transformer driven model for recognizing fine-grained data human activity
Mao, Yan
Zhang, Guoyin
Ye, Cuicui
ALEXANDRIA ENGINEERING JOURNAL, 2024, 104 : 31 - 45
[40] Fine-grained predicting urban crowd flows with adaptive spatio-temporal graph convolutional network
Yang, Xu
Zhu, Qiang
Li, Peihao
Chen, Pengpeng
Niu, Qiang
NEUROCOMPUTING, 2021, 446 : 95 - 105

← 1 2 3 4 5 →