Fine-Grained Video Categorization with Redundancy Reduction Attention

被引:23
|
作者
Zhu, Chen [1 ]
Tan, Xiao [2 ]
Zhou, Feng [3 ]
Liu, Xiao [2 ]
Yue, Kaiyu [2 ]
Ding, Errui [2 ]
Ma, Yi [4 ]
机构
[1] Univ Maryland, College Pk, MD 20742 USA
[2] Baidu Inc, Dept Comp Vis Technol VIS, Beijing, Peoples R China
[3] Baidu Res, Sunnyvale, CA USA
[4] Univ Calif Berkeley, Berkeley, CA 94720 USA
来源
关键词
Fine-grained video categorization; Attention mechanism;
D O I
10.1007/978-3-030-01228-1_9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
For fine-grained categorization tasks, videos could serve as a better source than static images as videos have a higher chance of containing discriminative patterns. Nevertheless, a video sequence could also contain a lot of redundant and irrelevant frames. How to locate critical information of interest is a challenging task. In this paper, we propose a new network structure, known as Redundancy Reduction Attention (RRA), which learns to focus on multiple discriminative patterns by suppressing redundant feature channels. Specifically, it firstly summarizes the video by weight-summing all feature vectors in the feature maps of selected frames with a spatio-temporal soft attention, and then predicts which channels to suppress or to enhance according to this summary with a learned non-linear transform. Suppression is achieved by modulating the feature maps and threshing out weak activations. The updated feature maps are then used in the next iteration. Finally, the video is classified based on multiple summaries. The proposed method achieves outstanding performances in multiple video classification datasets. Furthermore, we have collected two large-scale video datasets, YouTube-Birds and YouTube-Cars, for future researches on fine-grained video categorization. The datasets are available at http://www.cs.umd.edu/similar to chenzhu/fgvc.
引用
收藏
页码:139 / 155
页数:17
相关论文
共 50 条
  • [1] R2-trans: Fine-grained visual categorization with redundancy reduction
    Ye, Shuo
    Yu, Shujian
    Wang, Yu
    You, Xinge
    [J]. IMAGE AND VISION COMPUTING, 2024, 143
  • [2] Category attention transfer for efficient fine-grained visual categorization
    Liao, Qiyu
    Wang, Dadong
    Xu, Min
    [J]. PATTERN RECOGNITION LETTERS, 2022, 153 : 10 - 15
  • [3] Fine-grained redundancy in adders
    Ndai, Patrick
    Lu, Shih-Lien
    Somesekhar, Dinesh
    Roy, Kaushik
    [J]. ISQED 2007: PROCEEDINGS OF THE EIGHTH INTERNATIONAL SYMPOSIUM ON QUALITY ELECTRONIC DESIGN, 2007, : 317 - +
  • [4] Fine-Grained Categorization by Alignments
    Gavves, E.
    Fernando, B.
    Snoek, C. G. M.
    Smeulders, A. W. M.
    Tuytelaars, T.
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 1713 - 1720
  • [5] Filtration and Distillation: Enhancing Region Attention for Fine-Grained Visual Categorization
    Liu, Chuanbin
    Xie, Hongtao
    Zha, Zheng-Jun
    Ma, Lingfeng
    Yu, Lingyun
    Zhang, Yongdong
    [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11555 - 11562
  • [6] Multistage attention region supplement transformer for fine-grained visual categorization
    Mei, Aokun
    Huo, Hua
    Xu, Jiaxin
    Xu, Ningya
    [J]. VISUAL COMPUTER, 2024,
  • [7] Multiscale attention dynamic aware network for fine-grained visual categorization
    Ou, Jichu
    Li, Wanyi
    Huang, Jingmin
    Huang, Xiaojie
    Xie, Xuan
    [J]. ELECTRONICS LETTERS, 2023, 59 (01)
  • [8] Local Alignments for Fine-Grained Categorization
    Efstratios Gavves
    Basura Fernando
    Cees G. M. Snoek
    Arnold W. M. Smeulders
    Tinne Tuytelaars
    [J]. International Journal of Computer Vision, 2015, 111 : 191 - 212
  • [9] Local Alignments for Fine-Grained Categorization
    Gavves, Efstratios
    Fernando, Basura
    Snoek, Cees G. M.
    Smeulders, Arnold W. M.
    Tuytelaars, Tinne
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2015, 111 (02) : 191 - 212
  • [10] A Survey of Fine-Grained Image Categorization
    Zheng, Min
    Li, Qingyong
    Geng, Yangli-ao
    Yu, Haomin
    Wang, Jianzhu
    Gan, Jinrui
    Xue, Wenyuan
    [J]. PROCEEDINGS OF 2018 14TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2018, : 533 - 538