ASNet: Auto-Augmented Siamese Neural Network for Action Recognition

Cited by: 3
Authors
Zhang, Yujia [1 ]
Po, Lai-Man [1 ]
Xiong, Jingjing [1 ]
Rehman, Yasar Abbas Ur [2 ]
Cheung, Kwok-Wai [3 ]
Affiliations
[1] City Univ Hong Kong, Dept Elect Engn, Hong Kong, Peoples R China
[2] TCL Corp Res Co Ltd, Hong Kong, Peoples R China
[3] Hang Seng Univ Hong Kong, Sch Commun, Hong Kong, Peoples R China
Keywords
action recognition; 3D-CNN; deep reinforcement learning; data augmentation; attention network; mask
DOI
10.3390/s21144720
Chinese Library Classification number
O65 [Analytical Chemistry]
Discipline classification code
070302; 081704
Abstract
Human action recognition methods for videos that are based on deep convolutional neural networks usually use random cropping or its variants for data augmentation. However, this traditional approach may generate many non-informative samples (video patches covering only a small part of the foreground, or only the background) that are unrelated to the labeled action. These samples can be regarded as noisy samples with incorrect labels, and they reduce overall action recognition performance. In this paper, we attempt to mitigate the impact of noisy samples by proposing an Auto-augmented Siamese Neural Network (ASNet). In this framework, we propose backpropagating salient patches and randomly cropped samples in the same iteration, so that gradient compensation alleviates the adverse gradient effects of non-informative samples. Salient patches are samples that contain critical information for human action recognition. The generation of salient patches is formulated as a Markov decision process, and a reinforcement learning agent called SPA (Salient Patch Agent) is introduced to extract patches in a weakly supervised manner without extra labels. Extensive experiments were conducted on two well-known datasets, UCF-101 and HMDB-51, to verify the effectiveness of the proposed SPA and ASNet.
Pages: 20
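
The gradient compensation idea described in the abstract can be illustrated with a toy sketch. Everything here is an assumption made for illustration and is not taken from the paper: the frame is a small 2D grid rather than a video clip, the model is a one-feature logistic classifier rather than a 3D-CNN, the two branch gradients are simply summed, and `salient_crop` is a hypothetical exhaustive search standing in for the reinforcement learning agent (SPA). The names `crop_sum`, `random_crop`, `salient_crop`, `logistic_grad`, and `siamese_step` are likewise invented for this example.

```python
import math
import random


def crop_sum(frame, top, left, size):
    """Sum of pixel values inside a size x size window (a toy 'feature')."""
    return sum(frame[r][c]
               for r in range(top, top + size)
               for c in range(left, left + size))


def random_crop(frame, size, rng):
    """Uniformly random window, as in standard random-crop augmentation."""
    h, w = len(frame), len(frame[0])
    return rng.randrange(h - size + 1), rng.randrange(w - size + 1)


def salient_crop(frame, size):
    """Hypothetical stand-in for SPA: pick the window with the most foreground.

    The paper formulates this as a Markov decision process solved by an RL
    agent; the exhaustive search here is only an illustrative substitute.
    """
    h, w = len(frame), len(frame[0])
    candidates = [(t, l)
                  for t in range(h - size + 1)
                  for l in range(w - size + 1)]
    return max(candidates, key=lambda tl: crop_sum(frame, tl[0], tl[1], size))


def logistic_grad(weight, bias, feature, label):
    """Gradient of binary cross-entropy for a one-feature logistic model."""
    pred = 1.0 / (1.0 + math.exp(-(weight * feature + bias)))
    err = pred - label
    return err * feature, err


def siamese_step(weight, bias, frame, label, size, rng, lr=0.1):
    """One update that sums gradients from both branches (shared weights)."""
    rt, rl = random_crop(frame, size, rng)
    st, sl = salient_crop(frame, size)
    area = size * size
    f_rand = crop_sum(frame, rt, rl, size) / area   # may be pure background
    f_sal = crop_sum(frame, st, sl, size) / area    # foreground-heavy patch
    gw_r, gb_r = logistic_grad(weight, bias, f_rand, label)
    gw_s, gb_s = logistic_grad(weight, bias, f_sal, label)
    # Assumption: the compensation is a plain sum of the two gradients.
    return weight - lr * (gw_r + gw_s), bias - lr * (gb_r + gb_s)


# Toy frame: 8 x 8 background of zeros with a 3 x 3 foreground block of ones.
frame = [[0.0] * 8 for _ in range(8)]
for r in range(2, 5):
    for c in range(3, 6):
        frame[r][c] = 1.0

rng = random.Random(0)
weight, bias = siamese_step(0.0, 0.0, frame, 1, 3, rng)
```

In this sketch, a random crop that lands entirely on background yields a zero feature and therefore no gradient on `weight`, while the salient branch still contributes one in the same step. That is the sense in which the second branch compensates for a non-informative sample; how ASNet actually weights or combines the branches is not stated in the abstract.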