ASNet: Auto-Augmented Siamese Neural Network for Action Recognition

Cited by: 3
Authors
Zhang, Yujia [1]
Po, Lai-Man [1]
Xiong, Jingjing [1]
Rehman, Yasar Abbas Ur [2]
Cheung, Kwok-Wai [3]
Affiliations
[1] City Univ Hong Kong, Dept Elect Engn, Hong Kong, Peoples R China
[2] TCL Corp Res Co Ltd, Hong Kong, Peoples R China
[3] Hang Seng Univ Hong Kong, Sch Commun, Hong Kong, Peoples R China
Keywords
action recognition; 3D-CNN; deep reinforcement learning; data augmentation; ATTENTION NETWORK; MASK
DOI
10.3390/s21144720
Chinese Library Classification number
O65 [Analytical Chemistry]
Discipline classification codes
070302; 081704
Abstract
Human action recognition methods for video that are based on deep convolutional neural networks usually rely on random cropping or its variants for data augmentation. However, this traditional approach may generate many non-informative samples (video patches covering only a small part of the foreground, or only the background) that are unrelated to any specific action. Such samples can be regarded as noisy samples with incorrect labels, and they reduce overall action recognition performance. In this paper, we attempt to mitigate the impact of noisy samples by proposing an Auto-augmented Siamese Neural Network (ASNet). In this framework, we propose backpropagating salient patches and randomly cropped samples in the same iteration, performing gradient compensation to alleviate the adverse gradient effects of non-informative samples. Salient patches are samples that contain critical information for human action recognition. The generation of salient patches is formulated as a Markov decision process, and a reinforcement learning agent called SPA (Salient Patch Agent) is introduced to extract patches in a weakly supervised manner without extra labels. Extensive experiments were conducted on two well-known datasets, UCF-101 and HMDB-51, to verify the effectiveness of the proposed SPA and ASNet.
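The idea of backpropagating a randomly cropped sample and a salient patch in the same iteration can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: a linear softmax classifier stands in for the 3D-CNN, the two branches share one weight matrix `W`, and the two gradients are simply summed. The abstract does not state how ASNet actually combines the gradients, and the function names `softmax_xent_grad` and `siamese_step` are hypothetical.

```python
import numpy as np

def softmax_xent_grad(W, x, y):
    """Return (loss, dL/dW) for a linear softmax classifier on one sample."""
    logits = W @ x
    logits = logits - logits.max()           # shift for numerical stability
    p = np.exp(logits) / np.exp(logits).sum()
    loss = -np.log(p[y])
    p[y] -= 1.0                               # p now holds dL/dlogits
    return loss, np.outer(p, x)

def siamese_step(W, x_random, x_salient, y, lr=0.1):
    """One update in which both branches share W (illustrative assumption).

    The gradient from the salient patch is added to the gradient from the
    random crop, so an uninformative crop does not drive the update alone.
    """
    loss_r, grad_r = softmax_xent_grad(W, x_random, y)
    loss_s, grad_s = softmax_xent_grad(W, x_salient, y)
    W_new = W - lr * (grad_r + grad_s)        # summed gradients, one step
    return W_new, loss_r + loss_s

# Toy features: 3 classes, 4-dimensional inputs (placeholder values).
W0 = np.zeros((3, 4))
x_random = np.array([0.1, 0.0, 0.2, 0.1])    # stand-in for a background-heavy crop
x_salient = np.array([0.9, 0.1, 0.8, 0.3])   # stand-in for a salient patch
W1, loss_before = siamese_step(W0, x_random, x_salient, y=1)
_, loss_after = siamese_step(W1, x_random, x_salient, y=1)
```

In this toy setting the salient patch contributes the larger gradient because its features have larger magnitude; in the paper the salient patches come from the SPA agent, which this sketch does not model.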
Pages: 20