Combining Multiple Sources of Knowledge in Deep CNNs for Action Recognition

被引:0
|
作者
Park, Eunbyung [1 ]
Han, Xufeng [1 ]
Berg, Tamara L. [1 ]
Berg, Alexander C. [1 ]
机构
[1] Univ N Carolina, Chapel Hill, NC 27599 USA
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Although deep convolutional neural networks (CNNs) have shown remarkable results for feature learning and prediction tasks, many recent studies have demonstrated improved performance by incorporating additional handcrafted features or by fusing predictions from multiple CNNs. Usually, these combinations are implemented via feature concatenation or by averaging output prediction scores from several CNNs. In this paper, we present new approaches for combining different sources of knowledge in deep learning. First, we propose feature amplification, where we use an auxiliary, hand-crafted, feature (e.g. optical flow) to perform spatially varying soft-gating on intermediate CNN feature maps. Second, we present a spatially varying multiplicative fusion method for combining multiple CNNs trained on different sources that results in robust prediction by amplifying or suppressing the feature activations based on their agreement. We test these methods in the context of action recognition where information from spatial and temporal cues is useful, obtaining results that are comparable with state-of-the-art methods and outperform methods using only CNNs and optical flow features.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Combining multiple deep cues for action recognition
    Ruiqi Wang
    Xinxiao Wu
    [J]. Multimedia Tools and Applications, 2019, 78 : 9933 - 9950
  • [2] Combining multiple deep cues for action recognition
    Wang, Ruiqi
    Wu, Xinxiao
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (08) : 9933 - 9950
  • [3] Scaling up The Training of Deep CNNs for Human Action Recognition
    Rajeswar, M. Sai
    Sankar, A. Ravi
    Balasubramanian, Vineeth N.
    Sudheer, C. D.
    [J]. 2015 IEEE 29TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS, 2015, : 1172 - 1177
  • [4] COMBINING MULTIPLE KNOWLEDGE SOURCES IN AN INTEGRATED INTELLIGENT SYSTEM
    STEIER, DM
    LEWIS, RL
    LEHMAN, JF
    ZACHERL, AL
    [J]. IEEE EXPERT-INTELLIGENT SYSTEMS & THEIR APPLICATIONS, 1993, 8 (03): : 35 - 44
  • [5] Human activity recognition from multiple sensors data using deep CNNs
    Yasin Kaya
    Elif Kevser Topuz
    [J]. Multimedia Tools and Applications, 2024, 83 : 10815 - 10838
  • [6] Human activity recognition from multiple sensors data using deep CNNs
    Kaya, Yasin
    Topuz, Elif Kevser
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (04) : 10815 - 10838
  • [7] Deep multiple aggregation networks for action recognition
    Ahmed Mazari
    Hichem Sahbi
    [J]. International Journal of Multimedia Information Retrieval, 2024, 13
  • [8] Deep multiple aggregation networks for action recognition
    Mazari, Ahmed
    Sahbi, Hichem
    [J]. INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2024, 13 (01)
  • [9] Three-stream CNNs for action recognition
    Wang, Liangliang
    Ge, Lianzheng
    Li, Ruifeng
    Fang, Yajun
    [J]. PATTERN RECOGNITION LETTERS, 2017, 92 : 33 - 40
  • [10] Action Recognition Using Deep 3D CNNs with Sequential Feature Aggregation and Attention
    Anvarov, Fazliddin
    Kim, Dae Ha
    Song, Byung Cheol
    [J]. ELECTRONICS, 2020, 9 (01)