Hierarchical Attention Network for Action Segmentation

被引:4
|
作者
Gammulle, Harshala [1 ]
Denman, Simon [1 ]
Sridharan, Sridha [1 ]
Fookes, Clinton [1 ]
机构
[1] Queensland Univ Technol, SAIVT, Image & Video Res Lab, Brisbane, Qld, Australia
关键词
Cameras;
D O I
10.1016/j.patrec.2020.01.023
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Temporal segmentation of events is an essential task and a precursor for the automatic recognition of human actions in the video. Several attempts have been made to capture frame-level salient aspects through attention but they lack the capacity to effectively map the temporal relationships in between the frames as they only capture a limited span of temporal dependencies. To this end we propose a complete end-to-end supervised learning approach that can better learn relationships between actions over time, thus improving the overall segmentation performance. The proposed hierarchical recurrent attention framework analyses the input video at multiple temporal scales, to form embeddings at frame level and segment level, and perform fine-grained action segmentation. This generates a simple, lightweight, yet extremely effective architecture for segmenting continuous video streams and has multiple application domains. We evaluate our system on multiple challenging public benchmark datasets, including MERL Shopping, 50 salads, and Georgia Tech Egocentric datasets and achieves state-of-the-art performance. The evaluated datasets encompass numerous video capture settings which are inclusive of static overhead camera views and dynamic, ego-centric head-mounted camera views, demonstrating the direct applicability of the proposed framework in a variety of settings. (c) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页码:442 / 448
页数:7
相关论文
共 50 条
  • [1] HANA: Hierarchical Attention Network Assembling for Semantic Segmentation
    Wei Liu
    Ding Li
    Hongqi Su
    Cognitive Computation, 2021, 13 : 1128 - 1135
  • [2] HANA: Hierarchical Attention Network Assembling for Semantic Segmentation
    Liu, Wei
    Li, Ding
    Su, Hongqi
    COGNITIVE COMPUTATION, 2021, 13 (05) : 1128 - 1135
  • [3] Motion saliency based hierarchical attention network for action recognition
    Guo, Zihui
    Hou, Yonghong
    Xiao, Renyi
    Li, Chuankun
    Li, Wanqing
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (03) : 4533 - 4550
  • [4] Motion saliency based hierarchical attention network for action recognition
    Zihui Guo
    Yonghong Hou
    Renyi Xiao
    Chuankun Li
    Wanqing Li
    Multimedia Tools and Applications, 2023, 82 : 4533 - 4550
  • [5] Hierarchical Self-Attention Network for Action Localization in Videos
    Pramono, Rizard Renanda Adhi
    Chen, Yie-Tarng
    Fang, Wen-Hsien
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 61 - 70
  • [6] A multi-branch hierarchical attention network for medical target segmentation
    Yu, Yongtao
    Tao, Yifei
    Guan, Haiyan
    Xiao, Shaozhang
    Li, Fenfen
    Yu, Changhui
    Liu, Zuojun
    Li, Jonathan
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2022, 78
  • [7] HANet: Hierarchical Attention Network for Remote Sensing Images Semantic Segmentation
    Zhang, Hongming
    Yang, Guang
    Gao, Zhengjie
    Shen, Yinwei
    Tang, Hengao
    Wang, Tao
    Han, Yamin
    PATTERN RECOGNITION AND COMPUTER VISION, PT XIII, PRCV 2024, 2025, 15043 : 386 - 400
  • [8] LHAS: A Lightweight Network Based on Hierarchical Attention for Hyperspectral Image Segmentation
    Song, Lujie
    Gao, Yunhao
    Gui, Yuanyuan
    Jiang, Daguang
    Zhang, Mengmeng
    Liu, Huan
    Li, Wei
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2025, 63
  • [9] Dual Attention Based Network with Hierarchical ConvLSTM for Video Object Segmentation
    Zhao, Zongji
    Zhao, Sanyuan
    PATTERN RECOGNITION AND COMPUTER VISION, PT IV, 2021, 13022 : 323 - 335
  • [10] Discriminative Feature Network Based on a Hierarchical Attention Mechanism for Semantic Hippocampus Segmentation
    Shi, Jiali
    Zhang, Rong
    Guo, Lijun
    Gao, Linlin
    Ma, Huifang
    Wang, Jianhua
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2021, 25 (02) : 504 - 513