Hierarchical Attention Network for Action Segmentation

Cited by: 4

Authors:

Gammulle, Harshala [1]

Denman, Simon [1]

Sridharan, Sridha [1]

Fookes, Clinton [1]

Affiliations:

[1] Queensland Univ Technol, SAIVT, Image & Video Res Lab, Brisbane, Qld, Australia

Source:

PATTERN RECOGNITION LETTERS | 2020 / Vol. 131

Keywords:

Cameras;

DOI:

10.1016/j.patrec.2020.01.023

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Temporal segmentation of events is an essential task and a precursor for the automatic recognition of human actions in video. Several attempts have been made to capture frame-level salient aspects through attention, but they lack the capacity to effectively map the temporal relationships between frames, as they only capture a limited span of temporal dependencies. To this end, we propose a complete end-to-end supervised learning approach that can better learn relationships between actions over time, thus improving the overall segmentation performance. The proposed hierarchical recurrent attention framework analyses the input video at multiple temporal scales to form embeddings at frame level and segment level, and performs fine-grained action segmentation. This generates a simple, lightweight, yet extremely effective architecture for segmenting continuous video streams and has multiple application domains. We evaluate our system on multiple challenging public benchmark datasets, including the MERL Shopping, 50 Salads, and Georgia Tech Egocentric datasets, and achieve state-of-the-art performance. The evaluated datasets encompass numerous video capture settings, including static overhead camera views and dynamic, ego-centric head-mounted camera views, demonstrating the direct applicability of the proposed framework in a variety of settings. (c) 2020 Elsevier B.V. All rights reserved.
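
The two-level idea in the abstract (frame-level embeddings pooled into segment-level embeddings, each weighted by attention) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the architecture, so the dot-product scoring, the fixed non-overlapping window, the random stand-in features, and the names `attention_pool` and `hierarchical_embed` are all assumptions made for illustration. The recurrent encoders and the per-frame classifier that the paper's title and abstract imply are omitted.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_pool(vectors, query):
    """Attention-weighted average of `vectors`.

    Weights are the softmax of each vector's dot product with `query`
    (an assumed scoring function, chosen only for simplicity).
    """
    scores = [sum(v_i * q_i for v_i, q_i in zip(v, query)) for v in vectors]
    weights = softmax(scores)
    dim = len(vectors[0])
    pooled = [sum(w * v[d] for w, v in zip(weights, vectors)) for d in range(dim)]
    return pooled, weights


def hierarchical_embed(frames, window, frame_query, segment_query):
    """Pool frames into segment embeddings, then segments into one context vector."""
    # Frame level: each non-overlapping window of frames becomes one segment embedding.
    segments = []
    for start in range(0, len(frames), window):
        seg, _ = attention_pool(frames[start:start + window], frame_query)
        segments.append(seg)
    # Segment level: attend over the segment embeddings.
    context, seg_weights = attention_pool(segments, segment_query)
    return segments, context, seg_weights


# Hypothetical input: 10 frames of 4-dimensional features (random stand-ins).
random.seed(0)
dim, n_frames, window = 4, 10, 4
frames = [[random.gauss(0, 1) for _ in range(dim)] for _ in range(n_frames)]
frame_query = [random.gauss(0, 1) for _ in range(dim)]
segment_query = [random.gauss(0, 1) for _ in range(dim)]

segments, context, seg_weights = hierarchical_embed(
    frames, window, frame_query, segment_query
)
print(len(segments), len(context))  # 3 segments, each 4-dimensional
```

In a trained model the query vectors would be learned parameters and the frame features would come from a visual backbone; here they are random so the sketch stays self-contained.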

Citation

Pages: 442-448

Page count: 7

50 records in total

[1] HANA: Hierarchical Attention Network Assembling for Semantic Segmentation
Wei Liu
Ding Li
Hongqi Su
Cognitive Computation, 2021, 13 : 1128 - 1135
[2] HANA: Hierarchical Attention Network Assembling for Semantic Segmentation
Liu, Wei
Li, Ding
Su, Hongqi
COGNITIVE COMPUTATION, 2021, 13 (05) : 1128 - 1135
[3] Motion saliency based hierarchical attention network for action recognition
Guo, Zihui
Hou, Yonghong
Xiao, Renyi
Li, Chuankun
Li, Wanqing
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (03) : 4533 - 4550
[4] Motion saliency based hierarchical attention network for action recognition
Zihui Guo
Yonghong Hou
Renyi Xiao
Chuankun Li
Wanqing Li
Multimedia Tools and Applications, 2023, 82 : 4533 - 4550
[5] Hierarchical Self-Attention Network for Action Localization in Videos
Pramono, Rizard Renanda Adhi
Chen, Yie-Tarng
Fang, Wen-Hsien
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 61 - 70
[6] A multi-branch hierarchical attention network for medical target segmentation
Yu, Yongtao
Tao, Yifei
Guan, Haiyan
Xiao, Shaozhang
Li, Fenfen
Yu, Changhui
Liu, Zuojun
Li, Jonathan
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2022, 78
[7] HANet: Hierarchical Attention Network for Remote Sensing Images Semantic Segmentation
Zhang, Hongming
Yang, Guang
Gao, Zhengjie
Shen, Yinwei
Tang, Hengao
Wang, Tao
Han, Yamin
PATTERN RECOGNITION AND COMPUTER VISION, PT XIII, PRCV 2024, 2025, 15043 : 386 - 400
[8] LHAS: A Lightweight Network Based on Hierarchical Attention for Hyperspectral Image Segmentation
Song, Lujie
Gao, Yunhao
Gui, Yuanyuan
Jiang, Daguang
Zhang, Mengmeng
Liu, Huan
Li, Wei
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2025, 63
[9] Dual Attention Based Network with Hierarchical ConvLSTM for Video Object Segmentation
Zhao, Zongji
Zhao, Sanyuan
PATTERN RECOGNITION AND COMPUTER VISION, PT IV, 2021, 13022 : 323 - 335
[10] Discriminative Feature Network Based on a Hierarchical Attention Mechanism for Semantic Hippocampus Segmentation
Shi, Jiali
Zhang, Rong
Guo, Lijun
Gao, Linlin
Ma, Huifang
Wang, Jianhua
IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2021, 25 (02) : 504 - 513
