Global-local spatio-temporal graph convolutional networks for video summarization

被引:0
|
作者
Wu, Guangli [1 ]
Song, Shanshan [1 ]
Zhang, Jing [1 ]
机构
[1] Gansu Univ Polit Sci & Law, Sch Cyberspace Secur, Lanzhou 730000, Gansu, Peoples R China
关键词
Video summarization; Graph neural networks; Temporal gating convolutional network; Spatial graph convolutional network; ATTENTION; LSTM;
D O I
10.1016/j.compeleceng.2024.109445
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Video summarization aims to create concise and accurate summary to enable users to quickly grasp the key content of the original video for facilitating efficient video browsing. Most existing video summarization methods mainly employ recurrent neural networks to capture long-term dependencies in videos, yielding remarkable results. Nevertheless, these methods overlook the potential spatial features inside the video when modeling the video. To tackle this issue, we introduce a global-local spatio-temporal graph convolutional networks for video summarization (GL-STGCN). Initially inspired by the concept of 3-D convolution, we segment the video into non-overlapping segments to capture localized spatial features of sequential frames. Subsequently, a spatial graph is constructed for each segment, with a fixed time interval between neighboring spatial graphs. Then we use the pooling method to randomly delete the redundant nodes in the graph. We then leverage a temporal gating convolutional network to extract the global temporal relationships within the video. Employing the spatial features, a spatial graph convolutional network is utilized to capture the spatial connections among frames. As the graph node information evolves, the node features furnish a more precise depiction of the video content. Consequently, we employ the temporal gating convolutional network once more to refine the global temporal relations within the video. Extensive experiments on two public datasets are conducted in this paper, showing that our proposed method outperforms most state-of-the-art video summarization methods in terms of performance. Experimental results demonstrate the effectiveness of integrating global temporal and local spatial relationships.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Continual spatio-temporal graph convolutional networks
    Hedegaard, Lukas
    Heidari, Negar
    Iosifidis, Alexandros
    [J]. PATTERN RECOGNITION, 2023, 140
  • [2] Implementating Spatio-Temporal Graph Convolutional Networks on Graphcore IPUs
    Moe, Johannes
    Pogorelov, Konstantin
    Schroeder, Daniel Thilo
    Langguth, Johannes
    [J]. 2022 IEEE 36TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW 2022), 2022, : 45 - 54
  • [3] Spatio-Temporal Joint Graph Convolutional Networks for Traffic Forecasting
    Zheng, Chuanpan
    Fan, Xiaoliang
    Pan, Shirui
    Jin, Haibing
    Peng, Zhaopeng
    Wu, Zonghan
    Wang, Cheng
    Yu, Philip S.
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (01) : 372 - 385
  • [4] Stacked Spatio-Temporal Graph Convolutional Networks for Action Segmentation
    Ghosh, Pallabi
    Yao, Yi
    Davis, Larry S.
    Divakaran, Ajay
    [J]. 2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 565 - 574
  • [5] CHAIN: Exploring Global-Local Spatio-Temporal Information for Improved Self-Supervised Video Hashing
    Wei, Rukai
    Liu, Yu
    Song, Jingkuan
    Cui, Heng
    Xie, Yanzhao
    Zhou, Ke
    [J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 1677 - 1688
  • [6] SPATIO-TEMPORAL GRAPH CONVOLUTIONAL NETWORKS FOR CONTINUOUS SIGN LANGUAGE RECOGNITION
    Parelli, Maria
    Papadimitriou, Katerina
    Potamianos, Gerasimos
    Pavlakos, Georgios
    Maragos, Petros
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 8457 - 8461
  • [7] Spatio-temporal adaptive graph convolutional networks for traffic flow forecasting
    Ma, Qiwei
    Sun, Wei
    Gao, Junbo
    Ma, Pengwei
    Shi, Mengjie
    [J]. IET INTELLIGENT TRANSPORT SYSTEMS, 2023, 17 (04) : 691 - 703
  • [8] DYNAMIC SPATIO-TEMPORAL GRAPH CONVOLUTIONAL NETWORKS FOR CARDIAC MOTION ANALYSIS
    Lu, Ping
    Bai, Wenjia
    Rueckert, Daniel
    Noble, J. Alison
    [J]. 2021 IEEE 18TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI), 2021, : 122 - 125
  • [9] Video summarization via spatio-temporal deep architecture
    Zhong, Sheng-hua
    Wu, Jiaxin
    Jiang, Jianmin
    [J]. NEUROCOMPUTING, 2019, 332 : 224 - 235
  • [10] Spatio-Temporal Action Graph Networks
    Herzig, Roei
    Levi, Elad
    Xu, Huijuan
    Gao, Hang
    Brosh, Eli
    Wang, Xiaolong
    Globerson, Amir
    Darrell, Trevor
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 2347 - 2356