Graph Attention Networks Adjusted Bi-LSTM for Video Summarization

被引:26
|
作者
Zhong, Rui [1 ]
Wang, Rui [1 ]
Zou, Yang [1 ]
Hong, Zhiqiang [1 ]
Hu, Min [2 ]
机构
[1] Cent China Normal Univ, Sch Comp, Wuhan 430079, Peoples R China
[2] Wuhan Univ, Natl Engn Res Ctr Multimedia Software, Wuhan 430072, Peoples R China
基金
中国国家自然科学基金;
关键词
Visualization; Feature extraction; Semantics; Transforms; Redundancy; Mathematical model; Histograms; Graph attention networks; Bi-LSTM; video summarization; unsupervised learning;
D O I
10.1109/LSP.2021.3066349
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The high redundancy among keyframes is a critical issue for the prior summarizing methods in dealing with user-created videos. To address the critical issue, we present a Graph Attention Networks (GAT) adjusted Bi-directional Long Short-term Memory (Bi-LSTM) model for unsupervised video summarization. First, the GAT is adopted to transform an image's visual features into higher-level features by the Contextual Features based Transformation (CFT) mechanism. Specifically, a novel Salient-Area-Size-based spatial attention model is presented to extract frame-wise visual features on the observation that humans tend to focus on sizable and moving objects. Second, the higher-level visual features are integrated with semantic features processed by Bi-LSTM to refine the frame-wise probability of being selected as keyframes. Extensive experiments demonstrate that our method outperforms state-of-the-art methods.
引用
收藏
页码:663 / 667
页数:5
相关论文
共 50 条
  • [1] Attention Based Graph Bi-LSTM Networks for Traffic Forecasting
    Zhao, Han
    Yang, Huan
    Wang, Yu
    Wang, Danwei
    Su, Rong
    [J]. 2020 IEEE 23RD INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2020,
  • [2] Deep hierarchical LSTM networks with attention for video summarization
    Lin, Jingxu
    Zhong, Sheng-hua
    Fares, Ahmed
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2022, 97
  • [3] Spatial-Temporal Attention in Bi-LSTM Networks based on Multiple Features for Video Captioning
    Li Chu-yi
    Yu Wei-yu
    [J]. 2018 INTERNATIONAL CONFERENCE ON IMAGE AND VIDEO PROCESSING, AND ARTIFICIAL INTELLIGENCE, 2018, 10836
  • [4] Video Summarization with LSTM and Deep Attention Models
    Casas, Luis Lebron
    Koblents, Eugenia
    [J]. MULTIMEDIA MODELING, MMM 2019, PT II, 2019, 11296 : 67 - 79
  • [5] Self-Attention Bi-LSTM Networks for Radar Signal Modulation Recognition
    Wei, Shunjun
    Qu, Qizhe
    Zeng, Xiangfeng
    Liang, Jiadian
    Shi, Jun
    Zhang, Xiaoling
    [J]. IEEE TRANSACTIONS ON MICROWAVE THEORY AND TECHNIQUES, 2021, 69 (11) : 5160 - 5172
  • [6] Mining Aspects in Online Comments with Attention and Bi-LSTM
    Yao, Lan
    Rong, Heting
    Chu, Chao
    Gao, Fuxiang
    [J]. 5TH INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING AND COMMUNICATIONS (BIGCOM 2019), 2019, : 288 - 292
  • [7] Deep Bi-LSTM Networks for Sequential Recommendation
    Zhao, Chuanchuan
    You, Jinguo
    Wen, Xinxian
    Li, Xiaowu
    [J]. ENTROPY, 2020, 22 (08)
  • [8] Att-BiL-SL: Attention-Based Bi-LSTM and Sequential LSTM for Describing Video in the Textual Formation
    Ahmed, Shakil
    Saif, A. F. M. Saifuddin
    Hanif, Md Imtiaz
    Shakil, Md Mostofa Nurannabi
    Jaman, Md Mostofa
    Haque, Md Mazid Ul
    Shawkat, Siam Bin
    Hasan, Jahid
    Sonok, Borshan Sarker
    Rahman, Farzad
    Sabbir, Hasan Muhommod
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (01):
  • [9] Unsupervised Video Summarization with Adversarial LSTM Networks
    Mahasseni, Behrooz
    Lam, Michael
    Todorovic, Sinisa
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 2982 - 2991
  • [10] Video summarization with a graph convolutional attention network
    Li, Ping
    Tang, Chao
    Xu, Xianghua
    [J]. FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2021, 22 (06) : 902 - 913