Hierarchical Video Frame Sequence Representation with Deep Convolutional Graph Network

被引:9
|
作者
Mao, Feng [1 ]
Wu, Xiang [1 ]
Xue, Hui [1 ]
Zhang, Rong [1 ]
机构
[1] Alibaba Grp, Hangzhou, Peoples R China
关键词
Video classification; Sequence representation; Graph neural network; Deep convolutional neural network;
D O I
10.1007/978-3-030-11018-5_24
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
High accuracy video label prediction (classification) models are attributed to large scale data. These data could be frame feature sequences extracted by a pre-trained convolutional-neural-network, which promote the efficiency for creating models. Unsupervised solutions such as feature average pooling, as a simple label-independent parameter-free based method, has limited ability to represent the video. While the supervised methods, like RNN, can greatly improve the recognition accuracy. However, the video length is usually long, and there are hierarchical relationships between frames across events in the video, the performance of RNN based models are decreased. In this paper, we proposes a novel video classification method based on a deep convolutional graph neural network (DCGN). The proposed method utilize the characteristics of the hierarchical structure of the video, and performed multi-level feature extraction on the video frame sequence through the graph network, obtained a video representation reflecting the event semantics hierarchically. We test our model on YouTube-8M Large-Scale Video Understanding dataset, and the result outperforms RNN based benchmarks.
引用
收藏
页码:262 / 270
页数:9
相关论文
共 50 条
  • [1] HIERARCHICAL SEQUENCE REPRESENTATION WITH GRAPH NETWORK
    Chen, Da
    Wu, Xiang
    Dong, Jianfeng
    He, Yuan
    Xue, Hui
    Mao, Feng
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 2288 - 2292
  • [2] A deep convolutional neural network for video sequence background subtraction
    Babaee, Mohammadreza
    Duc Tung Dinh
    Rigoll, Gerhard
    PATTERN RECOGNITION, 2018, 76 : 635 - 649
  • [3] A lightweight hierarchical graph convolutional model for knowledge graph representation learning
    Zhang, Jinglin
    Shen, Bo
    APPLIED INTELLIGENCE, 2024, 54 (21) : 10695 - 10708
  • [4] Hierarchical Bipartite Graph Convolutional Network for Recommendation
    Cheng, Yi-Wei
    Zhong, Zhiqiang
    Pang, Jun
    Li, Cheng-Te
    IEEE COMPUTATIONAL INTELLIGENCE MAGAZINE, 2024, 19 (02) : 49 - 60
  • [5] Video summarization with a graph convolutional attention network
    Li, Ping
    Tang, Chao
    Xu, Xianghua
    FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2021, 22 (06) : 902 - 913
  • [6] Object Detection in Video Surveillance Based on Multiscale Frame Representation and Block Processing by a Convolutional Neural Network
    Rykhard Bohush
    Guangdi Ma
    Yang Weichen
    Sergey Ablameyko
    Pattern Recognition and Image Analysis, 2022, 32 : 1 - 10
  • [7] Object Detection in Video Surveillance Based on Multiscale Frame Representation and Block Processing by a Convolutional Neural Network
    Bohush, Rykhard
    Ma, Guangdi
    Yang Weichen
    Ablameyko, Sergey
    PATTERN RECOGNITION AND IMAGE ANALYSIS, 2022, 32 (01) : 1 - 10
  • [8] A deep graph convolutional neural network architecture for graph classification
    Zhou, Yuchen
    Huo, Hongtao
    Hou, Zhiwen
    Bu, Fanliang
    PLOS ONE, 2023, 18 (03):
  • [9] A deep graph convolutional neural network architecture for graph classification
    Zhou, Yuchen
    Huo, Hongtao
    Hou, Zhiwen
    Bu, Fanliang
    PLOS BIOLOGY, 2023, 21 (03)
  • [10] Multi-Scale Graph Convolutional Network With Spectral Graph Wavelet Frame
    Shen, Yangmei
    Dai, Wenrui
    Li, Chenglin
    Zou, Junni
    Xiong, Hongkai
    IEEE TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING OVER NETWORKS, 2021, 7 : 595 - 610