Hierarchical Video Frame Sequence Representation with Deep Convolutional Graph Network

Cited by: 9
Authors
Mao, Feng [1]
Wu, Xiang [1]
Xue, Hui [1]
Zhang, Rong [1]
Affiliations
[1] Alibaba Group, Hangzhou, People's Republic of China
Keywords
Video classification; Sequence representation; Graph neural network; Deep convolutional neural network
DOI
10.1007/978-3-030-11018-5_24
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
High-accuracy video label prediction (classification) models are attributed to large-scale data. These data can be frame feature sequences extracted by a pre-trained convolutional neural network, which improves the efficiency of building models. Unsupervised solutions such as feature average pooling, a simple, label-independent, parameter-free method, have limited ability to represent the video, while supervised methods such as RNNs can greatly improve recognition accuracy. However, videos are usually long, and there are hierarchical relationships between frames across events in a video, so the performance of RNN-based models degrades. In this paper, we propose a novel video classification method based on a deep convolutional graph neural network (DCGN). The proposed method exploits the hierarchical structure of the video and performs multi-level feature extraction on the video frame sequence through the graph network, obtaining a video representation that reflects the event semantics hierarchically. We test our model on the YouTube-8M Large-Scale Video Understanding dataset, and the results outperform RNN-based benchmarks.
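To make the contrast in the abstract concrete, the following is a minimal sketch, not the authors' DCGN. It compares flat feature average pooling with a simple multi-level aggregation in which adjacent nodes are merged level by level. The function names `average_pool` and `hierarchical_pool`, the `window` parameter, and the use of plain averaging in place of learned graph convolutions are all assumptions made for illustration.

```python
import numpy as np


def average_pool(frames: np.ndarray) -> np.ndarray:
    """Flat, parameter-free baseline: mean of frame features over time."""
    return frames.mean(axis=0)


def hierarchical_pool(frames: np.ndarray, window: int = 2) -> list:
    """Illustrative multi-level aggregation (hypothetical, not DCGN itself).

    Repeatedly merges `window` adjacent nodes into one by averaging and
    returns one array per level, from the frame level to a single node.
    """
    if window < 2:
        raise ValueError("window must be at least 2")
    levels = [frames]
    current = frames
    while current.shape[0] > 1:
        n, d = current.shape
        pad = (-n) % window
        if pad:
            # Repeat the last node so the sequence length divides evenly.
            tail = np.repeat(current[-1:], pad, axis=0)
            current = np.vstack([current, tail])
        current = current.reshape(-1, window, d).mean(axis=1)
        levels.append(current)
    return levels


# Synthetic stand-in for CNN frame features: 8 frames, 4 dimensions each.
rng = np.random.default_rng(0)
frames = rng.normal(size=(8, 4))
levels = hierarchical_pool(frames, window=2)
print([level.shape for level in levels])
```

In this sketch the intermediate levels play the role of shot-level or event-level summaries, which flat pooling discards. A learned model would replace the plain averages with trainable operations.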
Pages: 262-270
Number of pages: 9