Hybrid convolutional neural networks and optical flow for video visual attention prediction

被引:0
|
作者
Meijun Sun
Ziqi Zhou
Dong Zhang
Zheng Wang
机构
[1] Tianjin University,School of Computer Science and Technology
[2] Tianjin University of Traditional Chinese Medicine,School of Computer Software
[3] Tianjin University,undefined
来源
关键词
Convolutional neural networks; Optical flow; Spatial temporal feature; Visual attention;
D O I
暂无
中图分类号
学科分类号
摘要
In this paper, a convolutional neural networks (CNN) and optical flow based method is proposed for prediction of visual attention in the videos. First, a deep-learning framework is employed to extract spatial features in frames to replace those commonly used handcrafted features. The optical flow is calculated to obtain the temporal feature of the moving objects in video frames, which always draw audiences’ attentions. By integrating these two groups of features, a hybrid spatial temporal feature set is obtained and taken as the input of a support vector machine (SVM) to predict the degree of visual attention. Finally, two publicly available video datasets were used to test the performance of the proposed model, where the results have demonstrated the efficacy of the proposed approach.
引用
收藏
页码:29231 / 29244
页数:13
相关论文
共 50 条
  • [1] Hybrid convolutional neural networks and optical flow for video visual attention prediction
    Sun, Meijun
    Zhou, Ziqi
    Zhang, Dong
    Wang, Zheng
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (22) : 29231 - 29244
  • [2] OPTIMIZED CONVOLUTIONAL NEURAL NETWORKS FOR VIDEO INTRA PREDICTION
    Meyer, Maria
    Wiesner, Jonathan
    Rohlfing, Christian
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 3334 - 3338
  • [3] Attention based convolutional networks for traffic flow prediction
    Lin, Juncong
    Lin, Chengqiao
    Ye, Qi
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (03) : 7379 - 7394
  • [4] Attention based convolutional networks for traffic flow prediction
    Juncong Lin
    Chengqiao Lin
    Qi Ye
    Multimedia Tools and Applications, 2024, 83 : 7379 - 7394
  • [5] An extensive evaluation of deep featuresof convolutional neural networks for saliency prediction of human visual attention
    Mahdi, Ali
    Qin, Jun
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2019, 65
  • [6] Optical flow estimation using channel attention mechanism and dilated convolutional neural networks
    Zhai, Mingliang
    Xiang, Xuezhi
    Zhang, Rongfang
    Lv, Ning
    El Saddik, Abdulmotaleb
    NEUROCOMPUTING, 2019, 368 : 124 - 132
  • [7] Recurrent Fully Convolutional Networks Based on Optical Flow for Video Eyes Fixation Prediction
    Shi, Jiu-chen
    Zhang, Dong
    2018 INTERNATIONAL CONFERENCE ON COMPUTER, COMMUNICATION AND NETWORK TECHNOLOGY (CCNT 2018), 2018, 291 : 465 - 469
  • [8] Dropping Activations in Convolutional Neural Networks with Visual Attention Maps
    Montoya Obeso, Abraham
    Benois-Pineau, Jenny
    Garcia Vazquez, Mireya Sarai
    Acosta, Alejandro A. Ramirez
    2019 INTERNATIONAL CONFERENCE ON CONTENT-BASED MULTIMEDIA INDEXING (CBMI), 2019,
  • [9] Multiscale Hybrid Convolutional Deep Neural Networks with Channel Attention
    Yang, Hua
    Yang, Ming
    He, Bitao
    Qin, Tao
    Yang, Jing
    ENTROPY, 2022, 24 (09)
  • [10] A Hybrid Model for Soybean Yield Prediction Integrating Convolutional Neural Networks, Recurrent Neural Networks, and Graph Convolutional Networks
    Ingole, Vikram S.
    Kshirsagar, Ujwala A.
    Singh, Vikash
    Yadav, Manish Varun
    Krishna, Bipin
    Kumar, Roshan
    COMPUTATION, 2025, 13 (01)