Real-Time Video Saliency Prediction Via 3D Residual Convolutional Neural Network

Cited by: 4
Authors
Sun, Zhenhao [1, 2]
Wang, Xu [1, 2]
Zhang, Qiudan [3]
Jiang, Jianmin [1, 2]
Affiliations
[1] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518060, Peoples R China
[2] Shenzhen Univ, Guangdong Lab Artificial Intelligence & Digital E, Shenzhen 518060, Peoples R China
[3] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Video saliency prediction; eye fixation dataset; 3D residual convolutional neural network; DETECTION MODEL;
DOI
10.1109/ACCESS.2019.2946479
Chinese Library Classification
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Attention is a fundamental attribute of the human visual system and plays an important role in many visual perception tasks. The key issue in video saliency prediction is how to exploit temporal information efficiently. Instead of computing temporal saliency maps separately, we propose a real-time end-to-end video saliency prediction model based on a 3D residual convolutional neural network (3D-ResNet), which incorporates the prediction of spatial and temporal saliency maps into a single process. In particular, a multi-scale feature representation scheme is employed to further boost model performance. In addition, a frame-skipping strategy is proposed to speed up saliency map inference. Moreover, a new challenging eye-tracking database with 220 video clips is established to facilitate research on video saliency prediction. Extensive experimental results show that our model outperforms state-of-the-art methods on the eye fixation datasets in terms of both prediction accuracy and inference speed.
Pages: 147743 - 147754
Page count: 12
Related papers
50 in total
  • [21] 3D Parallel Fully Convolutional Networks for Real-Time Video Wildfire Smoke Detection
    Li, Xiuqing
    Chen, Zhenxue
    Wu, Q. M. Jonathan
    Liu, Chengyun
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (01) : 89 - 103
  • [22] A Quantum 3D Convolutional Neural Network with Application in Video Classification
    Blekos, Kostas
    Kosmopoulos, Dimitrios
    ADVANCES IN VISUAL COMPUTING (ISVC 2021), PT I, 2021, 13017 : 601 - 612
  • [23] 3D Convolutional Neural Network based on memristor for video recognition
    Liu, Jiaqi
    Li, Zhenghao
    Tang, Yongliang
    Hu, Wei
    Wu, Jun
    PATTERN RECOGNITION LETTERS, 2020, 130 (130) : 116 - 124
  • [24] Real-time Detection of Facial Expression Based on Improved Residual Convolutional Neural Network
    Wang, Sen
    Wang, Xiaofei
    Chen, Runxing
    Liu, Yong
    Huang, Shuo
    CONFERENCE PROCEEDINGS OF 2019 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMMUNICATIONS AND COMPUTING (IEEE ICSPCC 2019), 2019,
  • [25] 3D visual saliency and convolutional neural network for blind mesh quality assessment
    Ilyass Abouelaziz
    Aladine Chetouani
    Mohammed El Hassouni
    Longin Jan Latecki
    Hocine Cherifi
    Neural Computing and Applications, 2020, 32 : 16589 - 16603
  • [26] Real-time active 3D shape reconstruction for 3D video
    Wu, X
    Matsuyama, T
    ISPA 2003: PROCEEDINGS OF THE 3RD INTERNATIONAL SYMPOSIUM ON IMAGE AND SIGNAL PROCESSING AND ANALYSIS, PTS 1 AND 2, 2003, : 186 - 191
  • [27] 3D visual saliency and convolutional neural network for blind mesh quality assessment
    Abouelaziz, Ilyass
    Chetouani, Aladine
    El Hassouni, Mohammed
    Latecki, Longin Jan
    Cherifi, Hocine
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (21): : 16589 - 16603
  • [28] SCCB-U-Net: Convolutional neural network for real-time analysis of 3D mechanical properties of umbilical
    Wang, Lifu
    Zhu, Liangkuan
    Shi, Dongyan
    Qi, Mei
    Helal, Wasim M. K.
    MECHANICS OF ADVANCED MATERIALS AND STRUCTURES, 2025,
  • [29] Development of automated feature extraction and convolutional neural network optimization for real-time warping monitoring in 3D printing
    Xie, Jiarui
    Saluja, Aditya
    Rahimizadeh, Amirmohammad
    Fayazbakhsh, Kazem
    INTERNATIONAL JOURNAL OF COMPUTER INTEGRATED MANUFACTURING, 2022, 35 (08) : 813 - 830
  • [30] LVNet: A lightweight volumetric convolutional neural network for real-time and high-performance recognition of 3D objects
    Li, Lianwei
    Qin, Shiyin
    Yang, Ning
    Hong, Li
    Dai, Yang
    Wang, Zhiqiang
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (21) : 61047 - 61063