Real-Time Video Saliency Prediction Via 3D Residual Convolutional Neural Network

被引:4
|
作者
Sun, Zhenhao [1 ,2 ]
Wang, Xu [1 ,2 ]
Zhang, Qiudan [3 ]
Jiang, Jianmin [1 ,2 ]
机构
[1] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518060, Peoples R China
[2] Shenzhen Univ, Guangdong Lab Artificial Intelligence & Digital E, Shenzhen 518060, Peoples R China
[3] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Video saliency prediction; eye fixation dataset; 3D residual convolutional neural network; DETECTION MODEL;
D O I
10.1109/ACCESS.2019.2946479
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Attention is a fundamental attribute of human visual system that plays important roles in many visual perception tasks. The key issue of video saliency lies in how to efficiently exploit the temporal information. Instead of singling out the temporal saliency maps, we propose a real-time end-to-end video saliency prediction model via 3D residual convolutional neural network (3D-ResNet), which incorporates the prediction of spatial and temporal saliency maps into one single process. In particular, a multi-scale feature representation scheme is employed to further boost the model performance. Besides, a frame skipping strategy is proposed for speeding up the saliency map inference process. Moreover, a new challenging eye tracking database with 220 video clips is established to facilitate the research of video saliency prediction. Extensive experimental results show our model outperforms the state-of-the-art methods over the eye fixation datasets in terms of both prediction accuracy and inference speed.
引用
收藏
页码:147743 / 147754
页数:12
相关论文
共 50 条
  • [31] Fixed-Point Convolutional Neural Network for Real-Time Video Processing in FPGA
    Solovyev, Roman
    Kustov, Alexander
    Telpukhov, Dmitry
    Rukhlov, Vladimir
    Kalinin, Alexandr
    PROCEEDINGS OF THE 2019 IEEE CONFERENCE OF RUSSIAN YOUNG RESEARCHERS IN ELECTRICAL AND ELECTRONIC ENGINEERING (EICONRUS), 2019, : 1605 - 1611
  • [32] VECTOR: Velocity-Enhanced GRU Neural Network for Real-Time 3D UAV Trajectory Prediction
    Nacar, Omer
    Abdelkader, Mohamed
    Ghouti, Lahouari
    Gabr, Kahled
    Al-Batati, Abdulrahman
    Koubaa, Anis
    DRONES, 2025, 9 (01)
  • [33] T-C3D: Temporal Convolutional 3D Network for Real-Time Action Recognition
    Liu, Kun
    Liu, Wu
    Gan, Chuang
    Tan, Mingkui
    Ma, Huadong
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 7138 - 7145
  • [34] Real-Time Detection of Events in Soccer Videos using 3D Convolutional Neural Networks
    Rongved, Olav A. Norgard
    Hicks, Steven A.
    Thambawita, Vajira
    Stensland, Hakon K.
    Zouganeli, Evi
    Johansen, Dag
    Riegler, Michael A.
    Halvorsen, Pal
    2020 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM 2020), 2020, : 135 - 144
  • [35] Real-time 2D to 3D video conversion
    Ideses, Ianir
    Yaroslavsky, Leonid P.
    Fishbain, Barak
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2007, 2 (01) : 3 - 9
  • [36] Real-time 2D to 3D video conversion
    Ianir Ideses
    Leonid P. Yaroslavsky
    Barak Fishbain
    Journal of Real-Time Image Processing, 2007, 2 : 3 - 9
  • [37] 3D Residual Convolutional Neural Network for Low Dose CT Denoising
    Zamyatin, Alex
    Yu, Leiming
    Rozas, David
    MEDICAL IMAGING 2022: PHYSICS OF MEDICAL IMAGING, 2022, 12031
  • [38] A real-time hourly ozone prediction system using deep convolutional neural network
    Ebrahim Eslami
    Yunsoo Choi
    Yannic Lops
    Alqamah Sayeed
    Neural Computing and Applications, 2020, 32 : 8783 - 8797
  • [39] Real-Time Prediction of Transarterial Drug Delivery Based on a Deep Convolutional Neural Network
    Yuan, Xin-Yi
    Hua, Yue
    Aubry, Nadine
    Zhussupbekov, Mansur
    Antaki, James F.
    Zhou, Zhi-Fu
    Peng, Jiang-Zhou
    APPLIED SCIENCES-BASEL, 2022, 12 (20):
  • [40] A real-time hourly ozone prediction system using deep convolutional neural network
    Eslami, Ebrahim
    Choi, Yunsoo
    Lops, Yannic
    Sayeed, Alqamah
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (13): : 8783 - 8797