Video Saliency Detection Using Deep Convolutional Neural Networks

Cited by: 2
Authors
Zhou, Xiaofei [1 ,2 ,3 ]
Liu, Zhi [2 ,3 ]
Gong, Chen [4 ]
Li, Gongyang [2 ,3 ]
Huang, Mengke [2 ,3 ]
Affiliations
[1] Hangzhou Dianzi Univ, Inst Informat & Control, Hangzhou, Peoples R China
[2] Shanghai Univ, Shanghai Inst Adv Commun & Data Sci, Shanghai, Peoples R China
[3] Shanghai Univ, Sch Commun & Informat Engn, Shanghai, Peoples R China
[4] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Minist Educ, Key Lab Intelligent Percept & Syst High Dimens In, Nanjing, Peoples R China
Source
PATTERN RECOGNITION AND COMPUTER VISION, PT II | 2018 / Vol. 11257
Funding
National Natural Science Foundation of China;
Keywords
Video saliency; Convolutional neural networks; Feature aggregation; VISUAL-ATTENTION; SEGMENTATION; IMAGE; MODEL;
DOI
10.1007/978-3-030-03335-4_27
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Numerous deep learning based efforts have been devoted to image saliency detection, so it is natural to build a video saliency model on top of these image saliency models. Moreover, because the number of training videos is limited, existing video saliency models are trained with large-scale synthetic video data. In this paper, we construct a video saliency model based on an existing image saliency model and train it on the limited video data. Concretely, our video saliency model consists of three steps: feature extraction, feature aggregation, and spatial refinement. First, the concatenation of the current frame and its optical flow image is fed into the feature extraction network, yielding feature maps. Then, a tensor consisting of the generated feature maps and the original information (the current frame and the optical flow image) is passed to the aggregation network, where the original information provides complementary cues for aggregation. Finally, to obtain a high-quality saliency map with well-defined boundaries, the output of the aggregation network and the current frame are used to perform spatial refinement, yielding the final saliency map for the current frame. Extensive qualitative and quantitative experiments on two challenging video datasets show that the proposed model consistently outperforms state-of-the-art saliency models for detecting salient objects in videos.
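The three-step data flow described in the abstract can be sketched as follows. This is only an illustrative sketch, not the authors' implementation: the function names, the channel counts (a 3-channel frame, a 3-channel optical flow image, 8 feature maps), and the random 1x1 convolution standing in for each trained network are all assumptions made for the example. It shows only which tensors are concatenated at each step.

```python
import numpy as np

rng = np.random.default_rng(0)


def placeholder_net(x, out_channels):
    """Hypothetical stand-in for a trained CNN: random 1x1 convolution + sigmoid.

    It keeps the (channels, H, W) layout so the data flow can be followed.
    """
    weights = rng.standard_normal((out_channels, x.shape[0]))
    y = np.tensordot(weights, x, axes=([1], [0]))  # -> (out_channels, H, W)
    return 1.0 / (1.0 + np.exp(-y))


def video_saliency(frame, flow_image, feat_channels=8):
    # Step 1: feature extraction on the concatenated frame and optical flow image.
    extraction_input = np.concatenate([frame, flow_image], axis=0)
    features = placeholder_net(extraction_input, feat_channels)

    # Step 2: aggregation over the feature maps plus the original inputs.
    aggregation_input = np.concatenate([features, frame, flow_image], axis=0)
    coarse = placeholder_net(aggregation_input, 1)

    # Step 3: spatial refinement from the aggregation output and the current frame.
    refinement_input = np.concatenate([coarse, frame], axis=0)
    refined = placeholder_net(refinement_input, 1)
    return refined[0]  # single-channel saliency map, shape (H, W)


# Assumed toy inputs: channel-first arrays with values in [0, 1].
frame = rng.random((3, 32, 48))
flow_image = rng.random((3, 32, 48))
saliency = video_saliency(frame, flow_image)
print(saliency.shape)
```

In a real model each `placeholder_net` call would be a separately trained convolutional network; the sketch keeps only the concatenation pattern that the abstract states.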
Pages: 308-319
Page count: 12
Related papers
50 in total
  • [21] Prostate Cancer Detection using Deep Convolutional Neural Networks
    Sunghwan Yoo
    Isha Gujrathi
    Masoom A. Haider
    Farzad Khalvati
    Scientific Reports, 9
  • [22] Android Malware Detection using Convolutional Deep Neural Networks
    Bourebaa, Fatima
    Benmohammed, Mohamed
    2020 4TH INTERNATIONAL CONFERENCE ON ADVANCED ASPECTS OF SOFTWARE ENGINEERING (ICAASE'2020): 4TH INTERNATIONAL CONFERENCE ON ADVANCED ASPECTS OF SOFTWARE ENGINEERING, 2020, : 52 - 58
  • [23] Railway Joint Detection Using Deep Convolutional Neural Networks
    Sun, Yanmin
    Liu, Yan
    Yang, Chunsheng
    2019 IEEE 15TH INTERNATIONAL CONFERENCE ON AUTOMATION SCIENCE AND ENGINEERING (CASE), 2019, : 235 - 240
  • [24] Angiodysplasia detection and localization using deep convolutional neural networks
    Shvets, Alexey A.
    Iglovikov, Vladimir I.
    Rakhlin, Alexander
    Kalinin, Alexandr A.
    2018 17TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2018, : 612 - 617
  • [25] Prostate Cancer Detection using Deep Convolutional Neural Networks
    Yoo, Sunghwan
    Gujrathi, Isha
    Haider, Masoom A.
    Khalvati, Farzad
    SCIENTIFIC REPORTS, 2019, 9 (1)
  • [26] Eggshell crack detection using deep convolutional neural networks
    Botta, Bhavya
    Gattam, Sai Swaroop Reddy
    Datta, Ashis Kumar
    JOURNAL OF FOOD ENGINEERING, 2022, 315
  • [27] Face Detection in Painting Using Deep Convolutional Neural Networks
    Mzoughi, Olfa
    Bigand, Andre
    Renaud, Christophe
    ADVANCED CONCEPTS FOR INTELLIGENT VISION SYSTEMS, ACIVS 2018, 2018, 11182 : 333 - 341
  • [28] Structural crack detection using deep convolutional neural networks
    Ali, Raza
    Chuah, Joon Huang
    Abu Talip, Mohamad Sofian
    Mokhtar, Norrima
    Shoaib, Muhammad Ali
    AUTOMATION IN CONSTRUCTION, 2022, 133
  • [29] Face Occlusion Detection Using Deep Convolutional Neural Networks
    Xia, Yizhang
    Zhang, Bailing
    Coenen, Frans
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2016, 30 (09)
  • [30] "Texting & Driving" Detection Using Deep Convolutional Neural Networks
    Maria Celaya-Padilla, Jose
    Eric Galvan-Tejada, Carlos
    Anaid Lozano-Aguilar, Joyce Selene
    Alejandra Zanella-Calzada, Laura
    Luna-Garcia, Huizilopoztli
    Issac Galvan-Tejada, Jorge
    Karina Gamboa-Rosales, Nadia
    Velez Rodriguez, Alberto
    Gamboa-Rosales, Hamurabi
    APPLIED SCIENCES-BASEL, 2019, 9 (15)