Deep Visual Discomfort Predictor for Stereoscopic 3D Images

被引:13
|
作者
Oh, Heeseok [1 ]
Ahn, Sewoong [1 ]
Lee, Sanghoon [1 ]
Bovik, Alan Conrad [2 ]
机构
[1] Yonsei Univ, Dept Elect & Elect Engn, Seoul 120749, South Korea
[2] Univ Texas Austin, Dept Elect & Comp Engn, Lab Image & Video Engn, Austin, TX 78712 USA
基金
新加坡国家研究基金会;
关键词
Stereoscopic; 3D; visual discomfort prediction; convolutional neural network; proxy ground-truth label; QUALITY ASSESSMENT; HORIZONTAL DISPARITY; NEURAL-NETWORKS; COMFORT; STATISTICS; VERGENCE; FATIGUE; MT;
D O I
10.1109/TIP.2018.2851670
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most prior approaches to the problem of stereoscopic 3D (S3D) visual discomfort prediction (VDP) have focused on the extraction of perceptually meaningful handcrafted features based on models of visual perception and of natural depth statistics. Toward advancing performance on this problem, we have developed a deep learning-based VDP model named deep visual discomfort predictor (DeepVDP). The DeepVDP uses a convolutional neural network (CNN) to learn features that are highly predictive of experienced visual discomfort. Since a large amount of reference data is needed to train a CNN, we develop a systematic way of dividing the S3D image into local regions defined as patches and model a patch-based CNN using two sequential training steps. Since it is very difficult to obtain human opinions on each patch, instead a proxy ground-truth label that is generated by an existing S3D visual discomfort prediction algorithm called 3D-VDP is assigned to each patch. These proxy ground-truth labels are used to conduct the first stage of training the CNN. In the second stage, the automatically learned local abstractions are aggregated into global features via a feature aggregation layer. The learned features are iteratively updated via supervised learning on subjective 3D discomfort scores, which serve as ground-truth labels on each S3D image. The patch-based CNN model that has been pretrained on proxy ground-truth labels is subsequently retrained on true global subjective scores. The global S3D visual discomfort scores predicted by the trained DeepVDP model achieve the state-of-the-art performance as compared with previous VDP algorithms.
引用
收藏
页码:5420 / 5432
页数:13
相关论文
共 50 条
  • [1] Visual discomfort prediction on stereoscopic 3D images without explicit disparities
    Chen, Jianyu
    Zhou, Jun
    Sun, Jun
    Bovik, Alan Conrad
    [J]. SIGNAL PROCESSING-IMAGE COMMUNICATION, 2017, 51 : 50 - 60
  • [2] VISUAL COMFORT ASSESSMENT FOR STEREOSCOPIC 3D IMAGES BASED ON SALIENT DISCOMFORT REGIONS
    Jung, Cheolkon
    Liu, Hongmin
    Cui, Yu
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 4047 - 4051
  • [3] Visual discomfort of stereoscopic 3D videos: Influence of 3D motion
    Li, Jing
    Barkowsky, Marcus
    Le Callet, Patrick
    [J]. DISPLAYS, 2014, 35 (01) : 49 - 57
  • [4] Methods for reducing visual discomfort in stereoscopic 3D: A review
    Terzic, Kasim
    Hansard, Miles
    [J]. SIGNAL PROCESSING-IMAGE COMMUNICATION, 2016, 47 : 402 - 416
  • [5] Heterophoria changes, visual discomfort and 3D stereoscopic displays
    Kalpicka, Edyta
    Howarth, Peter A.
    [J]. Current and Future Perspectives of Virtual Reality, Augmented Reality and Mixed Reality: Industrial and Poster Track, Joint VR Conference of euroVR and EGVE, 2011, 2011, 269 : 89 - 90
  • [6] Visual 3D motion acuity predicts discomfort in 3D stereoscopic environments
    Allen, Brian
    Hanley, Taylor
    Rokers, Bas
    Green, C. Shawn
    [J]. ENTERTAINMENT COMPUTING, 2016, 13 : 1 - 9
  • [7] 3D saliency guided deep quality predictor for no-reference stereoscopic images
    Messai, Oussama
    Chetouani, Aladine
    Hachouf, Fella
    Seghir, Zianou Ahmed
    [J]. NEUROCOMPUTING, 2022, 478 : 22 - 36
  • [8] Visual discomfort relaxation for stereoscopic 3D images by adjusting zero-disparity plane for projection
    Shao, Feng
    Li, Zhutuan
    Jiang, Qiuping
    Jiang, Gangyi
    Yu, Mei
    Peng, Zongju
    [J]. DISPLAYS, 2015, 39 : 125 - 132
  • [9] Perceptual Watermarking for Stereoscopic 3D Image Based on Visual Discomfort
    Ji, Sang-Keun
    Kang, Ji-Hyeon
    Lee, Heung-Kyu
    [J]. INFORMATION SCIENCE AND APPLICATIONS 2017, ICISA 2017, 2017, 424 : 323 - 330
  • [10] Eye movements and visual discomfort when viewing stereoscopic 3D content
    Zhou, Jun
    Wang, Ling
    Yin, Haibing
    Bovik, Alan C.
    [J]. DIGITAL SIGNAL PROCESSING, 2019, 91 : 41 - 53