Stereoscopic Visual Discomfort Prediction Using Multi-scale DCT Features

被引:5
|
作者
Zhou, Yang [1 ]
Yu, Wanli [1 ]
Li, Zhu [2 ,3 ]
Yin, Haibing [1 ]
机构
[1] Hangzhou Dianzi Univ, Hangzhou, Zhejiang, Peoples R China
[2] Univ Missouri, Kansas City, MO 64110 USA
[3] Pengcheng Labs, Shenzhen, Peoples R China
基金
中国国家自然科学基金;
关键词
Stereo/3D image; visual discomfort prediction; multi-scale DCT; disparity; random forest; IMAGES; INFORMATION; DISPARITY; COMFORT;
D O I
10.1145/3343031.3350848
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Prior approaches to the problem of visual discomfort prediction (VDP) for stereo/3D images are built for the uncompressed image. This paper presents a novel VDP method based on the compressed image by using multi-scale discrete cosine transform (MsDCT). Three types of visual discomfort features, including basic disparity intensity (BDI), disparity gradient energy (DGE) and disparity texture complexity (DTC), are extracted from two-dimensional (2-D) DCT coefficients. Additionally, a multi-scale transformation approach based on the different sizes of transform units is applied to obtain the multi-scale sub-features for each of the features. Then, through experimental comparison, a random forest regressor is chosen to fuse twenty-three sub-features to get the final objective prediction value of the S3D images. Experimental results conducted on two datasets show that the proposed method improves the prediction accuracy compared to those of recent S3D visual (dis)comfort predictors.
引用
收藏
页码:184 / 191
页数:8
相关论文
共 50 条
  • [1] Experimental investigation of discomfort combination: toward visual discomfort prediction for stereoscopic videos
    Lee, Seong-il
    Jung, Yong Ju
    Sohn, Hosik
    Ro, Yong Man
    JOURNAL OF ELECTRONIC IMAGING, 2014, 23 (01)
  • [2] Multi-scale Spectrum Visual Saliency Perception via Hypercomplex DCT
    Xiao, Limei
    Li, Ce
    Hu, Zhijia
    Pan, Zhengrong
    INTELLIGENT COMPUTING THEORIES AND APPLICATION, ICIC 2016, PT II, 2016, 9772 : 645 - 655
  • [3] Visual saliency prediction using multi-scale attention gated network
    Yubao Sun
    Mengyang Zhao
    Kai Hu
    Shaojing Fan
    Multimedia Systems, 2022, 28 : 131 - 139
  • [4] Visual saliency prediction using multi-scale attention gated network
    Sun, Yubao
    Zhao, Mengyang
    Hu, Kai
    Fan, Shaojing
    MULTIMEDIA SYSTEMS, 2022, 28 (01) : 131 - 139
  • [5] Investigation of Object Thickness for Visual Discomfort Prediction in Stereoscopic Images
    Sohn, Hosik
    Jung, Yong Ju
    Lee, Seong-il
    Park, Hyun Wook
    Ro, Yong Man
    STEREOSCOPIC DISPLAYS AND APPLICATIONS XXIII, 2012, 8288
  • [6] Hierarchical multi-scale stereoscopic image quality assessment based on visual mechanism
    Yongli Chang
    Sumei Li
    Ping Zhao
    Signal, Image and Video Processing, 2022, 16 : 1177 - 1185
  • [7] Hierarchical multi-scale stereoscopic image quality assessment based on visual mechanism
    Chang, Yongli
    Li, Sumei
    Zhao, Ping
    SIGNAL IMAGE AND VIDEO PROCESSING, 2022, 16 (05) : 1177 - 1185
  • [8] Navigation and interaction in a multi-scale stereoscopic environment
    Houtgast, E
    Pfeiffer, O
    Wartell, Z
    Ribarsky, W
    Post, F
    IEEE VIRTUAL REALITY 2005, CONFERENCE PROCEEDINGS, 2005, : 275 - 276
  • [9] Attention Fusion for Audio-Visual Person Verification Using Multi-Scale Features
    Hoermann, Stefan
    Moiz, Abdul
    Knoche, Martin
    Rigoll, Gerhard
    2020 15TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2020), 2020, : 281 - 285
  • [10] Visual comfort assessment for stereoscopic images based on sparse coding with multi-scale dictionaries
    Jiang, Qiuping
    Shao, Feng
    Jiang, Gangyi
    Yu, Mei
    Peng, Zongju
    NEUROCOMPUTING, 2017, 252 : 77 - 86