Stereoscopic Visual Discomfort Prediction Using Multi-scale DCT Features

Cited by: 5
Authors
Zhou, Yang [1]
Yu, Wanli [1]
Li, Zhu [2, 3]
Yin, Haibing [1]
Affiliations
[1] Hangzhou Dianzi Univ, Hangzhou, Zhejiang, Peoples R China
[2] Univ Missouri, Kansas City, MO 64110 USA
[3] Pengcheng Labs, Shenzhen, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Stereo/3D image; visual discomfort prediction; multi-scale DCT; disparity; random forest; IMAGES; INFORMATION; DISPARITY; COMFORT;
DOI
10.1145/3343031.3350848
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
Prior approaches to visual discomfort prediction (VDP) for stereoscopic 3D (S3D) images are built for uncompressed images. This paper presents a novel VDP method that operates on the compressed image by using a multi-scale discrete cosine transform (MsDCT). Three types of visual discomfort features, namely basic disparity intensity (BDI), disparity gradient energy (DGE), and disparity texture complexity (DTC), are extracted from two-dimensional (2-D) DCT coefficients. A multi-scale transformation approach based on different transform-unit sizes is then applied to obtain multi-scale sub-features for each feature. Based on an experimental comparison, a random forest regressor is chosen to fuse the twenty-three sub-features into the final objective prediction value for an S3D image. Experiments on two datasets show that the proposed method achieves higher prediction accuracy than recent S3D visual (dis)comfort predictors.
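The multi-scale transform step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not define BDI, DGE, or DTC, so the two statistics below (mean absolute DC coefficient and mean AC energy per block) are stand-in assumptions, as are the function names `dct2` and `multiscale_features` and the block sizes 4, 8, and 16. The random forest fusion stage is not shown.

```python
import math


def dct2(block):
    """Orthonormal 2-D DCT-II of a square block given as a list of rows."""
    n = len(block)
    # Basis matrix: c[k][i] = a(k) * cos(pi * (2i + 1) * k / (2n))
    c = [[(math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n))
          * math.cos(math.pi * (2 * i + 1) * k / (2 * n))
          for i in range(n)] for k in range(n)]
    # Separable transform: Y = C * X * C^T
    tmp = [[sum(c[k][i] * block[i][j] for i in range(n)) for j in range(n)]
           for k in range(n)]
    return [[sum(tmp[k][j] * c[l][j] for j in range(n)) for l in range(n)]
            for k in range(n)]


def multiscale_features(disparity, sizes=(4, 8, 16)):
    """Return {block_size: (mean |DC|, mean AC energy)} per scale.

    Both statistics are hypothetical stand-ins, not the paper's features.
    Blocks are non-overlapping; partial blocks at the borders are skipped.
    """
    h, w = len(disparity), len(disparity[0])
    out = {}
    for n in sizes:
        dc_sum = ac_sum = 0.0
        count = 0
        for r in range(0, h - n + 1, n):
            for col in range(0, w - n + 1, n):
                block = [row[col:col + n] for row in disparity[r:r + n]]
                coeffs = dct2(block)
                dc = coeffs[0][0]
                total = sum(v * v for row in coeffs for v in row)
                dc_sum += abs(dc)
                ac_sum += total - dc * dc  # energy outside the DC term
                count += 1
        if count:  # skip scales larger than the map
            out[n] = (dc_sum / count, ac_sum / count)
    return out


# Toy disparity maps (assumed data): a flat plane and a horizontal ramp.
flat = [[2.0] * 16 for _ in range(16)]
ramp = [[float(c) for c in range(16)] for _ in range(16)]
flat_feats = multiscale_features(flat)
ramp_feats = multiscale_features(ramp)
```

In this sketch a flat disparity map yields essentially zero AC energy at every scale, while a ramp yields AC energy that grows with block size, which is the kind of scale-dependent variation a multi-scale feature set is meant to capture.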
Pages: 184-191
Page count: 8