Stereoscopic Visual Discomfort Prediction Using Multi-scale DCT Features

被引:5
|
作者
Zhou, Yang [1 ]
Yu, Wanli [1 ]
Li, Zhu [2 ,3 ]
Yin, Haibing [1 ]
机构
[1] Hangzhou Dianzi Univ, Hangzhou, Zhejiang, Peoples R China
[2] Univ Missouri, Kansas City, MO 64110 USA
[3] Pengcheng Labs, Shenzhen, Peoples R China
基金
中国国家自然科学基金;
关键词
Stereo/3D image; visual discomfort prediction; multi-scale DCT; disparity; random forest; IMAGES; INFORMATION; DISPARITY; COMFORT;
D O I
10.1145/3343031.3350848
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Prior approaches to the problem of visual discomfort prediction (VDP) for stereo/3D images are built for the uncompressed image. This paper presents a novel VDP method based on the compressed image by using multi-scale discrete cosine transform (MsDCT). Three types of visual discomfort features, including basic disparity intensity (BDI), disparity gradient energy (DGE) and disparity texture complexity (DTC), are extracted from two-dimensional (2-D) DCT coefficients. Additionally, a multi-scale transformation approach based on the different sizes of transform units is applied to obtain the multi-scale sub-features for each of the features. Then, through experimental comparison, a random forest regressor is chosen to fuse twenty-three sub-features to get the final objective prediction value of the S3D images. Experimental results conducted on two datasets show that the proposed method improves the prediction accuracy compared to those of recent S3D visual (dis)comfort predictors.
引用
收藏
页码:184 / 191
页数:8
相关论文
共 50 条
  • [41] Object tracking algorithm using multi-scale local texture features
    Wang, Shoujue
    Jiang, Yuwen
    Tan, Leyi
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2015, 27 (02): : 208 - 216
  • [42] Affective Image Classification Using Multi-scale Emotion Factorization Features
    Chang, Le
    Chen, Yufeng
    Li, Fengxia
    Sun, Meiling
    Yang, Chenguang
    2016 INTERNATIONAL CONFERENCE ON VIRTUAL REALITY AND VISUALIZATION (ICVRV 2016), 2016, : 170 - 174
  • [43] Texture segmentation using neural networks and multi-scale wavelet features
    Kim, TH
    Eom, IK
    Kim, YS
    ADVANCES IN NATURAL COMPUTATION, PT 2, PROCEEDINGS, 2005, 3611 : 400 - 409
  • [44] Endoscopic Image Retrieval System Using Multi-scale Image Features
    Chowdhury, Manish
    Kundu, Malay Kumar
    PERCEPTION AND MACHINE INTELLIGENCE, 2015, 2015, : 64 - 70
  • [45] Multi-Scale Saliency Using Local Gradient and Global Colour Features
    Cooley, Christopher
    Coleman, Sonya
    Gardiner, Bryan
    Scotney, Bryan
    2019 2ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND PATTERN RECOGNITION (AIPR 2019), 2019, : 28 - 32
  • [46] Learning Semantic Alignment Using Global Features and Multi-Scale Confidence
    Xu, Huaiyuan
    Liao, Jing
    Liu, Huaping
    Sun, Yuxiang
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (02) : 897 - 910
  • [47] Spectral analysis and recognition using multi-scale features and neural networks
    Jiang, YG
    Guo, P
    ADVANCES IN NEURAL NETWORKS - ISNN 2004, PT 2, 2004, 3174 : 369 - 374
  • [48] Learning Multi-Scale Features Using Dilated Convolution for Contour Detection
    Zhao, Haojun
    Lin, Chuan
    Li, Fuzhang
    Xie, Yongsheng
    Wu, Lingmei
    IEEE ACCESS, 2023, 11 : 64282 - 64293
  • [49] Multi-scale approach for the prediction of atomic scale properties
    Grisafi, Andrea
    Nigam, Jigyasa
    Ceriotti, Michele
    CHEMICAL SCIENCE, 2021, 12 (06) : 2078 - 2090
  • [50] Isolated Sign Language Recognition with Multi-scale Features using LSTM
    Mercanoglu Sincan, Ozge
    Tur, Anil Osman
    Yalim Keles, Hacer
    2019 27TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2019,