Stereoscopic Visual Discomfort Prediction Using Multi-scale DCT Features

Cited by: 5
Authors
Zhou, Yang [1]
Yu, Wanli [1]
Li, Zhu [2, 3]
Yin, Haibing [1]
Affiliations
[1] Hangzhou Dianzi Univ, Hangzhou, Zhejiang, Peoples R China
[2] Univ Missouri, Kansas City, MO 64110 USA
[3] Pengcheng Labs, Shenzhen, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Stereo/3D image; visual discomfort prediction; multi-scale DCT; disparity; random forest; IMAGES; INFORMATION; DISPARITY; COMFORT;
DOI
10.1145/3343031.3350848
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
Prior approaches to visual discomfort prediction (VDP) for stereoscopic 3D (S3D) images operate on uncompressed images. This paper presents a novel VDP method that works on the compressed image by using a multi-scale discrete cosine transform (MsDCT). Three types of visual discomfort features, namely basic disparity intensity (BDI), disparity gradient energy (DGE), and disparity texture complexity (DTC), are extracted from two-dimensional (2-D) DCT coefficients. A multi-scale transformation based on different transform-unit sizes is then applied to obtain multi-scale sub-features for each feature type. After an experimental comparison of regressors, a random forest regressor is chosen to fuse the twenty-three sub-features into the final objective prediction value for an S3D image. Experiments on two datasets show that the proposed method achieves higher prediction accuracy than recent S3D visual (dis)comfort predictors.
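The abstract does not give the feature formulas, so the following is only a minimal sketch of the general idea: take block-wise 2-D DCTs of a disparity map at several block sizes and summarize the coefficients per scale. Everything specific here is an assumption made for illustration, not the paper's definition: the function names, the block sizes (4, 8, 16), and the three statistics, which are simple stand-ins for BDI (absolute block mean from the DC coefficient), DGE (total AC energy), and DTC (share of AC energy in high-frequency coefficients). It yields nine values, not the paper's twenty-three sub-features.

```python
import math

def dct_matrix(n):
    """Orthonormal DCT-II basis matrix of size n x n."""
    rows = []
    for k in range(n):
        scale = math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)
        rows.append([scale * math.cos(math.pi * (2 * i + 1) * k / (2 * n))
                     for i in range(n)])
    return rows

def matmul(a, b):
    """Plain matrix product of two lists of lists."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]

def dct2(block):
    """2-D DCT-II of a square block, computed as C @ block @ C^T."""
    c = dct_matrix(len(block))
    c_t = [list(col) for col in zip(*c)]
    return matmul(matmul(c, block), c_t)

def block_features(disparity, size):
    """Average three illustrative statistics over all size x size blocks.

    These are hypothetical stand-ins, not the paper's BDI/DGE/DTC formulas.
    """
    h, w = len(disparity), len(disparity[0])
    intensity = energy = complexity = 0.0
    count = 0
    for r in range(0, h - size + 1, size):
        for c in range(0, w - size + 1, size):
            block = [row[c:c + size] for row in disparity[r:r + size]]
            coef = dct2(block)
            # With an orthonormal DCT, DC / size equals the block mean.
            intensity += abs(coef[0][0]) / size
            ac = sum(coef[u][v] ** 2 for u in range(size)
                     for v in range(size) if (u, v) != (0, 0))
            hf = sum(coef[u][v] ** 2 for u in range(size)
                     for v in range(size) if u + v >= size)
            energy += ac
            complexity += hf / ac if ac > 1e-12 else 0.0
            count += 1
    return [intensity / count, energy / count, complexity / count]

def multi_scale_features(disparity, sizes=(4, 8, 16)):
    """Concatenate the per-scale statistics into one feature vector."""
    features = []
    for size in sizes:
        features.extend(block_features(disparity, size))
    return features

if __name__ == "__main__":
    # Synthetic 16 x 16 disparity map: a horizontal ramp.
    ramp = [[float(c) for c in range(16)] for _ in range(16)]
    print(multi_scale_features(ramp))
```

A vector like this, computed per image, could then be passed to a regressor such as scikit-learn's `RandomForestRegressor` trained on subjective discomfort scores, which is the role the abstract assigns to the random forest.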
Pages: 184-191
Page count: 8