Stereoscopic image discomfort prediction using dual-stream multi-level interactive network

Authors
Zhou, Yang [1 ]
Chen, Pingan [1 ]
Yin, Haibing [1 ]
Huang, Xiaofeng [1 ]
Li, Zhu [2 ]
Affiliations
[1] Hangzhou Dianzi Univ, Sch Telecommun Engn, Hangzhou 310018, Peoples R China
[2] Univ Missouri, Dept Comp Sci & Elect Engn, Kansas City, MO 64110 USA
Funding
National Natural Science Foundation of China;
Keywords
Stereoscopic image; Visual discomfort prediction; Dual-stream network; BLIND QUALITY ASSESSMENT; VISUAL DISCOMFORT; COMFORT; DISPARITY; FATIGUE;
DOI
10.1016/j.displa.2023.102444
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
Existing stereoscopic image discomfort prediction methods may fail to work well because it is difficult to extract discomfort features from a stereoscopic image's statistical information, given that the mechanism of human binocular vision is very complex. In this work, we propose a dual-stream multi-level interactive network for stereoscopic image discomfort prediction that is fully end-to-end trainable. The method first extracts multi-level fusion and difference features from the stereoscopic image pair through a multi-level interaction network. Then, the low-, medium-, and high-level feature maps are concatenated to simulate the complicated visual interaction mechanism of the human visual system (HVS). Finally, two fully connected layers serve as a non-linear regression function that maps the feature vector to a stereoscopic image discomfort score. Extensive experiments demonstrate that our approach performs favorably against existing prediction models on the IEEE-SA and NBU-S3D datasets.
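The pipeline described in the abstract (two streams, per-level fusion and difference features, concatenation across levels, two fully connected layers) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the pooling-plus-tanh feature extractor, the use of a sum for fusion, the global averaging, the layer sizes, and every function name below are assumptions chosen only to keep the example small and runnable.

```python
import numpy as np

def avg_pool2(x):
    """2x2 average pooling on an (H, W) array; H and W must be even."""
    h, w = x.shape
    return x.reshape(h // 2, 2, w // 2, 2).mean(axis=(1, 3))

def stream_levels(view, n_levels=3):
    """Hypothetical stand-in for one stream: one feature map per level
    (low, medium, high). A real network would use learned convolutions."""
    maps, x = [], view
    for _ in range(n_levels):
        x = np.tanh(avg_pool2(x))
        maps.append(x)
    return maps

def interact(left_maps, right_maps):
    """Per-level fusion (assumed: sum) and difference features, each
    globally averaged, then concatenated across all levels."""
    feats = []
    for l, r in zip(left_maps, right_maps):
        feats.append((l + r).mean())  # fusion feature
        feats.append((l - r).mean())  # difference feature
    return np.array(feats)

def init_params(rng, in_dim=6, hidden=8):
    """Random, untrained weights for the two fully connected layers."""
    w1 = rng.standard_normal((hidden, in_dim)) * 0.1
    w2 = rng.standard_normal(hidden) * 0.1
    return w1, np.zeros(hidden), w2, 0.0

def predict_discomfort(left, right, params):
    """Two fully connected layers mapping the feature vector to a score."""
    w1, b1, w2, b2 = params
    f = interact(stream_levels(left), stream_levels(right))
    h = np.maximum(0.0, w1 @ f + b1)  # first FC layer with ReLU
    return float(w2 @ h + b2)         # second FC layer, scalar output

rng = np.random.default_rng(0)
left_view = rng.random((32, 32))   # synthetic left image
right_view = rng.random((32, 32))  # synthetic right image
score = predict_discomfort(left_view, right_view, init_params(rng))
```

With untrained weights the returned score is meaningless; the sketch only shows how the left and right views flow through the two streams into a single regression output.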
Pages: 10