Stereoscopic image discomfort prediction using dual-stream multi-level interactive network

Cited by: 0
Authors
Zhou, Yang [1]
Chen, Pingan [1]
Yin, Haibing [1]
Huang, Xiaofeng [1]
Li, Zhu [2]
Affiliations
[1] Hangzhou Dianzi Univ, Sch Telecommun Engn, Hangzhou 310018, Peoples R China
[2] Univ Missouri, Dept Comp Sci & Elect Engn, Kansas City, MO 64110 USA
Funding
National Natural Science Foundation of China
Keywords
Stereoscopic image; Visual discomfort prediction; Dual-stream network; Blind quality assessment; Visual discomfort; Comfort; Disparity; Fatigue
DOI
10.1016/j.displa.2023.102444
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Subject classification code
0812
Abstract
Existing methods for predicting stereoscopic image discomfort may perform poorly: because the mechanism of human binocular vision is very complex, it is difficult to extract discomfort features from the statistical information of stereoscopic images. In this work, we propose a dual-stream multi-level interactive network for stereoscopic image discomfort prediction that is trainable completely end to end. The method first extracts multi-level fusion and difference features from stereoscopic images through a multi-level interaction network. The low-, medium-, and high-level feature maps are then concatenated to simulate the complicated visual interaction mechanism of the human visual system (HVS). Finally, two fully connected layers serve as a non-linear regression function that maps the feature vectors to stereoscopic image discomfort scores. Extensive experiments demonstrate that our approach performs favorably against existing prediction models on the IEEE-SA and NBU-S3D datasets.
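The data flow described in the abstract can be illustrated with a rough forward-pass sketch. This is not the authors' architecture: the abstract does not specify the layers, so everything here is an assumption made for illustration, including the 1x1 channel mixing that stands in for each convolutional stage, the use of element-wise sum and subtraction as the fusion and difference features, the global average pooling applied before concatenation, the layer widths, and the names `conv_stage`, `init_params`, and `predict_discomfort`. Weights are random, so the output is not a meaningful discomfort score.

```python
import numpy as np

rng = np.random.default_rng(0)


def conv_stage(x, w):
    """Stand-in for one convolutional stage (assumption, not the paper's layer):
    1x1 channel mixing, ReLU, then 2x2 average pooling.
    x has shape (C_in, H, W); w has shape (C_out, C_in)."""
    y = np.maximum(np.einsum("oc,chw->ohw", w, x), 0.0)
    c, h, wd = y.shape
    y = y[:, : (h // 2) * 2, : (wd // 2) * 2]
    return y.reshape(c, h // 2, 2, wd // 2, 2).mean(axis=(2, 4))


def init_params(in_channels=3, widths=(8, 16, 32), hidden=16):
    """Random weights for three levels (low, medium, high) and two FC layers.
    All sizes are hypothetical."""
    stages, c_in = [], in_channels
    for c_out in widths:
        stages.append(
            (rng.normal(0, 0.1, (c_out, c_in)), rng.normal(0, 0.1, (c_out, c_in)))
        )
        c_in = c_out
    feat_dim = 2 * sum(widths)  # fusion + difference vector per level
    return {
        "stages": stages,
        "fc1_w": rng.normal(0, 0.1, (hidden, feat_dim)),
        "fc1_b": np.zeros(hidden),
        "fc2_w": rng.normal(0, 0.1, (1, hidden)),
        "fc2_b": np.zeros(1),
    }


def predict_discomfort(left, right, params):
    """Map a left/right view pair to a single scalar score."""
    pooled = []
    l, r = left, right
    for w_left, w_right in params["stages"]:
        l = conv_stage(l, w_left)    # left-view stream
        r = conv_stage(r, w_right)   # right-view stream
        fusion = l + r               # assumed fusion feature
        difference = l - r           # assumed difference feature
        # Pool each map to a vector so levels of different size can be joined.
        pooled.append(fusion.mean(axis=(1, 2)))
        pooled.append(difference.mean(axis=(1, 2)))
    features = np.concatenate(pooled)  # low-, medium-, high-level features
    hidden = np.maximum(params["fc1_w"] @ features + params["fc1_b"], 0.0)
    score = params["fc2_w"] @ hidden + params["fc2_b"]  # second FC layer
    return float(score[0])


params = init_params()
left = rng.random((3, 32, 32))
right = rng.random((3, 32, 32))
score = predict_discomfort(left, right, params)
print(score)
```

In a trained model the weights would be learned end to end by regressing against subjective discomfort scores; the sketch only shows how the two streams, the per-level fusion and difference features, the concatenation, and the two-layer regressor fit together.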
Pages: 10
Related papers
50 in total
  • [41] DeepArtist: A Dual-Stream Network for Painter Classification of Highly-Varying Image Resolutions
    Nevo, Doron
    David, Eli O.
    Netanyahu, Nathan S.
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT IV, 2022, 13532 : 582 - 593
  • [42] Video frame prediction with dual-stream deep network emphasizing motions and content details
    Huang, Qingming
    Li, Zhongxiao
    Zheng, Liying
    Yang, Tianyi
    APPLIED SOFT COMPUTING, 2022, 125
  • [43] Dual-stream multi-label image classification model enhanced by feature reconstruction
    Hu, Liming
    Chen, Mingxuan
    Wang, Anjie
    Fang, Zhijun
    MULTIMEDIA SYSTEMS, 2024, 30 (05)
  • [44] Stereoscopic Visual Discomfort Prediction Using Multi-scale DCT Features
    Zhou, Yang
    Yu, Wanli
    Li, Zhu
    Yin, Haibing
    PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 184 - 191
  • [45] Dual-stream dynamic graph structure network for document-level relation extraction
    Zhong, Yu
    Shen, Bo
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2024, 36 (09)
  • [46] Multi-Scale Enhanced Dual-Stream Network for Facial Attribute Editing Localization
    Huang, Jinkun
    Luo, Weiqi
    Huang, Wenmin
    Xi, Ziyi
    Wei, Kangkang
    Huang, Jiwu
    DIGITAL FORENSICS AND WATERMARKING, IWDW 2023, 2024, 14511 : 151 - 165
  • [47] Dual-Stream Feature Fusion Network for Detection and ReID in Multi-object Tracking
    He, Qingyou
    Li, Liangqun
    PRICAI 2022: TRENDS IN ARTIFICIAL INTELLIGENCE, PT I, 2022, 13629 : 247 - 260
  • [48] Dual-Stream Convolutional Autoencoding Network for Hyperspectral Unmixing using Attention Mechanism
    Su Xiaotong
    Guo Baofeng
    You Jingyun
    Wu Wenhao
    Xu Zhangchi
    LASER & OPTOELECTRONICS PROGRESS, 2024, 61 (04)
  • [49] An Enhanced Dual-Stream Network Using Multi-Source Remote Sensing Imagery for Water Body Segmentation
    Zhang, Xiaoyong
    Geng, Miaomiao
    Yang, Xuan
    Li, Cong
    APPLIED SCIENCES-BASEL, 2024, 14 (01)
  • [50] A multi-level feature integration network for image inpainting
    Tao Chen
    Xin Zhang
    Bernd Hamann
    Dongjing Wang
    Hua Zhang
    Multimedia Tools and Applications, 2022, 81 : 38781 - 38802