3D visual saliency and convolutional neural network for blind mesh quality assessment

被引:22
|
作者
Abouelaziz, Ilyass [1 ]
Chetouani, Aladine [2 ]
El Hassouni, Mohammed [1 ,3 ]
Latecki, Longin Jan [4 ]
Cherifi, Hocine [5 ]
机构
[1] Mohammed V Univ Rabat, Fac Sci, LRIT, URAC 29, BP 1014 RP, Rabat, Morocco
[2] Univ Orleans, PRISME Lab, Orleans, France
[3] Mohammed V Univ Rabat, FLSHR, Rabat, Morocco
[4] Temple Univ, Dept Comp & Informat Sci, Philadelphia, PA 19122 USA
[5] Univ Burgundy, LE2I, UMR 6306, CNRS, Dijon, France
来源
NEURAL COMPUTING & APPLICATIONS | 2020年 / 32卷 / 21期
关键词
Mesh visual quality assessment; Mean opinion score; Mesh visual saliency; Convolutional neural network; METRICS; ERROR; COMPRESSION; MODEL;
D O I
10.1007/s00521-019-04521-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A number of full reference and reduced reference methods have been proposed in order to estimate the perceived visual quality of 3D meshes. However, in most practical situations, there is a limited access to the information related to the reference and the distortion type. For these reasons, the development of a no-reference mesh visual quality (MVQ) approach is a critical issue, and more emphasis needs to be devoted to blind methods. In this work, we propose a no-reference convolutional neural network (CNN) framework to estimate the perceived visual quality of 3D meshes. The method is called SCNN-BMQA (3D visual saliency and CNN for blind mesh quality assessment). The main contribution is the usage of a CNN and 3D visual saliency to estimate the perceived visual quality of distorted meshes. To do so, the CNN architecture is fed by small patches selected carefully according to their level of saliency. First, the visual saliency of the 3D mesh is computed. Afterward, we render 2D projections from the 3D mesh and its corresponding 3D saliency map. Then the obtained views are split into 2D small patches that pass through a saliency filter in order to select the most relevant patches. Finally, a CNN is used for the feature learning and the quality score estimation. Extensive experiments are conducted on four prominent MVQ assessment databases, including several tests to study the effect of the CNN parameters, the effect of visual saliency and comparison with existing methods. Results show that the trained CNN achieves good rates in terms of correlation with human judgment and outperforms the most effective state-of-the-art methods.
引用
收藏
页码:16589 / 16603
页数:15
相关论文
共 50 条
  • [31] Multiscale convolutional neural network for no-reference image quality assessment with saliency detection
    Fan, Xiaodong
    Wang, Yang
    Wang, Changzhong
    Chen, Xiangyue
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (29) : 42607 - 42619
  • [32] Multiscale convolutional neural network for no-reference image quality assessment with saliency detection
    Xiaodong Fan
    Yang Wang
    Changzhong Wang
    Xiangyue Chen
    Multimedia Tools and Applications, 2022, 81 : 42607 - 42619
  • [33] Real-Time Video Saliency Prediction Via 3D Residual Convolutional Neural Network
    Sun, Zhenhao
    Wang, Xu
    Zhang, Qiudan
    Jiang, Jianmin
    IEEE ACCESS, 2019, 7 : 147743 - 147754
  • [34] Stereoscopic video quality assessment based on 3D convolutional neural networks
    Yang, Jiachen
    Zhu, Yinghao
    Ma, Chaofan
    Lu, Wen
    Meng, Qinggang
    NEUROCOMPUTING, 2018, 309 : 83 - 93
  • [35] Blind Robust 3D Mesh Watermarking Based on Mesh Saliency and Wavelet Transform for Copyright Protection
    Hamidi, Mohamed
    Chetouani, Aladine
    El Haziti, Mohamed
    El Hassouni, Mohammed
    Cherifi, Hocine
    INFORMATION, 2019, 10 (02)
  • [36] A Convolutional Vector Network for 3D Mesh Object Recognition
    Qiu Q.
    Zhao J.
    Chen Y.
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2022, 35 (03): : 271 - 282
  • [37] C3DVQA: FULL-REFERENCE VIDEO QUALITY ASSESSMENT WITH 3D CONVOLUTIONAL NEURAL NETWORK
    Xu, Munan
    Chen, Junming
    Wang, Haiqiang
    Liu, Shan
    Li, Ge
    Bai, Zhiqiang
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 4447 - 4451
  • [38] Blind Image Quality Assessment with Visual Sensitivity Enhanced Dual-Channel Deep Convolutional Neural Network
    Zhang, Min
    Hou, Wenjing
    Zhang, Lei
    Feng, Jun
    2020 TWELFTH INTERNATIONAL CONFERENCE ON QUALITY OF MULTIMEDIA EXPERIENCE (QOMEX), 2020,
  • [39] Learning to Predict 3D Mesh Saliency
    ALfarasani, Dalia A.
    Sweetman, Thomas
    Lai, Yu-Kun
    Rosin, Paul L.
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 4023 - 4029
  • [40] Saliency Regions for 3D Mesh Abstraction
    Yang, Yu-Bin
    Lu, Tong
    Lin, Jin-Jie
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2009, 2009, 5879 : 292 - 299