Towards explainable deep visual saliency models

被引:2
|
作者
Malladi, Sai Phani Kumar [1 ]
Mukherjee, Jayanta [2 ]
Larabi, Mohamed-Chaker [3 ]
Chaudhury, Santanu [4 ]
机构
[1] Indian Inst Technol, Adv Technol Dev Ctr, Kharagpur, India
[2] IIT Kharagpur, Dept Comp Sci & Engn, Kharagpur, India
[3] Univ Poitiers, XLIM UMR CNRS 7252, Poitiers, France
[4] IIT Jodhpur, Dept Comp Sci & Engn, Jodhpur, India
关键词
Explainable saliency; Human perception; Log-Gabor filters; Color perception; ATTENTION;
D O I
10.1016/j.cviu.2023.103782
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep neural networks have shown their profound impact on achieving human-level performance in visual saliency prediction. However, it is still unclear how they learn their task and what it means in terms of understanding human visual system. In this work, we propose a framework to derive explainable saliency models from their corresponding deep architectures. Mainly, we explain a deep saliency model by under-standing its four different aspects: (1) intermediate activation maps of deep layers, (2) biologically plausible Log-Gabor (LG) filters for salient region identification, (3) positional biased behavior of Log-Gabor filters and (4) processing of color information by establishing a relevance with human visual system. We consider four state-of-the-art (SOTA) deep saliency models, namely CMRNet, UNISAL, DeepGaze IIE, and MSI-Net for their interpretation using our proposed framework. We observe that explainable models perform way better than the classical SOTA models. We also find that CMRNet transforms the input RGB space to a representation after the input layer, which is very close to YUV space of a color image. Then, we discuss about the biological consideration and relevance of our framework for its possible anatomical substratum of visual attention. We find a good correlation between components of HVS and the base operations of the proposed technique. Hence, we say that this generic explainable framework provides a new perspective to see relationship between classical methods/human visual system and DNN based ones.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Understanding and Visualizing Deep Visual Saliency Models
    He, Sen
    Tavakoli, Hamed R.
    Borji, Ali
    Mi, Yang
    Pugeault, Nicolas
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 10198 - 10207
  • [2] Spatiotemporal Saliency: Towards a Hierarchical Representation of Visual Saliency
    Bruce, Neil D. B.
    Tsotsos, John K.
    ATTENTION IN COGNITIVE SYSTEMS, 2009, 5395 : 98 - +
  • [3] What Do Deep Saliency Models Learn about Visual Attention?
    Chen, Shi
    Jiang, Ming
    Zhao, Qi
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [4] Towards better explainable deep learning models for embryo selection in ART
    Sharma, A.
    Haugen, T.
    Hammer, H.
    Rieleger, M.
    Stensen, M.
    HUMAN REPRODUCTION, 2021, 36 : 253 - 254
  • [5] Face Saliency in Various Human Visual Saliency Models
    Sharma, Puneet
    Cheikh, Faouzi Alaya
    Hardeberg, Jon Yvgve
    2009 PROCEEDINGS OF 6TH INTERNATIONAL SYMPOSIUM ON IMAGE AND SIGNAL PROCESSING AND ANALYSIS (ISPA 2009), 2009, : 337 - 342
  • [6] Towards Explainable Visual Emotion Understanding
    Zhang, Yue
    Ding, Wanying
    Xu, Ran
    Hu, Xiaohua
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 1155 - 1162
  • [7] Does Audio help in deep Audio-Visual Saliency prediction models?
    Agrawal, Ritvik
    Jyoti, Shreyank
    Girmaji, Rohit
    Sivaprasad, Sarath
    Gandhi, Vineet
    PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, ICMI 2022, 2022, : 48 - 56
  • [8] Saliency-driven explainable deep learning in medical imaging: bridging visual explainability and statistical quantitative analysis
    Brima, Yusuf
    Atemkeng, Marcellin
    BIODATA MINING, 2024, 17 (01):
  • [9] Deep Visual Saliency on Stereoscopic Images
    Anh-Duc Nguyen
    Kim, Jongyoo
    Oh, Heeseok
    Kim, Haksub
    Lin, Weisi
    Lee, Sanghoon
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (04) : 1939 - 1953
  • [10] Visual Analytics for Explainable Deep Learning
    Choo, Jaegul
    Liu, Shixia
    IEEE COMPUTER GRAPHICS AND APPLICATIONS, 2018, 38 (04) : 84 - 92