ResMem-Net: memory based deep CNN for image memorability estimation

被引:2
|
作者
Praveen, Arockia [1 ]
Noorwali, Abdulfattah [2 ]
Samiayya, Duraimurugan [3 ]
Khan, Mohammad Zubair [4 ]
Vincent, Durai Raj P. M. [5 ]
Bashir, Ali Kashif [6 ]
Alagupandi, Vinoth [3 ]
机构
[1] Phosphene AI, Madurai, Tamil Nadu, India
[2] Umm Al Qura Univ, Mecca, Saudi Arabia
[3] Optisol Business Solut, Chennai, Tamil Nadu, India
[4] Taibah Univ, Dept Comp Sci, Medina, Saudi Arabia
[5] Vellore Inst Technol, Sch Informat Technol & Engn, Vellore, Tamil Nadu, India
[6] Manchester Metropolitan Univ, Manchester, Lancs, England
关键词
Deep Learning; Image Memorability; Visual Emotions; Saliency; Object Interestingness;
D O I
10.7717/peerj-cs.767
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Image memorability is a very hard problem in image processing due to its subjective nature. But due to the introduction of Deep Learning and the large availability of data and GPUs, great strides have been made in predicting the memorability of an image. In this paper, we propose a novel deep learning architecture called ResMem-Net that is a hybrid of LSTM and CNN that uses information from the hidden layers of the CNN to compute the memorability score of an image. The intermediate layers are important for predicting the output because they contain information about the intrinsic properties of the image. The proposed architecture automatically learns visual emotions and saliency, shown by the heatmaps generated using the GradRAM technique. We have also used the heatmaps and results to analyze and answer one of the most important questions in image memorability: "What makes an image memorable?''. The model is trained and evaluated using the publicly available Large-scale Image Memorability dataset (LaMem) from MIT. The results show that the model achieves a rank correlation of 0.679 and a mean squared error of 0.011, which is better than the current state-of-the-art models and is close to human consistency (p = 0.68). The proposed architecture also has a significantly low number of parameters compared to the state-of-the-art architecture, making it memory efficient and suitable for production.
引用
收藏
页码:1 / 27
页数:27
相关论文
共 50 条
  • [1] Multiple instance learning based deep CNN for image memorability prediction
    Basavaraju, Sathisha
    Sur, Arijit
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (24) : 35511 - 35535
  • [2] Multiple instance learning based deep CNN for image memorability prediction
    Sathisha Basavaraju
    Arijit Sur
    [J]. Multimedia Tools and Applications, 2019, 78 : 35511 - 35535
  • [3] Deep dilated CNN based image denoising
    Chaurasiya R.
    Ganotra D.
    [J]. International Journal of Information Technology, 2023, 15 (1) : 137 - 148
  • [4] Grading Image Retrieval based on CNN Deep Features
    Luo, Y. W.
    Li, Y.
    Han, F. J.
    Huang, S. B.
    [J]. 2018 20TH INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION TECHNOLOGY (ICACT), 2018, : 148 - 152
  • [5] MU-Net: Deep Learning-based Thermal IR Image Estimation from RGB Image
    Iwashita, Yumi
    Nakashima, Kazuto
    Rata, Sir
    Stoica, Adrian
    Kurazume, Ryo
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 1022 - 1028
  • [6] Toward a Trustworthy Classifier With Deep CNN: Uncertainty Estimation Meets Hyperspectral Image
    He, Xin
    Chen, Yushi
    Huang, Lingbo
    [J]. IEEE Transactions on Geoscience and Remote Sensing, 2022, 60
  • [7] Toward a Trustworthy Classifier With Deep CNN: Uncertainty Estimation Meets Hyperspectral Image
    He, Xin
    Chen, Yushi
    Huang, Lingbo
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [8] Deep CNN-Based Blind Image Quality Predictor
    Kim, Jongyoo
    Anh-Duc Nguyen
    Lee, Sanghoon
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2019, 30 (01) : 11 - 24
  • [9] CNN deep learning-based image to vector depiction
    Waheed, Safa Riyadh
    Rahim, Mohd Shafry Mohd
    Suaib, Norhaida Mohd
    Salim, A. A.
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (13) : 20283 - 20302
  • [10] Deep CNN Prior Based Image Reconstruction for Multispectral Imaging
    Manisali, Irfan
    Cam, Refik Mert
    Bezek, Can Deniz
    Oktem, Figen S.
    [J]. 2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020,