ResMem-Net: memory based deep CNN for image memorability estimation

被引:2
|
作者
Praveen, Arockia [1 ]
Noorwali, Abdulfattah [2 ]
Samiayya, Duraimurugan [3 ]
Khan, Mohammad Zubair [4 ]
Vincent, Durai Raj P. M. [5 ]
Bashir, Ali Kashif [6 ]
Alagupandi, Vinoth [3 ]
机构
[1] Phosphene AI, Madurai, Tamil Nadu, India
[2] Umm Al Qura Univ, Mecca, Saudi Arabia
[3] Optisol Business Solut, Chennai, Tamil Nadu, India
[4] Taibah Univ, Dept Comp Sci, Medina, Saudi Arabia
[5] Vellore Inst Technol, Sch Informat Technol & Engn, Vellore, Tamil Nadu, India
[6] Manchester Metropolitan Univ, Manchester, Lancs, England
关键词
Deep Learning; Image Memorability; Visual Emotions; Saliency; Object Interestingness;
D O I
10.7717/peerj-cs.767
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Image memorability is a very hard problem in image processing due to its subjective nature. But due to the introduction of Deep Learning and the large availability of data and GPUs, great strides have been made in predicting the memorability of an image. In this paper, we propose a novel deep learning architecture called ResMem-Net that is a hybrid of LSTM and CNN that uses information from the hidden layers of the CNN to compute the memorability score of an image. The intermediate layers are important for predicting the output because they contain information about the intrinsic properties of the image. The proposed architecture automatically learns visual emotions and saliency, shown by the heatmaps generated using the GradRAM technique. We have also used the heatmaps and results to analyze and answer one of the most important questions in image memorability: "What makes an image memorable?''. The model is trained and evaluated using the publicly available Large-scale Image Memorability dataset (LaMem) from MIT. The results show that the model achieves a rank correlation of 0.679 and a mean squared error of 0.011, which is better than the current state-of-the-art models and is close to human consistency (p = 0.68). The proposed architecture also has a significantly low number of parameters compared to the state-of-the-art architecture, making it memory efficient and suitable for production.
引用
收藏
页码:1 / 27
页数:27
相关论文
共 50 条
  • [41] Deep CNN-based channel estimation for mmWave massive MIMO systems
    Dong, Peihao
    Zhang, Hua
    Ye Li, Geoffrey
    Gaspar, Ivan Simões
    NaderiAlizadeh, Navid
    arXiv, 2019,
  • [42] Deep Ensemble CNN Method Based on Sample Expansion for Hyperspectral Image Classification
    Dong, Shuxian
    Feng, Wei
    Quan, Yinghui
    Dauphin, Gabriel
    Gao, Lianru
    Xing, Mengdao
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [43] Deep CNN-Based Channel Estimation for mmWave Massive MIMO Systems
    Dong, Peihao
    Zhang, Hua
    Li, Geoffrey Ye
    Gaspar, Ivan Simoes
    NaderiAlizadeh, Navid
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2019, 13 (05) : 989 - 1000
  • [44] Blind Dehazed Image Quality Assessment: A Deep CNN-Based Approach
    Lv, Xiao
    Xiang, Tao
    Yang, Ying
    Liu, Hantao
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 9410 - 9424
  • [45] Attention-based lightweight deep hybrid CNN framework for image restoration
    Karthikeyan, V.
    Visu, Y. Palin
    IMAGING SCIENCE JOURNAL, 2024,
  • [46] Deep CNN based Image Compression with Redundancy Minimization via Attention Guidance
    Mishra, Dipti
    Singh, Satish Kumar
    Singh, Rajat Kumar
    NEUROCOMPUTING, 2022, 507 : 397 - 411
  • [47] CNN-based image small-angle rotation angle estimation
    Chen, Dianze
    Song, Zhonghao
    Guo, Jingyu
    Yang, Shuai
    Yao, Heng
    2022 IEEE 24TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2022,
  • [48] Evaluation of CNN-Based Single-Image Depth Estimation Methods
    Koch, Tobias
    Liebel, Lukas
    Fraundorfer, Friedrich
    Koerner, Marco
    COMPUTER VISION - ECCV 2018 WORKSHOPS, PT III, 2019, 11131 : 331 - 348
  • [49] CNN-based local motion estimation chip for image stabilization processing
    Lin, Chin-Teng
    Chen, Shi-An
    Cheng, Ying-Chang
    Chung, Jen-Feng
    2006 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-11, PROCEEDINGS, 2006, : 2645 - +
  • [50] Deep Residual Net Based Compact Feature Representation for Image Retrieval
    Bai, Cong
    Chen, Jian
    Ma, Qing
    Liu, Zhi
    Chen, Shengyong
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2018, PT II, 2018, 11165 : 737 - 747