Understanding and Predicting Image Memorability at a Large Scale

被引:187
|
作者
Khosla, Aditya [1 ]
Raju, Akhil S. [1 ]
Torralba, Antonio [1 ]
Oliva, Aude [1 ]
机构
[1] MIT, Cambridge, MA 02139 USA
基金
美国国家科学基金会;
关键词
D O I
10.1109/ICCV.2015.275
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Progress in estimating visual memorability has been limited by the small scale and lack of variety of benchmark data. Here, we introduce a novel experimental procedure to objectively measure human memory, allowing us to build LaMem, the largest annotated image memorability dataset to date (containing 60,000 images from diverse sources). Using Convolutional Neural Networks (CNNs), we show that fine-tuned deep features outperform all other features by a large margin, reaching a rank correlation of 0.64, near human consistency (0.68). Analysis of the responses of the high-level CNN layers shows which objects and regions are positively, and negatively, correlated with memorability, allowing us to create memorability maps for each image and provide a concrete method to perform image memorability manipulation. This work demonstrates that one can now robustly estimate the memorability of images from many different classes, positioning memorability and deep memorability features as prime candidates to estimate the utility of information for cognitive systems. Our model and data are available at: http://memorability.csail.mit.edu
引用
收藏
页码:2390 / 2398
页数:9
相关论文
共 50 条
  • [41] Bag-of-features for image memorability evaluation
    Lahrache, Souad
    El Ouazzani, Rajae
    El Qadi, Abderrahim
    IET COMPUTER VISION, 2016, 10 (06) : 577 - 584
  • [42] The effect of intrinsic image memorability on recollection and familiarity
    N. Broers
    N.A. Busch
    Memory & Cognition, 2021, 49 : 998 - 1018
  • [43] Predicting Event Memorability from Contextual Visual Semantics
    Xu, Qianli
    Fang, Fen
    del Molino, Ana Garcia
    Subbaraju, Vigneshwaran
    Lim, Joo Hwee
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [44] Correction to: The effect of intrinsic image memorability on recollection and familiarity
    N. Broers
    N. A. Busch
    Memory & Cognition, 2021, 49 : 1019 - 1019
  • [45] Fast Image Searching in Large Scale Image Database
    Durmaz, Osman
    Bilge, Hasan Sakir
    2017 25TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2017,
  • [46] Large Visual Words for Large Scale Image Classification
    Tang, Sheng
    Chen, Hui
    Lv, Ke
    Zhang, Yong-Dong
    2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 1170 - 1174
  • [47] Image Memorability Prediction Using Depth and Motion Cues
    Basavaraju, Sathisha
    Sur, Arijit
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2020, 7 (03) : 600 - 609
  • [48] PLSA on large scale image databases
    Lienhart, Rainer
    Slaney, Malcolm
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 1217 - +
  • [49] Large scale image aided navigation
    Venable, Donald T.
    Raquet, John F.
    IEEE Transactions on Aerospace and Electronic Systems, 2016, 52 (06): : 2849 - 2860
  • [50] Large Scale Image Aided Navigation
    Venable, Donald T.
    Raquet, John F.
    IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2016, 52 (06) : 2849 - 2860