Understanding and Predicting Image Memorability at a Large Scale

Cited by: 187
Authors
Khosla, Aditya [1]
Raju, Akhil S. [1]
Torralba, Antonio [1]
Oliva, Aude [1]
Affiliations
[1] MIT, Cambridge, MA 02139 USA
Funding
National Science Foundation (USA)
DOI
10.1109/ICCV.2015.275
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Progress in estimating visual memorability has been limited by the small scale and lack of variety of benchmark data. Here, we introduce a novel experimental procedure to objectively measure human memory, allowing us to build LaMem, the largest annotated image memorability dataset to date (containing 60,000 images from diverse sources). Using Convolutional Neural Networks (CNNs), we show that fine-tuned deep features outperform all other features by a large margin, reaching a rank correlation of 0.64, near human consistency (0.68). Analysis of the responses of the high-level CNN layers shows which objects and regions are positively, and negatively, correlated with memorability, allowing us to create memorability maps for each image and provide a concrete method to perform image memorability manipulation. This work demonstrates that one can now robustly estimate the memorability of images from many different classes, positioning memorability and deep memorability features as prime candidates to estimate the utility of information for cognitive systems. Our model and data are available at: http://memorability.csail.mit.edu
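The rank correlation quoted in the abstract (0.64 for the model, 0.68 for human consistency) can be illustrated with a short sketch. It assumes the metric is Spearman's rank correlation, computed as the Pearson correlation of the ranks, with tied values given their average rank. The function names and the sample scores below are hypothetical and are not taken from the paper or the LaMem dataset.

```python
from math import sqrt


def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        # Extend j to cover the run of values equal to values[order[i]].
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman rank correlation: Pearson correlation of the two rank lists."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mean_x, mean_y = sum(rx) / n, sum(ry) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(rx, ry))
    var_x = sum((a - mean_x) ** 2 for a in rx)
    var_y = sum((b - mean_y) ** 2 for b in ry)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / sqrt(var_x * var_y)


# Hypothetical scores for four images (illustrative values only).
predicted = [0.9, 0.4, 0.7, 0.6]
measured = [0.8, 0.5, 0.6, 0.7]
rho = spearman(predicted, measured)
```

Because only the ordering of the scores matters, a model can reach a high rank correlation even when its raw scores sit on a different scale from the measured ones.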
Pages: 2390-2398
Number of pages: 9
Related papers
50 in total
  • [21] Illustrative Language Understanding: Large-Scale Visual Grounding with Image Search
    Kiros, Jamie Ryan
    Chan, William
    Hinton, Geoffrey E.
    PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL), VOL 1, 2018, : 922 - 933
  • [22] LSH-based semantic dictionary learning for large scale image understanding
    Li, Liang
    Yan, Chenggang Clarence
    Ji, Wen
    Chen, Bo-Wei
    Jiang, Shuqiang
    Huang, Qingming
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2015, 31 : 231 - 236
  • [23] BIGEARTHNET: A LARGE-SCALE BENCHMARK ARCHIVE FOR REMOTE SENSING IMAGE UNDERSTANDING
    Sumbul, Gencer
    Charfuelan, Marcela
    Demir, Beguem
    Markl, Volker
    2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 5901 - 5904
  • [24] Rules of photography for image memorability analysis
    Lahrache, Souad
    El Ouazzani, Rajae
    El Qadi, Abderrahim
    IET IMAGE PROCESSING, 2018, 12 (07) : 1228 - 1236
  • [25] IMAGE MEMORABILITY: THE ROLE OF DEPTH AND MOTION
    Basavaraju, Sathisha
    Mittal, Paritosh
    Sur, Arijit
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 699 - 703
  • [26] Memorability-based image compression
    Khanna, Meera Thapar
    Ralekar, Chetan
    Goel, Anurika
    Chaudhury, Santanu
    Lall, Brejesh
    IET IMAGE PROCESSING, 2019, 13 (09) : 1490 - 1501
  • [27] Intrinsic and extrinsic effects on image memorability
    Bylinskii, Zoya
    Isola, Phillip
    Bainbridge, Constance
    Torralba, Antonio
    Oliva, Aude
    VISION RESEARCH, 2015, 116 : 165 - 178
  • [28] Pupillary Responses Reflect Image Memorability
    Niimi, Ryosuke
    PSYCHOPHYSIOLOGY, 2025, 62 (02)
  • [29] Predicting the Memorability of Natural-scene Images
    Lu, Jiaxin
    Xu, Mai
    Wang, Zulin
    2016 30TH ANNIVERSARY OF VISUAL COMMUNICATION AND IMAGE PROCESSING (VCIP), 2016,
  • [30] Understanding memorability through artificial and artist intelligence
    Goetschalckx, Lore
    Damiano, Claudia
    TRENDS IN COGNITIVE SCIENCES, 2023, 27 (11) : 983 - 984