Temporal resolution vs. visual saliency in videos: Analysis of gaze patterns and evaluation of saliency models

Cited by: 2
Authors
Cheon, Manri [1, 2]
Lee, Jong-Seok [1, 2]
Affiliations
[1] Yonsei Univ, Sch Integrated Technol, Inchon 406840, South Korea
[2] Yonsei Univ, Yonsei Inst Convergence Technol, Inchon 406840, South Korea
Funding
National Research Foundation of Singapore
Keywords
Temporal scalability; Eye-tracking; Frame rate; Perception; Quality of experience; Saliency model;
DOI
10.1016/j.image.2015.05.010
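Chinese Library Classification
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification code
0808; 0809
Abstract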
Temporal scalability of videos refers to the ability to change the frame rate adaptively for efficient video transmission. Changing the frame rate may alter the spatial locations in the scene that viewers attend to, which in turn significantly influences perceived quality. Therefore, to exploit temporal scalability effectively in applications, it is necessary to understand the relationship between frame rate variation and visual saliency. In this study, we answer three research questions: (1) Does the frame rate influence the overall gaze patterns (in an average sense over subjects)? (2) Does the frame rate influence the inter-subject variability of the gaze patterns? (3) Do state-of-the-art saliency models predict human gaze patterns reliably at different frame rates? To answer the first two questions, we conduct an eye-tracking experiment. Under a free-viewing scenario, we collect and analyze the gaze paths of human subjects watching high-definition (HD) videos at a normal or low frame rate. Our results show that both the average gaze path and the inter-subject variability of the gaze path are influenced by frame rate variation. To answer the third question, we then apply representative state-of-the-art saliency models to the videos and evaluate their performance against the gaze pattern data collected in the eye-tracking experiment. The results show a trade-off between accuracy in predicting gaze patterns and robustness to frame rate variation, which calls for further research in saliency modeling to achieve both accuracy and robustness simultaneously. (C) 2015 Elsevier B.V. All rights reserved.
Pages: 405-417
Number of pages: 13