Salient object detection in egocentric videos

被引:0
|
作者
Zhang, Hao [1 ]
Liang, Haoran [1 ]
Zhao, Xing [1 ]
Liu, Jian [1 ]
Liang, Ronghua [1 ]
机构
[1] Zhejiang Univ Technol, Hangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
image processing; object detection; SEGMENTATION; TRACKING;
D O I
10.1049/ipr2.13080
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the realm of video salient object detection (VSOD), the majority of research has traditionally been centered on third-person perspective videos. However, this focus overlooks the unique requirements of certain first-person tasks, such as autonomous driving or robot vision. To bridge this gap, a novel dataset and a camera-based VSOD model, CaMSD, specifically designed for egocentric videos, is introduced. First, the SalEgo dataset, comprising 17,400 fully annotated frames for video salient object detection, is presented. Second, a computational model that incorporates a camera movement module is proposed, designed to emulate the patterns observed when humans view videos. Additionally, to achieve precise segmentation of a single salient object during switches between salient objects, as opposed to simultaneously segmenting two objects, a saliency enhancement module based on the Squeeze and Excitation Block is incorporated. Experimental results show that the approach outperforms other state-of-the-art methods in egocentric video salient object detection tasks. Dataset and codes can be found at . We propose a new egocentric video salient object detection (VSOD) dataset SalEgo. And we propose a new Camera Movement based method CaMSD for the new dataset and compare to some models. Experimental results show that our approach outperforms other state-of-the-art methods in egocentric video salient object detection tasks. image
引用
收藏
页码:2028 / 2037
页数:10
相关论文
共 50 条
  • [31] Discovering salient objects from videos using spatiotemporal salient region detection
    Kannan, Rajkumar
    Ghinea, Gheorghita
    Swaminathan, Sridhar
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2015, 36 : 154 - 178
  • [32] Geometrical cues in visual saliency models for active object recognition in egocentric videos
    Vincent Buso
    Jenny Benois-Pineau
    Jean-Philippe Domenger
    Multimedia Tools and Applications, 2015, 74 : 10077 - 10095
  • [33] Salient Objects in Clutter: Bringing Salient Object Detection to the Foreground
    Fan, Deng-Ping
    Cheng, Ming-Ming
    Liu, Jiang-Jiang
    Gao, Shang-Hua
    Hou, Qibin
    Borji, Ali
    COMPUTER VISION - ECCV 2018, PT 15, 2018, 11219 : 196 - 212
  • [34] SALIENT OBJECT DETECTION IN HYPERSPECTRAL IMAGERY
    Liang, Jie
    Zhou, Jun
    Bai, Xiao
    Qian, Yuntao
    2013 20TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2013), 2013, : 2393 - 2397
  • [35] Salient Region Detection for Object Tracking
    Chan, Fan
    Jiang, Min
    Tang, Jinshan
    MOBILE MULTIMEDIA/IMAGE PROCESSING, SECURITY, AND APPLICATIONS 2012, 2012, 8406
  • [36] Super Diffusion for Salient Object Detection
    Jiang, Peng
    Pan, Zhiyi
    Tu, Changhe
    Vasconcelos, Nuno
    Chen, Baoquan
    Peng, Jingliang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 2903 - 2917
  • [37] Salient object detection based on regions
    Zhuojia Liang
    Mingjia Wang
    Xiaocong Zhou
    Liang Lin
    Wenjun Li
    Multimedia Tools and Applications, 2014, 68 : 517 - 544
  • [38] SALIENT OBJECT DETECTION WITH BOUNDARY INFORMATION
    Chen, Kai
    Wang, Yongxiong
    Hu, Chuanfei
    Shao, Hang
    2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
  • [39] Oscillation analysis for salient object detection
    Liu, Yang
    Li, Xueqing
    Wang, Lei
    Niu, Yuzhen
    Liu, Feng
    MULTIMEDIA TOOLS AND APPLICATIONS, 2014, 68 (03) : 659 - 679
  • [40] Salient Object Detection With Importance Degree
    Umeki, Yo
    Funahashi, Isana
    Yoshida, Taichi
    Iwahashi, Masahiro
    IEEE ACCESS, 2020, 8 (08): : 147059 - 147069