Salient object detection in egocentric videos

被引:0
|
作者
Zhang, Hao [1 ]
Liang, Haoran [1 ]
Zhao, Xing [1 ]
Liu, Jian [1 ]
Liang, Ronghua [1 ]
机构
[1] Zhejiang Univ Technol, Hangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
image processing; object detection; SEGMENTATION; TRACKING;
D O I
10.1049/ipr2.13080
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the realm of video salient object detection (VSOD), the majority of research has traditionally been centered on third-person perspective videos. However, this focus overlooks the unique requirements of certain first-person tasks, such as autonomous driving or robot vision. To bridge this gap, a novel dataset and a camera-based VSOD model, CaMSD, specifically designed for egocentric videos, is introduced. First, the SalEgo dataset, comprising 17,400 fully annotated frames for video salient object detection, is presented. Second, a computational model that incorporates a camera movement module is proposed, designed to emulate the patterns observed when humans view videos. Additionally, to achieve precise segmentation of a single salient object during switches between salient objects, as opposed to simultaneously segmenting two objects, a saliency enhancement module based on the Squeeze and Excitation Block is incorporated. Experimental results show that the approach outperforms other state-of-the-art methods in egocentric video salient object detection tasks. Dataset and codes can be found at . We propose a new egocentric video salient object detection (VSOD) dataset SalEgo. And we propose a new Camera Movement based method CaMSD for the new dataset and compare to some models. Experimental results show that our approach outperforms other state-of-the-art methods in egocentric video salient object detection tasks. image
引用
收藏
页码:2028 / 2037
页数:10
相关论文
共 50 条
  • [11] Real-time Salient Object Detection Engine for High Definition Videos
    Fu, Yu-Jie
    Wu, Guan-Lin
    Chien, Shao-Yi
    2013 INTERNATIONAL SYMPOSIUM ON VLSI DESIGN, AUTOMATION, AND TEST (VLSI-DAT), 2013,
  • [12] Jointly Recognizing Object Fluents and Tasks in Egocentric Videos
    Liu, Yang
    Wei, Ping
    Zhu, Song-Chun
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 2943 - 2951
  • [13] Predicting Human-Object Interactions in Egocentric Videos
    Benavent-Lledo, Manuel
    Oprea, Sergiu
    Alejandro Castro-Vargas, John
    Mulero-Perez, David
    Garcia-Rodriguez, Jose
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [14] Object Discovery Using CNN Features in Egocentric Videos
    Bolanos, Marc
    Garolera, Maite
    Radeva, Petia
    PATTERN RECOGNITION AND IMAGE ANALYSIS (IBPRIA 2015), 2015, 9117 : 67 - 74
  • [15] OGaze: Gaze Prediction in Egocentric Videos for Attentional Object Selection
    Al-Naser, Mohammad
    Siddiqui, Shoaib Ahmed
    Ohashi, Hiroki
    Ahmed, Sheraz
    Katsuyki, Nakamura
    Takuto, Sato
    Dengel, Andreas
    2019 DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA), 2019, : 270 - 277
  • [16] Next-active-object prediction from egocentric videos
    Furnari, Antonino
    Battiato, Sebastiano
    Grauman, Kristen
    Farinella, Giovanni Maria
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2017, 49 : 401 - 411
  • [17] What is a Salient Object? A Dataset and a Baseline Model for Salient Object Detection
    Borji, Ali
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2015, 24 (02) : 742 - 756
  • [18] Salient Object Detection by Composition
    Feng, Jie
    Wei, Yichen
    Tao, Litian
    Zhang, Chao
    Sun, Jian
    2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2011, : 1028 - 1035
  • [19] Spectral salient object detection
    Fu, Keren
    Gu, Irene Yu-Hua
    Yang, Jie
    NEUROCOMPUTING, 2018, 275 : 788 - 803
  • [20] Salient object detection: A survey
    Borji, Ali
    Cheng, Ming-Ming
    Hou, Qibin
    Jiang, Huaizu
    Li, Jia
    COMPUTATIONAL VISUAL MEDIA, 2019, 5 (02) : 117 - 150