Salient object detection in egocentric videos

Cited by: 0
Authors
Zhang, Hao [1]
Liang, Haoran [1]
Zhao, Xing [1]
Liu, Jian [1]
Liang, Ronghua [1]
Affiliations
[1] Zhejiang University of Technology, Hangzhou, People's Republic of China
Funding
National Natural Science Foundation of China
Keywords
image processing; object detection; segmentation; tracking
DOI
10.1049/ipr2.13080
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
In video salient object detection (VSOD), most research has focused on third-person videos. This focus overlooks the requirements of first-person tasks such as autonomous driving and robot vision. To bridge this gap, a novel dataset and a camera-movement-based VSOD model, CaMSD, designed specifically for egocentric videos, are introduced. First, the SalEgo dataset, comprising 17,400 fully annotated frames for video salient object detection, is presented. Second, a computational model that incorporates a camera movement module is proposed, designed to emulate the patterns observed when humans view videos. Additionally, a saliency enhancement module based on the Squeeze-and-Excitation block is incorporated so that, when attention switches between salient objects, a single salient object is segmented precisely rather than two objects simultaneously. Experimental results show that the approach outperforms other state-of-the-art methods in egocentric video salient object detection tasks. Dataset and codes can be found at .
Pages: 2028-2037
Number of pages: 10