Salient object detection in egocentric videos

Cited by: 0
Authors
Zhang, Hao [1]
Liang, Haoran [1]
Zhao, Xing [1]
Liu, Jian [1]
Liang, Ronghua [1]
Affiliations
[1] Zhejiang Univ Technol, Hangzhou, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
image processing; object detection; segmentation; tracking
DOI
10.1049/ipr2.13080
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
In video salient object detection (VSOD), most research has centered on third-person videos. This focus overlooks the requirements of first-person tasks such as autonomous driving and robot vision. To bridge this gap, a novel dataset and a camera-based VSOD model, CaMSD, designed specifically for egocentric videos, are introduced. First, the SalEgo dataset, comprising 17,400 fully annotated frames for video salient object detection, is presented. Second, a computational model that incorporates a camera movement module is proposed, designed to emulate the patterns observed when humans view videos. Additionally, a saliency enhancement module based on the Squeeze-and-Excitation block is incorporated so that, when the salient object switches, a single salient object is segmented precisely rather than two objects simultaneously. Experimental results show that the approach outperforms other state-of-the-art methods on egocentric video salient object detection tasks. Dataset and code can be found at .
Pages: 2028-2037
Page count: 10