Salient object detection in egocentric videos

Cited by: 0

Authors:

Zhang, Hao [1]

Liang, Haoran [1]

Zhao, Xing [1]

Liu, Jian [1]

Liang, Ronghua [1]

Affiliations:

[1] Zhejiang Univ Technol, Hangzhou, Peoples R China

Source:

IET IMAGE PROCESSING | 2024 / Vol. 18 / Issue 08

Funding:

National Natural Science Foundation of China;

Keywords:

image processing; object detection; SEGMENTATION; TRACKING;

DOI:

10.1049/ipr2.13080

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In video salient object detection (VSOD), most research has centered on third-person perspective videos. However, this focus overlooks the unique requirements of certain first-person tasks, such as autonomous driving or robot vision. To bridge this gap, a novel dataset and a camera-based VSOD model, CaMSD, specifically designed for egocentric videos, are introduced. First, the SalEgo dataset, comprising 17,400 fully annotated frames for video salient object detection, is presented. Second, a computational model that incorporates a camera movement module is proposed, designed to emulate the patterns observed when humans view videos. Additionally, to segment a single salient object precisely when saliency switches from one object to another, rather than segmenting both objects simultaneously, a saliency enhancement module based on the Squeeze-and-Excitation block is incorporated. Experimental results show that the approach outperforms other state-of-the-art methods in egocentric video salient object detection tasks. Dataset and code can be found at .


Pages: 2028-2037

Number of pages: 10
