Video Instance Shadow Detection Under the Sun and Sky

被引:0
|
作者
Xing, Zhenghao [1 ]
Wang, Tianyu [1 ]
Hu, Xiaowei [2 ]
Wu, Haoran [1 ]
Fu, Chi-Wing [1 ]
Heng, Pheng-Ann [1 ]
机构
[1] Chinese Univ Hong Kong, Dept Comp Sci & Engn, Hong Kong, Peoples R China
[2] Shanghai Artificial Intelligence Lab, Shanghai 200232, Peoples R China
关键词
Radio frequency; Object recognition; Head; Feature extraction; Video sequences; Training; Testing; Instance segmentation; Complexity theory; Surveys; Instance shadow detection; shadow-object pairing; video analysis; shadow detection; REMOVAL;
D O I
10.1109/TIP.2024.3468877
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Instance shadow detection, crucial for applications such as photo editing and light direction estimation, has undergone significant advancements in predicting shadow instances, object instances, and their associations. The extension of this task to videos presents challenges in annotating diverse video data and addressing complexities arising from occlusion and temporary disappearances within associations. In response to these challenges, we introduce ViShadow, a semi-supervised video instance shadow detection framework that leverages both labeled image data and unlabeled video data for training. ViShadow features a two-stage training pipeline: the first stage, utilizing labeled image data, identifies shadow and object instances through contrastive learning for cross-frame pairing. The second stage employs unlabeled videos, incorporating an associated cycle consistency loss to enhance tracking ability. A retrieval mechanism is introduced to manage temporary disappearances, ensuring tracking continuity. The SOBA-VID dataset, comprising unlabeled training videos and labeled testing videos, along with the SOAP-VID metric, is introduced for the quantitative evaluation of VISD solutions. The effectiveness of ViShadow is further demonstrated through various video-level applications such as video inpainting, instance cloning, shadow editing, and text-instructed shadow-object manipulation.
引用
收藏
页码:5715 / 5726
页数:12
相关论文
共 50 条