Video Instance Shadow Detection Under the Sun and Sky

Authors
Xing, Zhenghao [1]
Wang, Tianyu [1]
Hu, Xiaowei [2]
Wu, Haoran [1]
Fu, Chi-Wing [1]
Heng, Pheng-Ann [1]
Affiliations
[1] Chinese Univ Hong Kong, Dept Comp Sci & Engn, Hong Kong, Peoples R China
[2] Shanghai Artificial Intelligence Lab, Shanghai 200232, Peoples R China
Keywords
Radio frequency; Object recognition; Head; Feature extraction; Video sequences; Training; Testing; Instance segmentation; Complexity theory; Surveys; Instance shadow detection; shadow-object pairing; video analysis; shadow detection; REMOVAL;
DOI
10.1109/TIP.2024.3468877
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Instance shadow detection, crucial for applications such as photo editing and light direction estimation, has undergone significant advancements in predicting shadow instances, object instances, and their associations. Extending this task to videos, i.e., video instance shadow detection (VISD), presents challenges in annotating diverse video data and in handling occlusion and temporary disappearances within associations. In response to these challenges, we introduce ViShadow, a semi-supervised VISD framework that leverages both labeled image data and unlabeled video data for training. ViShadow features a two-stage training pipeline: the first stage, utilizing labeled image data, identifies shadow and object instances through contrastive learning for cross-frame pairing. The second stage employs unlabeled videos, incorporating an associated cycle consistency loss to enhance tracking ability. A retrieval mechanism is introduced to manage temporary disappearances, ensuring tracking continuity. The SOBA-VID dataset, comprising unlabeled training videos and labeled testing videos, along with the SOAP-VID metric, is introduced for the quantitative evaluation of VISD solutions. The effectiveness of ViShadow is further demonstrated through various video-level applications such as video inpainting, instance cloning, shadow editing, and text-instructed shadow-object manipulation.
Pages: 5715-5726
Number of pages: 12