Video Instance Shadow Detection Under the Sun and Sky

被引:0
|
作者
Xing, Zhenghao [1 ]
Wang, Tianyu [1 ]
Hu, Xiaowei [2 ]
Wu, Haoran [1 ]
Fu, Chi-Wing [1 ]
Heng, Pheng-Ann [1 ]
机构
[1] Chinese Univ Hong Kong, Dept Comp Sci & Engn, Hong Kong, Peoples R China
[2] Shanghai Artificial Intelligence Lab, Shanghai 200232, Peoples R China
关键词
Radio frequency; Object recognition; Head; Feature extraction; Video sequences; Training; Testing; Instance segmentation; Complexity theory; Surveys; Instance shadow detection; shadow-object pairing; video analysis; shadow detection; REMOVAL;
D O I
10.1109/TIP.2024.3468877
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Instance shadow detection, crucial for applications such as photo editing and light direction estimation, has undergone significant advancements in predicting shadow instances, object instances, and their associations. The extension of this task to videos presents challenges in annotating diverse video data and addressing complexities arising from occlusion and temporary disappearances within associations. In response to these challenges, we introduce ViShadow, a semi-supervised video instance shadow detection framework that leverages both labeled image data and unlabeled video data for training. ViShadow features a two-stage training pipeline: the first stage, utilizing labeled image data, identifies shadow and object instances through contrastive learning for cross-frame pairing. The second stage employs unlabeled videos, incorporating an associated cycle consistency loss to enhance tracking ability. A retrieval mechanism is introduced to manage temporary disappearances, ensuring tracking continuity. The SOBA-VID dataset, comprising unlabeled training videos and labeled testing videos, along with the SOAP-VID metric, is introduced for the quantitative evaluation of VISD solutions. The effectiveness of ViShadow is further demonstrated through various video-level applications such as video inpainting, instance cloning, shadow editing, and text-instructed shadow-object manipulation.
引用
收藏
页码:5715 / 5726
页数:12
相关论文
共 50 条
  • [21] The shadow of the sun
    Finnegan, W
    NEW YORK TIMES BOOK REVIEW, 2001, : 11 - 11
  • [22] 'SHADOW TO SUN'
    DONALD, M
    ANTIGONISH REVIEW, 1994, (98): : 129 - 129
  • [23] The Shadow and the Sun
    MacAdam, Ethan H.
    LAW CULTURE AND THE HUMANITIES, 2021, 17 (01) : 17 - 34
  • [24] In the shadow of the Sun
    David Gevaux
    Nature Physics, 2013, 9 (8) : 454 - 454
  • [25] SHADOW OF SUN
    JONES, AL
    SCIENCE, 1974, 185 (4154) : 813 - 813
  • [26] Video Event Detection by Inferring Temporal Instance Labels
    Lai, Kuan-Ting
    Yu, Felix X.
    Chen, Ming-Syan
    Chang, Shih-Fu
    2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 2251 - 2258
  • [27] Triple-cooperative Video Shadow Detection
    Chen, Zhihao
    Wan, Liang
    Zhu, Lei
    Shen, Jia
    Fu, Huazhu
    Liu, Wennan
    Qin, Jing
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 2714 - 2723
  • [28] Sharp Multiple Instance Learning for DeepFake Video Detection
    Li, Xiaodan
    Lang, Yining
    Chen, Yuefeng
    Mao, Xiaofeng
    He, Yuan
    Wang, Shuhui
    Xue, Hui
    Lu, Quan
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 1864 - 1872
  • [29] Multiple Instance Relational Learning for Video Anomaly Detection
    Dengxiong, Xiwen
    Bao, Wentao
    Kong, Yu
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [30] Single-Stage Instance Shadow Detection with Bidirectional Relation Learning
    Wang, Tianyu
    Hu, Xiaowei
    Fu, Chi-Wing
    Heng, Pheng-Ann
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 1 - 11