Mask-Free Video Instance Segmentation

被引:9
|
作者
Ke, Lei [1 ,2 ]
Danelljan, Martin [1 ]
Ding, Henghui [1 ]
Tai, Yu-Wing [2 ]
Tang, Chi-Keung [2 ]
Yu, Fisher [1 ]
机构
[1] Swiss Fed Inst Technol, Zurich, Switzerland
[2] HKUST, Hong Kong, Peoples R China
关键词
D O I
10.1109/CVPR52729.2023.02189
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The recent advancement in Video Instance Segmentation (VIS) has largely been driven by the use of deeper and increasingly data-hungry transformer-based models. However, video masks are tedious and expensive to annotate, limiting the scale and diversity of existing VIS datasets. In this work, we aim to remove the mask-annotation requirement. We propose MaskFreeVIS, achieving highly competitive VIS performance, while only using bounding box annotations for the object state. We leverage the rich temporal mask consistency constraints in videos by introducing the Temporal KNN-patch Loss (TK-Loss), providing strong mask supervision without any labels. Our TK-Loss finds one-to-many matches across frames, through an efficient patch-matching step followed by a K-nearest neighbor selection. A consistency loss is then enforced on the found matches. Our mask-free objective is simple to implement, has no trainable parameters, is computationally efficient, yet outperforms baselines employing, e.g., state-of-the-art optical flow to enforce temporal mask consistency. We validate MaskFreeVIS on the YouTube-VIS 2019/2021, OVIS and BDD100K MOTS benchmarks. The results clearly demonstrate the efficacy of our method by drastically narrowing the gap between fully and weakly-supervised VIS performance. Our code and trained models are available at http://vis.xyz/pub/maskfreevis.
引用
收藏
页码:22857 / 22866
页数:10
相关论文
共 50 条
  • [21] An anchor-free instance segmentation method for cells based on mask contourAn anchor-free instance segmentation method for cells based on mask contourQ. Chen et al.
    Qi Chen
    Huihuang Zhang
    Qianwei Zhou
    Qiu Guan
    Haigen Hu
    Applied Intelligence, 2025, 55 (2)
  • [22] DynaMask: Dynamic Mask Selection for Instance Segmentation
    Li, Ruihuang
    He, Chenhang
    Li, Shuai
    Zhang, Yabin
    Zhang, Lei
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 11279 - 11288
  • [23] MaskPlus: Improving Mask Generation for Instance Segmentation
    Xu, Shichao
    Lan, Shuyue
    Zhu, Chi
    2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 2019 - 2027
  • [24] SketchEdit: Mask-Free Local Image Manipulation with Partial Sketches
    Zeng, Yu
    Lin, Zhe
    Patel, Vishal M.
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 5941 - 5951
  • [25] Occluded Video Instance Segmentation: A Benchmark
    Qi, Jiyang
    Gao, Yan
    Hu, Yao
    Wang, Xinggang
    Liu, Xiaoyu
    Bai, Xiang
    Belongie, Serge
    Yuille, Alan
    Torr, Philip H. S.
    Bai, Song
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2022, 130 (08) : 2022 - 2039
  • [26] MobileInst: Video Instance Segmentation on the Mobile
    Zhang, Renhong
    Cheng, Tianheng
    Yang, Shusheng
    Jiang, Haoyi
    Zhang, Shuai
    Lyu, Jiancheng
    Li, Xin
    Ying, Xiaowen
    Gao, Dashan
    Liu, Wenyu
    Wang, Xinggang
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 7, 2024, : 7260 - 7268
  • [27] Occluded Video Instance Segmentation: A Benchmark
    Jiyang Qi
    Yan Gao
    Yao Hu
    Xinggang Wang
    Xiaoyu Liu
    Xiang Bai
    Serge Belongie
    Alan Yuille
    Philip H. S. Torr
    Song Bai
    International Journal of Computer Vision, 2022, 130 : 2022 - 2039
  • [28] A Generalized Framework for Video Instance Segmentation
    Heo, Miran
    Hwang, Sukjun
    Hyun, Jeongseok
    Kim, Hanjung
    Oh, Seoung Wug
    Lee, Joon-Young
    Kim, Seon Joo
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 14623 - 14632
  • [29] Instance as Identity: A Generic Online Paradigm for Video Instance Segmentation
    Zhu, Feng
    Yang, Zongxin
    Yu, Xin
    Yang, Yi
    Wei, Yunchao
    COMPUTER VISION, ECCV 2022, PT XXIX, 2022, 13689 : 524 - 540
  • [30] A Mask-Free Passivation Process for Low Noise Nanopore Devices
    Lim, Min-Cheol
    Lee, Min-Hyun
    Kim, Ki-Bum
    Jeon, Tae-Joon
    Kim, Young-Rok
    JOURNAL OF NANOSCIENCE AND NANOTECHNOLOGY, 2015, 15 (08) : 5971 - 5977