Context-Aware Relative Object Queries to Unify Video Instance and Panoptic Segmentation

被引:2
|
作者
Choudhuri, Anwesa [1 ]
Chowdhary, Girish [1 ]
Schwing, Alexander G. [1 ]
机构
[1] Univ Illinois, Champaign, IL 61801 USA
基金
美国食品与农业研究所;
关键词
D O I
10.1109/CVPR52729.2023.00617
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Object queries have emerged as a powerful abstraction to generically represent object proposals. However, their use for temporal tasks like video segmentation poses two questions: 1) How to process frames sequentially and propagate object queries seamlessly across frames. Using independent object queries per frame doesn't permit tracking, and requires post-processing. 2) How to produce temporally consistent, yet expressive object queries that model both appearance and position changes. Using the entire video at once doesn't capture position changes and doesn't scale to long videos. As one answer to both questions we propose 'context-aware relative object queries', which are continuously propagated frame-by-frame. They seamlessly track objects and deal with occlusion and re-appearance of objects, without post-processing. Further, we find context-aware relative object queries better capture position changes of objects in motion. We evaluate the proposed approach across three challenging tasks: video instance segmentation, multi-object tracking and segmentation, and video panoptic segmentation. Using the same approach and architecture, we match or surpass state-of-the art results on the diverse and challenging OVIS, Youtube-VIS, Cityscapes-VPS, MOTS 2020 and KITTI-MOTS data.
引用
下载
收藏
页码:6377 / 6386
页数:10
相关论文
共 50 条
  • [1] Context-aware Deformable Alignment for Video Object Segmentation
    Yang, Jie
    Xia, Mingfu
    Zhou, Xue
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 303 - 309
  • [2] Context-Aware Video Object Proposals
    Geng, Wenjing
    Wu, Gangshan
    2016 IEEE 22ND INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2016, : 1203 - 1206
  • [3] Adaptive video object proposals by a context-aware model
    Geng, Wenjing
    Zhang, Chunlong
    Wu, Gangshan
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (09) : 10589 - 10614
  • [4] Adaptive video object proposals by a context-aware model
    Wenjing Geng
    Chunlong Zhang
    Gangshan Wu
    Multimedia Tools and Applications, 2018, 77 : 10589 - 10614
  • [5] Instance Sequence Queries for Video Instance Segmentation with Transformers
    Xu, Zhujun
    Vivet, Damien
    SENSORS, 2021, 21 (13)
  • [6] Instance Motion Tendency Learning for Video Panoptic Segmentation
    Wang, Le
    Liu, Hongzhen
    Zhou, Sanping
    Tang, Wei
    Hua, Gang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 764 - 778
  • [7] Panoptic, Instance and Semantic Relations: A Relational Context Encoder to Enhance Panoptic Segmentation
    Borse, Shubhankar
    Park, Hyojin
    Cai, Hong
    Das, Debasmit
    Garrepalli, Risheek
    Porikli, Fatih
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 1259 - 1269
  • [8] Context-aware Method for Small Object Segmentation in Road Scenes
    Wang, Haitao
    Chen, Guang
    Li, Zhijun
    Peng, Jianyi
    Liu, Zhengfa
    Wu, Ya
    2022 INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS (ICARM 2022), 2022, : 238 - 243
  • [9] Video Object Segmentation in Panoptic Wild Scenes
    Xu, Yuanyou
    Yang, Zongxin
    Yang, Yi
    PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 1604 - 1612
  • [10] Context-Aware MIML Instance Annotation
    Briggs, Forrest
    Fern, Xiaoli Z.
    Raich, Raviv
    2013 IEEE 13TH INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2013, : 41 - 50