Context-Aware Relative Object Queries to Unify Video Instance and Panoptic Segmentation

Cited by: 2

Authors:

Choudhuri, Anwesa [1]

Chowdhary, Girish [1]

Schwing, Alexander G. [1]

Affiliations:

[1] Univ Illinois, Champaign, IL 61801 USA

Source:

2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR | 2023

Funding:

National Institute of Food and Agriculture (USA);

Keywords:

DOI:

10.1109/CVPR52729.2023.00617

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Object queries have emerged as a powerful abstraction to generically represent object proposals. However, their use for temporal tasks like video segmentation poses two questions: 1) How to process frames sequentially and propagate object queries seamlessly across frames. Using independent object queries per frame doesn't permit tracking, and requires post-processing. 2) How to produce temporally consistent, yet expressive object queries that model both appearance and position changes. Using the entire video at once doesn't capture position changes and doesn't scale to long videos. As one answer to both questions we propose 'context-aware relative object queries', which are continuously propagated frame-by-frame. They seamlessly track objects and deal with occlusion and re-appearance of objects, without post-processing. Further, we find context-aware relative object queries better capture position changes of objects in motion. We evaluate the proposed approach across three challenging tasks: video instance segmentation, multi-object tracking and segmentation, and video panoptic segmentation. Using the same approach and architecture, we match or surpass state-of-the-art results on the diverse and challenging OVIS, YouTube-VIS, Cityscapes-VPS, MOTS 2020 and KITTI-MOTS data.

Pages: 6377 - 6386

Page count: 10

Related records: 50 in total

[1] Context-aware Deformable Alignment for Video Object Segmentation
Yang, Jie
Xia, Mingfu
Zhou, Xue
2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 303 - 309
[2] Context-Aware Video Object Proposals
Geng, Wenjing
Wu, Gangshan
2016 IEEE 22ND INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2016, : 1203 - 1206
[3] Adaptive video object proposals by a context-aware model
Geng, Wenjing
Zhang, Chunlong
Wu, Gangshan
MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (09) : 10589 - 10614
[4] Adaptive video object proposals by a context-aware model
Wenjing Geng
Chunlong Zhang
Gangshan Wu
Multimedia Tools and Applications, 2018, 77 : 10589 - 10614
[5] Instance Sequence Queries for Video Instance Segmentation with Transformers
Xu, Zhujun
Vivet, Damien
SENSORS, 2021, 21 (13)
[6] Instance Motion Tendency Learning for Video Panoptic Segmentation
Wang, Le
Liu, Hongzhen
Zhou, Sanping
Tang, Wei
Hua, Gang
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 764 - 778
[7] Panoptic, Instance and Semantic Relations: A Relational Context Encoder to Enhance Panoptic Segmentation
Borse, Shubhankar
Park, Hyojin
Cai, Hong
Das, Debasmit
Garrepalli, Risheek
Porikli, Fatih
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 1259 - 1269
[8] Context-aware Method for Small Object Segmentation in Road Scenes
Wang, Haitao
Chen, Guang
Li, Zhijun
Peng, Jianyi
Liu, Zhengfa
Wu, Ya
2022 INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS (ICARM 2022), 2022, : 238 - 243
[9] Video Object Segmentation in Panoptic Wild Scenes
Xu, Yuanyou
Yang, Zongxin
Yang, Yi
PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 1604 - 1612
[10] Context-Aware MIML Instance Annotation
Briggs, Forrest
Fern, Xiaoli Z.
Raich, Raviv
2013 IEEE 13TH INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2013, : 41 - 50
