A Unified Video Segmentation Benchmark: Annotation, Metrics and Analysis

被引:70
|
作者
Galasso, Fabio [1 ]
Nagaraja, Naveen Shankar [2 ]
Cardenas, Tatiana Jimenez [2 ]
Brox, Thomas [2 ]
Schiele, Bernt [1 ]
机构
[1] Max Planck Inst Informat, Berlin, Germany
[2] Univ Freiburg, Freiburg, Germany
关键词
D O I
10.1109/ICCV.2013.438
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video segmentation research is currently limited by the lack of a benchmark dataset that covers the large variety of subproblems appearing in video segmentation and that is large enough to avoid overfitting. Consequently, there is little analysis of video segmentation which generalizes across subtasks, and it is not yet clear which and how video segmentation should leverage the information from the still-frames, as previously studied in image segmentation, alongside video specific information, such as temporal volume, motion and occlusion. In this work we provide such an analysis based on annotations of a large video dataset, where each video is manually segmented by multiple persons. Moreover, we introduce a new volume-based metric that includes the important aspect of temporal consistency, that can deal with segmentation hierarchies, and that reflects the tradeoff between over-segmentation and segmentation accuracy.
引用
收藏
页码:3527 / 3534
页数:8
相关论文
共 50 条
  • [31] MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions
    Ding, Henghui
    Liu, Chang
    He, Shuting
    Jiang, Xudong
    Loy, Chen Change
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 2694 - 2703
  • [32] Learning spatiotemporal relationships with a unified framework for video object segmentation
    Mei, Jianbiao
    Wang, Mengmeng
    Yang, Yu
    Li, Zizhang
    Liu, Yong
    APPLIED INTELLIGENCE, 2024, 54 (08) : 6138 - 6153
  • [33] TarViS: A Unified Approach for Target-based Video Segmentation
    Athar, Ali
    Hermans, Alexander
    Luiten, Jonathon
    Ramanan, Deva
    Leibe, Bastian
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 18738 - 18748
  • [34] A UNIFIED FRAMEWORK FOR JOINT VIDEO PEDESTRIAN SEGMENTATION AND POSE TRACKING
    Li, Yanli
    Zhou, Zhong
    Wu, Wei
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2013, 27 (07)
  • [35] Unidentified Video Objects: A Benchmark for Dense, Open-World Segmentation
    Wang, Weiyao
    Feiszli, Matt
    Wang, Heng
    Tran, Du
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 10756 - 10765
  • [36] A Multitask Benchmark Dataset for Satellite Video: Object Detection, Tracking, and Segmentation
    Li, Shengyang
    Zhou, Zhuang
    Zhao, Manqi
    Yang, Jian
    Guo, Weilong
    Lv, Yixuan
    Kou, Longxuan
    Wang, Han
    Gu, Yanfeng
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [37] Performance Analysis with Unified Hardware Counter Metrics
    Gravelle, Brian J.
    Nystrom, William David
    Norris, Boyana
    2022 IEEE/ACM INTERNATIONAL WORKSHOP ON PERFORMANCE MODELING, BENCHMARKING AND SIMULATION OF HIGH PERFORMANCE COMPUTER SYSTEMS (PMBS), 2022, : 60 - 70
  • [38] Analysis of segmentation performance on the CEDAR benchmark database
    Blumenstein, M
    Verma, B
    SIXTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, PROCEEDINGS, 2001, : 1142 - 1146
  • [39] A Unified Model for Joint Chinese Word Segmentation and POS Tagging with Heterogeneous Annotation Corpora
    Zhao, Jiayi
    Qiu, Xipeng
    Huang, Xuanjing
    2013 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2013), 2013, : 227 - 230
  • [40] Video Question-Answering Techniques, Benchmark Datasets and Evaluation Metrics Leveraging Video Captioning: A Comprehensive Survey
    Khurana, Khushboo
    Deshpande, Umesh
    IEEE ACCESS, 2021, 9 (09): : 43799 - 43823