FOREGROUND SEGMENTATION FOR STATIC VIDEO VIA MULTI-CORE AND MULTI-MODAL GRAPH CUT

Citations: 0
Authors
Chang, Lun-Yu [1]
Hsu, Winston H. [1]
Affiliations
[1] Natl Taiwan Univ, Taipei 10764, Taiwan
Keywords
foreground detection; graph cut; surveillance; multi-core; silhouette
DOI
Not available
Chinese Library Classification number
TP31 [Computer Software]
Discipline classification codes
081202; 0835
Abstract
Foreground detection is essential for semantic understanding and discovery in surveillance videos, but it still suffers from inefficiency and poor shape or silhouette detection. We argue for leveraging multiple modalities (e.g., color appearance, foreground likelihood, and spatial continuity) for foreground detection and propose a rigorous fusion method based on graph cut. We further devise three strategies (e.g., dividing the graph cut problem into several subtasks and exploiting multi-core platforms) to speed up detection. In experiments on open benchmarks, the proposed method outperforms rival approaches in both detection accuracy and frame rate.
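As a rough illustration of what fusing these modalities in a graph cut can look like, the block below writes a generic binary labeling energy in the standard graph-cut form. It is a sketch under stated assumptions, not the formulation used in the paper: the symbols D_p, V_pq, lambda, beta, the mixing weight w, and the specific choice of terms are all hypothetical.

```latex
% Hypothetical energy; the paper's actual terms and weights are not given in this record.
% l_p \in \{0, 1\} labels pixel p as background (0) or foreground (1).
% \mathcal{P} is the set of pixels, \mathcal{N} the set of neighboring pixel pairs.
E(l) = \sum_{p \in \mathcal{P}} D_p(l_p)
     + \lambda \sum_{(p,q) \in \mathcal{N}} V_{pq}(l_p, l_q)

% Data term: assumed mix of a color-appearance cost and a foreground-likelihood cost.
D_p(l_p) = -\, w \log P_{\mathrm{color}}(I_p \mid l_p)
           - (1 - w) \log P_{\mathrm{fg}}(l_p \mid p), \qquad 0 \le w \le 1

% Smoothness term: assumed contrast-sensitive Potts penalty for spatial continuity.
V_{pq}(l_p, l_q) = [\, l_p \ne l_q \,] \, \exp\!\left(-\beta \, \lVert I_p - I_q \rVert^2\right)
```

With a nonnegative pairwise penalty of this form the energy is submodular, so a single minimum s-t cut gives its global minimum. If the frame is split into regions that are cut separately, as the subtask strategy suggests, the pairwise terms crossing region boundaries are dropped or handled separately, which is one possible source of approximation in such a scheme.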
Pages: 1362-1365
Page count: 4