Weakly-supervised scene parsing with multiple contextual cues

被引:3
|
作者
Li, Teng [1 ]
Wu, Xinyu [2 ]
Ni, Bingbing [3 ]
Lu, Ke [4 ]
Yan, Shuicheng [5 ]
机构
[1] Anhui Univ, Coll Elect Engn & Automat, Hefei, Peoples R China
[2] Chinese Acad Sci, Shenzhen Inst Adv Technol, Beijing 100864, Peoples R China
[3] Adv Digital Sci Ctr, Singapore 138632, Singapore
[4] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[5] Natl Univ Singapore, Dept Elect & Comp Engn, Singapore 117548, Singapore
关键词
Scene parsing; Weakly-supervised; Multiple context; IMAGE; CLASSIFICATION; KERNELS;
D O I
10.1016/j.ins.2015.06.024
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Scene parsing, fully labeling an image with each region corresponding to a label, is one of the core problems of computer vision. Previous methods to this problem usually rely on patch-level models trained from well labeled data. In this paper, we propose a weakly-supervised scene parsing algorithm that semantically parses a collection of images with multi-label, which is guided by the top-down category models and bottom-up local patch contexts across images that closely related segments usually have similar labels. Images are segmented to patches on multi-level and the contextual relations of patches are discovered via sparse representation by l(1) minimization, based on which a graph is constructed. The multi-level spatial context of patches is also embedded in the graph, based on which image-level labels can be propagated to segments optimally. The contextual patch labeling process is formulated in an optimization framework and solved by a convergent iterative method. The category models are learned from the decomposed label representations of the image set and applied to the segments. Final labeling is obtained by combining all the information on pixel level. The effectiveness of the proposed method is demonstrated in experiments on two benchmark datasets and comparisons are taken. (C) 2015 Elsevier Inc. All rights reserved.
引用
收藏
页码:59 / 72
页数:14
相关论文
共 50 条
  • [21] Semantic and Temporal Contextual Correlation Learning for Weakly-Supervised Temporal Action Localization
    Fu, Jie
    Gao, Junyu
    Xu, Changsheng
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (10) : 12427 - 12443
  • [22] Revisit Weakly-Supervised Audio-Visual Video Parsing from the Language Perspective
    Fan, Yingying
    Wu, Yu
    Du, Bo
    Lin, Yutian
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [23] Revisit Weakly-Supervised Audio-Visual Video Parsing from the Language Perspective
    Fan, Yingying
    Wu, Yu
    Du, Bo
    Lin, Yutian
    Advances in Neural Information Processing Systems, 2023, 36
  • [24] SCANet: Scene Complexity Aware Network for Weakly-Supervised Video Moment Retrieval
    Yoon, Sunjae
    Koo, Gwanhyeong
    Kim, Dahyun
    Yoo, Chang D.
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 13530 - 13540
  • [25] Weakly Supervised Scene Parsing with Point-Based Distance Metric Learning
    Qian, Rui
    Wei, Yunchao
    Shi, Honghui
    Li, Jiachen
    Liu, Jiaying
    Huang, Thomas
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 8843 - 8850
  • [26] Weakly-Supervised Crack Detection
    Inoue, Yuki
    Nagayoshi, Hiroto
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (11) : 12050 - 12061
  • [27] Weakly supervised parsing with rules
    Cerisara, C.
    Lorenzo, A.
    Kral, P.
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 2191 - 2195
  • [28] Multi-modal Grouping Network for Weakly-Supervised Audio-Visual Video Parsing
    Mo, Shentong
    Tian, Yapeng
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [29] CPCM: Contextual Point Cloud Modeling for Weakly-supervised Point Cloud Semantic Segmentation
    Liu, Lizhao
    Zhuang, Zhuangwei
    Huang, Shangxin
    Xiao, Xunlong
    Xiang, Tianhang
    Chen, Cen
    Wang, Jingdong
    Tan, Mingkui
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 18367 - 18376
  • [30] Joint-Modal Label Denoising for Weakly-Supervised Audio-Visual Video Parsing
    Cheng, Haoyue
    Liu, Zhaoyang
    Zhou, Hang
    Qian, Chen
    Wu, Wayne
    Wang, Limin
    COMPUTER VISION, ECCV 2022, PT XXXIV, 2022, 13694 : 431 - 448