Semantic Clustering for Robust Fine-Grained Scene Recognition

被引:8
|
作者
George, Marian [1 ]
Dixit, Mandar [2 ]
Zogg, Gabor [1 ]
Vasconcelos, Nuno [2 ]
机构
[1] ETH, Dept Comp Sci, Zurich, Switzerland
[2] Univ Calif San Diego, Stat & Visual Comp Lab, San Diego, CA USA
来源
关键词
D O I
10.1007/978-3-319-46448-0_47
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In domain generalization, the knowledge learnt from one or multiple source domains is transferred to an unseen target domain. In this work, we propose a novel domain generalization approach for fine-grained scene recognition. We first propose a semantic scene descriptor that jointly captures the subtle differences between fine-grained scenes, while being robust to varying object configurations across domains. We model the occurrence patterns of objects in scenes, capturing the informativeness and discriminability of each object for each scene. We then transform such occurrences into scene probabilities for each scene image. Second, we argue that scene images belong to hidden semantic topics that can be discovered by clustering our semantic descriptors. To evaluate the proposed method, we propose a new fine-grained scene dataset in cross-domain settings. Extensive experiments on the proposed dataset and three benchmark scene datasets show the effectiveness of the proposed approach for fine-grained scene transfer, where we outperform state-of-the-art scene recognition and domain generalization methods.
引用
收藏
页码:783 / 798
页数:16
相关论文
共 50 条
  • [1] A Semantic-driven Image Scene Fine-grained Enhancement Recognition
    Qu, Dongyang
    Li, Yaling
    Luo, Xiaoyan
    Shi, Xiaofeng
    [J]. SEVENTH ASIA PACIFIC CONFERENCE ON OPTICS MANUFACTURE (APCOM 2021), 2022, 12166
  • [2] Semantic bilinear pooling for fine-grained recognition
    School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing, China
    [J]. Proc. Int. Conf. Pattern Recognit., (3660-3666):
  • [3] Semantic Bilinear Pooling for Fine-Grained Recognition
    Li, Xinjie
    Yang, Chun
    Chen, Song-Lu
    Zhu, Chao
    Yin, Xu-Cheng
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 3660 - 3666
  • [4] Fine-grained Pseudo Labels for Scene Text Recognition
    Li, Xiaoyu
    Chen, Xiaoxue
    Huang, Zuming
    Xie, Lele
    Chen, Jingdong
    Yang, Ming
    [J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 5786 - 5795
  • [5] Knowledge Mining with Scene Text for Fine-Grained Recognition
    Wang, Hao
    Liao, Junchao
    Cheng, Tianheng
    Gao, Zewen
    Liu, Hao
    Ren, Bo
    Bai, Xiang
    Liu, Wenyu
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 4614 - 4623
  • [6] Fine-Grained Crowdsourcing for Fine-Grained Recognition
    Jia Deng
    Krause, Jonathan
    Li Fei-Fei
    [J]. 2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 580 - 587
  • [7] Fine-Grained Recognition With Learnable Semantic Data Augmentation
    Pu, Yifan
    Han, Yizeng
    Wang, Yulin
    Feng, Junlan
    Deng, Chao
    Huang, Gao
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 3130 - 3144
  • [8] Semantic interaction learning for fine-grained vehicle recognition
    Zhang, Jingjing
    Lei, Jingsheng
    Yang, Shengying
    Yang, Xinqi
    [J]. COMPUTER ANIMATION AND VIRTUAL WORLDS, 2022, 33 (01)
  • [9] Discriminative semantic region selection for fine-grained recognition
    Zhang, Chunjie
    Wang, Da-Han
    Li, Haisheng
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2021, 77
  • [10] Pipelining Localized Semantic Features for Fine-Grained Action Recognition
    Zhou, Yang
    Ni, Bingbing
    Yan, Shuicheng
    Moulin, Pierre
    Tian, Qi
    [J]. COMPUTER VISION - ECCV 2014, PT IV, 2014, 8692 : 481 - 496