Learning Semantic Alignment Using Global Features and Multi-Scale Confidence

被引:0
|
作者
Xu, Huaiyuan [1 ]
Liao, Jing [2 ]
Liu, Huaping [3 ]
Sun, Yuxiang [1 ]
机构
[1] Hong Kong Polytech Univ, Dept Mech Engn, Kowloon, Hong Kong, Peoples R China
[2] City Univ Hong Kong, Dept Comp Sci, Kowloon, Hong Kong, Peoples R China
[3] Tsinghua Univ, Inst Artificial Intelligence, Dept Comp Sci & Technol, Beijing 100084, Peoples R China
基金
中国国家自然科学基金;
关键词
Semantics; Correlation; Feature extraction; Transformers; Training; Task analysis; Probabilistic logic; Semantic alignment; enhancement transformer; probabilistic correlation computation; cross-domain alignment;
D O I
10.1109/TCSVT.2023.3288370
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Semantic alignment aims to establish pixel correspondences between images based on semantic consistency. It can serve as a fundamental component for various downstream computer vision tasks, such as style transfer and exemplar-based colorization, etc. Many existing methods use local features and their cosine similarities to infer semantic alignment. However, they struggle with significant intra-class variation of objects, such as appearance, size, etc. In other words, contents with the same semantics tend to be significantly different in vision. To address this issue, we propose a novel deep neural network of which the core lies in global feature enhancement and adaptive multi-scale inference. Specifically, two modules are proposed: an enhancement transformer for enhancing semantic features with global awareness; a probabilistic correlation module for adaptively fusing multi-scale information based on the learned confidence scores. We use the unified network architecture to achieve two types of semantic alignment, namely, cross-object semantic alignment and cross-domain semantic alignment. Experimental results demonstrate that our method achieves competitive performance on five standard cross-object semantic alignment benchmarks, and outperforms the state of the arts in cross-domain semantic alignment.
引用
收藏
页码:897 / 910
页数:14
相关论文
共 50 条
  • [41] Semantic segmentation of urban land classes using a multi-scale dataset
    Wang, Qian
    Hu, Chunhua
    Wang, Hanzhao
    Wang, Rui
    Xie, Yuning
    Zhao, Yuankun
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2024, 45 (02) : 653 - 675
  • [42] Integrating Color Information and Multi-Scale Geometric Features for Point Cloud Semantic Segmentation
    Zhang H.
    Xu R.
    Zheng N.
    Hao M.
    Liu D.
    Shi W.
    Journal of Geo-Information Science, 2024, 26 (06) : 1562 - 1575
  • [43] Deep Multi-Scale Features Learning for Distorted Image Quality Assessment
    Zhou, Wei
    Chen, Zhibo
    2021 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2021,
  • [44] Learning multi-scale synergic discriminative features for prostate image segmentation
    Jia, Haozhe
    Cai, Weidong
    Huang, Heng
    Xia, Yong
    PATTERN RECOGNITION, 2022, 126
  • [45] Aggregating multi-scale contextual features from multiple stages for semantic image segmentation
    Jiang, Dingchao
    Qu, Hua
    Zhao, Jihong
    Zhao, Jianlong
    Hsieh, Meng-Yen
    CONNECTION SCIENCE, 2021, 33 (03) : 605 - 622
  • [46] Scene understanding based on Multi-Scale Pooling of deep learning features
    Li, DongYang
    Zhou, Yue
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON AUTOMATION, MECHANICAL CONTROL AND COMPUTATIONAL ENGINEERING, 2015, 124 : 1732 - 1737
  • [47] Effective Point Cloud Analysis Using Multi-Scale Features
    Zheng, Qiang
    Sun, Jian
    SENSORS, 2021, 21 (16)
  • [48] Compact Multi-scale Periocular Recognition Using SAFE Features
    Alonso-Fernandez, Fernando
    Mikaelyan, Anna
    Bigun, Josef
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 1455 - 1460
  • [49] Multi-scale Context Intertwining for Semantic Segmentation
    Lin, Di
    Ji, Yuanfeng
    Lischinski, Dani
    Cohen-Or, Daniel
    Huang, Hui
    COMPUTER VISION - ECCV 2018, PT III, 2018, 11207 : 622 - 638
  • [50] Dynamic Multi-scale Filters for Semantic Segmentation
    He, Junjun
    Deng, Zhongying
    Qiao, Yu
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 3561 - 3571