Probabilistic Model Distillation for Semantic Correspondence

被引:11
|
作者
Li, Xin [1 ]
Fan, Deng-Ping [2 ]
Yang, Fan [1 ]
Luo, Ao [3 ]
Cheng, Hong [4 ]
Liu, Zicheng [5 ]
机构
[1] Grp 42 G42, Abu Dhabi, U Arab Emirates
[2] Inception Inst AI, Abu Dhabi, U Arab Emirates
[3] Megvii Technol, Beijing, Peoples R China
[4] UESTC, Chengdu, Peoples R China
[5] Microsoft, Redmond, WA USA
关键词
D O I
10.1109/CVPR46437.2021.00742
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Semantic correspondence is a fundamental problem in computer vision, which aims at establishing dense correspondences across images depicting different instances under the same category. This task is challenging due to large intra-class variations and a severe lack of ground truth. A popular solution is to learn correspondences from synthetic data. However, because of the limited intra-class appearance and background variations within synthetically generated training data, the model's capability for handling "real" image pairs using such strategy is intrinsically constrained. We address this problem with the use of a novel Probabilistic Model Distillation (PMD) approach which transfers knowledge learned by a probabilistic teacher model on synthetic data to a static student model with the use of unlabeled real image pairs. A probabilistic supervision reweighting (PSR) module together with a confidence-aware loss (CAL) is used to mine the useful knowledge and alleviate the impact of errors. Experimental results on a variety of benchmarks show that our PMD achieves state-of-the-art performance. To demonstrate the generalizability of our approach, we extend PMD to incorporate stronger supervision for better accuracy - the probabilistic teacher is trained with stronger key-point supervision. Again, we observe the superiority of our PMD. The extensive experiments verify that PMD is able to infer more reliable supervision signals from the probabilistic teacher for representation learning and largely alleviate the influence of errors in pseudo labels. Cade is avaliable at https://github.com/fanyang587/PMD.
引用
收藏
页码:7501 / 7510
页数:10
相关论文
共 50 条
  • [1] A probabilistic model for semantic advertising
    Chen, Jin-Yuan
    Zheng, Hai-Tao
    Jiang, Yong
    Xia, Shu-Tao
    Zhao, Cong-Zhi
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2019, 59 (02) : 387 - 412
  • [2] A probabilistic model for semantic advertising
    Jin-Yuan Chen
    Hai-Tao Zheng
    Yong Jiang
    Shu-Tao Xia
    Cong-Zhi Zhao
    [J]. Knowledge and Information Systems, 2019, 59 : 387 - 412
  • [3] A probabilistic model of binocular fixation and correspondence
    Hansard, Miles
    Horaud, Radu
    [J]. PERCEPTION, 2011, 40 (01) : 116 - 116
  • [4] A probabilistic model for Latent Semantic Indexing
    Ding, CHQ
    [J]. JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2005, 56 (06): : 597 - 608
  • [5] A Probabilistic Model of Semantic Plausibility in Sentence Processing
    Pado, Ulrike
    Crocker, Matthew W.
    Keller, Frank
    [J]. COGNITIVE SCIENCE, 2009, 33 (05) : 794 - 838
  • [6] Probabilistic Optimization of Semantic Process Model Matching
    Leopold, Henrik
    Niepert, Mathias
    Weidlich, Matthias
    Mendling, Jan
    Dijkman, Remco
    Stuckenschmidt, Heiner
    [J]. BUSINESS PROCESS MANAGEMENT, BPM 2012, 2012, 7481 : 319 - 334
  • [7] The Hidden Markov Topic Model: A Probabilistic Model of Semantic Representation
    Andrews, Mark
    Vigliocco, Gabriella
    [J]. TOPICS IN COGNITIVE SCIENCE, 2010, 2 (01) : 101 - 113
  • [8] Two Semantic Issues in a Probabilistic Rough Set Model
    Yao, Yiyu
    [J]. FUNDAMENTA INFORMATICAE, 2011, 108 (3-4) : 249 - 265
  • [9] Improving the multimodal probabilistic semantic model by ELM classifiers
    Zhang, Yu
    Yuan, Ye
    Guo, Fangda
    Wang, Yishu
    Wang, Guoren
    [J]. JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2018, 355 (04): : 1967 - 1990
  • [10] A Probabilistic Model for Correspondence Problems Using Random Walks with Restart
    Kim, Tae Hooh
    Lee, Kyoung Mu
    Lee, Sang Uk
    [J]. COMPUTER VISION - ACCV 2009, PT III, 2010, 5996 : 416 - 425