Mosaicking to Distill: Knowledge Distillation from Out-of-Domain Data

被引:0
|
作者
Fang, Gongfan [1 ,4 ]
Bao, Yifan [1 ]
Song, Jie [1 ]
Wang, Xinchao [2 ]
Xie, Donglin [1 ]
Shen, Chengchao [3 ]
Song, Mingli [1 ]
机构
[1] Zhejiang Univ, Hangzhou, Peoples R China
[2] Natl Univ Singapore, Singapore, Singapore
[3] Cent South Univ, Changsha, Peoples R China
[4] Alibaba Zhejiang Univ Joint Inst Frontier Technol, Hangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Knowledge distillation (KD) aims to craft a compact student model that imitates the behavior of a pre-trained teacher in a target domain. Prior KD approaches, despite their gratifying results, have largely relied on the premise that in-domain data is available to carry out the knowledge transfer. Such an assumption, unfortunately, in many cases violates the practical setting, since the original training data or even the data domain is often unreachable due to privacy or copyright reasons. In this paper, we attempt to tackle an ambitious task, termed as out-of-domain knowledge distillation (OOD-KD), which allows us to conduct KD using only OOD data that can be readily obtained at a very low cost. Admittedly, OOD-KD is by nature a highly challenging task due to the agnostic domain gap. To this end, we introduce a handy yet surprisingly efficacious approach, dubbed as MosaicKD. The key insight behind MosaicKD lies in that, samples from various domains share common local patterns, even though their global semantic may vary significantly; these shared local patterns, in turn, can be re-assembled analogous to mosaic tiling, to approximate the in-domain data and to further alleviating the domain discrepancy. In MosaicKD, this is achieved through a four-player min-max game, in which a generator, a discriminator, a student network, are collectively trained in an adversarial manner, partially under the guidance of a pre-trained teacher. We validate MosaicKD over classification and semantic segmentation tasks across various benchmarks, and demonstrate that it yields results much superior to the state-of-the-art counterparts on OOD data. Our code is available at https://github.com/zju-vipa/MosaicKD.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Semi-supervised Training of Acoustic Models Leveraging Knowledge Transferred from Out-of-Domain Data
    Lo, Tien-Hong
    Chen, Berlin
    2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 1400 - 1404
  • [2] Using out-of-domain data to improve on-domain language models
    Iyer, R
    Ostendorf, M
    Gish, H
    IEEE SIGNAL PROCESSING LETTERS, 1997, 4 (08) : 221 - 223
  • [3] CONTEXTUAL OUT-OF-DOMAIN UTTERANCE HANDLING WITH COUNTERFEIT DATA AUGMENTATION
    Lee, Sungjin
    Shalyminov, Igor
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 7205 - 7209
  • [4] OodGAN: Generative Adversarial Network for Out-of-Domain Data Generation
    Marek, Petr
    Naik, Vishal Ishwar
    Auvray, Vincent
    Goyal, Anuj
    2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, NAACL-HLT 2021, 2021, : 238 - 245
  • [5] On Calibration and Out-of-domain Generalization
    Wald, Yoav
    Feder, Amir
    Greenfeld, Daniel
    Shalit, Uri
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [6] Improving Adversarial Robustness via Unlabeled Out-of-Domain Data
    Deng, Zhun
    Zhang, Linjun
    Ghorbani, Amirata
    Zou, James
    24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130
  • [7] GAN-BASED OUT-OF-DOMAIN DETECTION USING BOTH IN-DOMAIN AND OUT-OF-DOMAIN SAMPLES
    Liang, Chaojie
    Huang, Peijie
    Lai, Wenbin
    Ruan, Ziheng
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 7663 - 7667
  • [8] A domain-knowledge based reconstruction framework for out-of-domain news title classification
    Yuan, Shi
    Liu, Ningning
    Sun, Bo
    Zhao, Chen
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 237
  • [9] FFM: Injecting Out-of-Domain Knowledge via Factorized Frequency Modification
    Wang, Zijian
    Luo, Yadan
    Huang, Zi
    Baktashmotlagh, Mahsa
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 4124 - 4133
  • [10] Combining in-domain and out-of-domain speech data for automatic recognition of disordered speech
    Christensen, H.
    Aniol, M. B.
    Bell, P.
    Green, P.
    Hain, T.
    King, S.
    Swietojanski, P.
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3609 - 3612