Identifying the Limits of Cross-Domain Knowledge Transfer for Pretrained Models

被引:0
|
作者
Wu, Zhengxuan [1 ]
Liu, Nelson F. [1 ]
Potts, Christopher [1 ]
机构
[1] Stanford Univ, Stanford, CA 94305 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
There is growing evidence that pretrained language models improve task-specific fine-tuning even where the task examples are radically different from those seen in training. We study an extreme case of transfer learning by providing a systematic exploration of how much transfer occurs when models are denied any information about word identity via random scrambling. In four classification tasks and two sequence labeling tasks, we evaluate LSTMs using GloVe embeddings, BERT, and baseline models. Among these models, we find that only BERT shows high rates of transfer into our scrambled domains, and for classification but not sequence labeling tasks. Our analyses seek to explain why transfer succeeds for some tasks but not others, to isolate the separate contributions of pretraining versus fine-tuning, to show that the fine-tuning process is not merely learning to unscramble the scrambled inputs, and to quantify the role of word frequency. Furthermore, our results suggest that current benchmarks may overestimate the degree to which current models actually understand language.
引用
收藏
页码:100 / 110
页数:11
相关论文
共 50 条
  • [31] CO-CAPSULE NETWORKS BASED KNOWLEDGE TRANSFER FOR CROSS-DOMAIN RECOMMENDATION
    Li, Huiyuan
    Yu, Li
    Leng, Youfang
    Du, Qihan
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 3610 - 3614
  • [32] Mitigating Negative Transfer in Cross-Domain Recommendation via Knowledge Transferability Enhancement
    Song, Zijian
    Zhang, Wenhan
    Deng, Lifang
    Zhang, Jiandong
    Wu, Zhihua
    Bian, Kaigui
    Cui, Bin
    PROCEEDINGS OF THE 30TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2024, 2024, : 2745 - 2754
  • [33] Source-Data-Free Cross-Domain Knowledge Transfer for Semantic Segmentation
    Li, Zongyao
    Togo, Ren
    Ogawa, Takahiro
    Haseyama, Miki
    IEEE OPEN JOURNAL OF SIGNAL PROCESSING, 2024, 5 : 92 - 100
  • [34] Differential Private Knowledge Transfer for Privacy-Preserving Cross-Domain Recommendation
    Chen, Chaochao
    Wu, Huiwen
    Su, Jiajie
    Lyu, Lingjuan
    Zheng, Xiaolin
    Wang, Li
    PROCEEDINGS OF THE ACM WEB CONFERENCE 2022 (WWW'22), 2022, : 1455 - 1465
  • [35] Translation as Cross-Domain Knowledge: Attention Augmentation for Unsupervised Cross-Domain Segmenting and Labeling Tasks
    Luo, Ruixuan
    Zhang, Yi
    Chen, Sishuo
    Sun, Xu
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 1896 - 1906
  • [36] Aligned Intrinsic User Factors Knowledge Transfer for Cross-domain Recommender Systems
    Sahu, Ashish Kumar
    Dwivedi, Pragya
    INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND DATA SCIENCE, 2020, 167 : 363 - 372
  • [37] REPRESENTATION OF CROSS-DOMAIN DESIGN KNOWLEDGE THROUGH ONTOLOGY BASED FUNCTIONAL MODELS
    Marinov, Milan
    Gutu, Dan
    Todorova, Janet
    Szotz, Miklos
    Simonyi, Andras
    Ovtcharova, Jivka
    PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON ENGINEERING DESIGN (ICED 11): IMPACTING SOCIETY THROUGH ENGINEERING DESIGN, VOL 6: DESIGN INFORMATION AND KNOWLEDGE, 2011, 6 : 456 - 467
  • [38] ADA-AT/DT: An Adversarial Approach for Cross-Domain and Cross-Task Knowledge Transfer
    Chavhan, Ruchika
    Jha, Ankit
    Banerjee, Biplab
    Chaudhuri, Subhasis
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021, 2021, : 3501 - 3510
  • [39] Cross-domain Cross-modal Food Transfer
    Zhu, Bin
    Ngo, Chong-Wah
    Chen, Jing-jing
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 3762 - 3770
  • [40] Social Recommendation with Cross-Domain Transferable Knowledge
    Jiang, Meng
    Cui, Peng
    Chen, Xumin
    Wang, Fei
    Zhu, Wenwu
    Yang, Shiqiang
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2015, 27 (11) : 3084 - 3097