Adapter-Based Contextualized Meta Embeddings

Cited by: 0
Authors
O'Neill, James [1]
Dutta, Sourav [2]
Affiliations
[1] DynamoFL Ireland, Dublin, Ireland
[2] Huawei Ireland Res Ctr, Dublin, Ireland
Keywords
LoRA; Adapter; Meta Embedding; Multilingual
DOI
10.1007/978-981-97-6125-8_7
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper introduces MetaLoRA and MetaUniPELT, two meta-embedding approaches that extend Low-Rank Adaptation (LoRA) and adapters for fine-tuning and combining multiple pretrained models. We find that both models improve performance across a range of monolingual and multilingual tasks, outperforming baselines such as fully fine-tuned single models, simple concatenation of pretrained embeddings with classification layer fine-tuning, and soft-voting ensembles. On the XGLUE benchmark, we find a 1.7 test score increase over the best fully fine-tuned model and a 0.24 increase over the best fully fine-tuned ensemble on sentence classification tasks. Our results underscore the potential of parameter-efficient fine-tuning of ensembles as an efficient and effective alternative to full fine-tuning and standard ensemble methods.
Pages: 82-90
Number of pages: 9