MDL-NAS: A Joint Multi-domain Learning Framework for Vision Transformer

被引:8
|
作者
Wang, Shiguang [1 ,3 ]
Xie, Tao [2 ,3 ]
Cheng, Jian [1 ]
Zhang, Xingcheng [3 ]
Liu, Haijun [4 ]
机构
[1] Univ Elect Sci & Technol China, Chengdu, Peoples R China
[2] Harbin Inst Technol, Harbin, Peoples R China
[3] SenseTime Res, Hong Kong, Peoples R China
[4] Chongqing Univ, Chongqing, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1109/CVPR52729.2023.01924
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work, we introduce MDL-NAS, a unified framework that integrates multiple vision tasks into a manageable supernet and optimizes these tasks collectively under diverse dataset domains. MDL-NAS is storage-efficient since multiple models with a majority of shared parameters can be deposited into a single one. Technically, MDL-NAS constructs a coarse-to-fine search space, where the coarse search space offers various optimal architectures for different tasks while the fine search space provides fine-grained parameter sharing to tackle the inherent obstacles of multi-domain learning. In the fine search space, we suggest two parameter sharing policies, i.e., sequential sharing policy and mask sharing policy. Compared with previous works, such two sharing policies allow for the partial sharing and non-sharing of parameters at each layer of the network, hence attaining real fine-grained parameter sharing. Finally, we present a joint-subnet search algorithm that finds the optimal architecture and sharing parameters for each task within total resource constraints, challenging the traditional practice that downstream vision tasks are typically equipped with backbone networks designed for image classification. Experimentally, we demonstrate that MDL-NAS families fitted with non-hierarchical or hierarchical transformers deliver competitive performance for all tasks compared with state-of-the-art methods while maintaining efficient storage deployment and computation. We also demonstrate that MDL-NAS allows incremental learning and evades catastrophic forgetting when generalizing to a new task.
引用
收藏
页码:20094 / 20104
页数:11
相关论文
共 50 条
  • [41] Cooperative Jamming Resource Allocation with Joint Multi-Domain Information Using Evolutionary Reinforcement Learning
    Xin, Qi
    Xin, Zengxian
    Chen, Tao
    REMOTE SENSING, 2024, 16 (11)
  • [42] Multi-Task and Multi-Domain Learning with Tensor Networks
    Garg, Yash
    Prater-Bennette, Ashley
    Asif, M. Salman
    SIGNAL PROCESSING, SENSOR/INFORMATION FUSION, AND TARGET RECOGNITION XXXII, 2023, 12547
  • [43] A Coupling Framework for Multi-Domain Modelling and Multi-Physics Simulations
    Amirante, Dario
    Ganine, Vlad
    Hills, Nicholas J.
    Adami, Paolo
    ENTROPY, 2021, 23 (06)
  • [44] A Multi-Domain Framework for Modelling Educational Games: Towards the Effective Development in Perspective of Learning Theories
    Ahmad, Mifrah
    Ab Rahim, Lukman
    Arshad, Noreen Izza
    2015 INTERNATIONAL SYMPOSIUM ON MATHEMATICAL SCIENCES AND COMPUTING RESEARCH (ISMSC), 2015, : 54 - 59
  • [45] Thoughts About the Role of the Metaverse in Multi-Domain Joint Operations
    Zhang, Hangfei
    Li, Lin
    Cheng, Guo
    Jin, Lute
    Lu, Ying
    PROCEEDINGS OF 2024 12TH CHINA CONFERENCE ON COMMAND AND CONTROL, VOL I, 2024, 1266 : 276 - 286
  • [46] Multi-domain Causal Structure Learning in Linear Systems
    Ghassami, AmirEmad
    Kiyavash, Negar
    Huang, Biwei
    Zhang, Kun
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [47] Cognitive Heterogeneous Multi-Domain Networks with Hierarchical Learning
    Ben Yoo, S. J.
    2018 IEEE PHOTONICS SOCIETY SUMMER TOPICAL MEETING SERIES (SUM), 2018,
  • [48] Budget-Aware Adapters for Multi-Domain Learning
    Berriel, Rodrigo
    Lathuiliere, Stephane
    Nabi, Moin
    Klein, Tassilo
    Oliveira-Santos, Thiago
    Sebe, Nicu
    Ricci, Elisa
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 382 - 391
  • [49] Learning a Pricing Strategy in Multi-Domain DWDM Networks
    Gurzi, Pasquale
    Steenhaut, Kris
    Nowe, Ann
    Vrancx, Peter
    2011 18TH IEEE WORKSHOP ON LOCAL AND METROPOLITAN AREA NETWORKS (LANMAN), 2011,
  • [50] Exploiting data diversity in multi-domain federated learning
    Madni, Hussain Ahmad
    Umer, Rao Muhammad
    Foresti, Gian Luca
    MACHINE LEARNING-SCIENCE AND TECHNOLOGY, 2024, 5 (02):