MDL-NAS: A Joint Multi-domain Learning Framework for Vision Transformer

被引:8
|
作者
Wang, Shiguang [1 ,3 ]
Xie, Tao [2 ,3 ]
Cheng, Jian [1 ]
Zhang, Xingcheng [3 ]
Liu, Haijun [4 ]
机构
[1] Univ Elect Sci & Technol China, Chengdu, Peoples R China
[2] Harbin Inst Technol, Harbin, Peoples R China
[3] SenseTime Res, Hong Kong, Peoples R China
[4] Chongqing Univ, Chongqing, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1109/CVPR52729.2023.01924
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work, we introduce MDL-NAS, a unified framework that integrates multiple vision tasks into a manageable supernet and optimizes these tasks collectively under diverse dataset domains. MDL-NAS is storage-efficient since multiple models with a majority of shared parameters can be deposited into a single one. Technically, MDL-NAS constructs a coarse-to-fine search space, where the coarse search space offers various optimal architectures for different tasks while the fine search space provides fine-grained parameter sharing to tackle the inherent obstacles of multi-domain learning. In the fine search space, we suggest two parameter sharing policies, i.e., sequential sharing policy and mask sharing policy. Compared with previous works, such two sharing policies allow for the partial sharing and non-sharing of parameters at each layer of the network, hence attaining real fine-grained parameter sharing. Finally, we present a joint-subnet search algorithm that finds the optimal architecture and sharing parameters for each task within total resource constraints, challenging the traditional practice that downstream vision tasks are typically equipped with backbone networks designed for image classification. Experimentally, we demonstrate that MDL-NAS families fitted with non-hierarchical or hierarchical transformers deliver competitive performance for all tasks compared with state-of-the-art methods while maintaining efficient storage deployment and computation. We also demonstrate that MDL-NAS allows incremental learning and evades catastrophic forgetting when generalizing to a new task.
引用
收藏
页码:20094 / 20104
页数:11
相关论文
共 50 条
  • [21] MULTI-DOMAIN LEARNING BY META-LEARNING: TAKING OPTIMAL STEPS IN MULTI-DOMAIN LOSS LANDSCAPES BY INNER-LOOP LEARNING
    Sicilia, Anthony
    Zhao, Xingchen
    Minhas, Davneet S.
    O'Connor, Erin E.
    Aizenstein, Howard J.
    Klunk, William E.
    Tudorascu, Dana L.
    Hwang, Seong Jae
    2021 IEEE 18TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI), 2021, : 650 - 654
  • [22] Efficient Multi-Domain Learning by Covariance Normalization
    Li, Yunsheng
    Vasconcelos, Nuno
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 5419 - 5428
  • [23] Unpaired Multi-Domain Causal Representation Learning
    Sturma, Nils
    Squires, Chandler
    Drton, Mathias
    Uhler, Caroline
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [24] Multi-Domain Incremental Learning for Semantic Segmentation
    Garg, Prachi
    Saluja, Rohit
    Balasubramanian, Vineeth N.
    Arora, Chetan
    Subramanian, Anbumani
    Jawahar, C., V
    2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 2080 - 2090
  • [25] Towards Learning Multi-Domain Crowd Counting
    Yan, Zhaoyi
    Li, Pengyu
    Wang, Biao
    Ren, Dongwei
    Zuo, Wangmeng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (11) : 6544 - 6557
  • [26] Argmax Centroids: with Applications to Multi-domain Learning
    Gong, Chengyue
    Ye, Mao
    Liu, Qiang
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [27] Multi-Domain Generalized Graph Meta Learning
    Lin, Mingkai
    Li, Wenzhong
    Li, Ding
    Chen, Yizhou
    Li, Guohao
    Lu, Sanglu
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 4, 2023, : 4479 - 4487
  • [28] EFFICIENT MULTI-DOMAIN DICTIONARY LEARNING WITH GANS
    Wu, Cho Ying
    Neumann, Ulrich
    2019 7TH IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (IEEE GLOBALSIP), 2019,
  • [29] Collaborative Learning in Multi-Domain Optical Networks
    Chen, Xiaoliang
    Proietti, Roberto
    Liu, Che-Yu
    Ben Yoo, S. J.
    2020 ASIA COMMUNICATIONS AND PHOTONICS CONFERENCE (ACP) AND INTERNATIONAL CONFERENCE ON INFORMATION PHOTONICS AND OPTICAL COMMUNICATIONS (IPOC), 2020,
  • [30] A Tensor Based Framework for Multi-Domain Communication Systems
    Venugopal, Adithya
    Leib, Harry
    IEEE OPEN JOURNAL OF THE COMMUNICATIONS SOCIETY, 2020, 1 : 606 - 633