Multi-path Neural Networks for On-device Multi-domain Visual Classification

被引:0
|
作者
Wang, Qifei [1 ]
Ke, Junjie [1 ]
Greaves, Joshua [2 ]
Chu, Grace [1 ]
Bender, Gabriel [2 ]
Sbaiz, Luciano [1 ]
Go, Alec [1 ]
Howard, Andrew [1 ]
Yang, Ming-Hsuan [1 ]
Gilbert, Jeff [1 ]
Milanfar, Peyman [1 ]
Yang, Feng [1 ]
机构
[1] Google Res, Mountain View, CA 94043 USA
[2] Google Brain, Mountain View, CA USA
关键词
D O I
10.1109/WACV48630.2021.00306
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning multiple domains/tasks with a single model is important for improving data efficiency and lowering inference cost for numerous vision tasks, especially on resource-constrained mobile devices. However, hand-crafting a multi-domain/task model can be both tedious and challenging. This paper proposes a novel approach to automatically learn a multi-path network for multi-domain visual classification on mobile devices. The proposed multi-path network is learned from neural architecture search by applying one reinforcement learning controller for each domain to select the best path in the super-network created from a MobileNetV3-like search space. An adaptive balanced domain prioritization algorithm is proposed to balance optimizing the joint model on multiple domains simultaneously. The determined multi-path model selectively shares parameters across domains in shared nodes while keeping domain-specific parameters within non-shared nodes in individual domain paths. This approach effectively reduces the total number of parameters and FLOPS, encouraging positive knowledge transfer while mitigating negative interference across domains. Extensive evaluations on the Visual Decathlon dataset demonstrate that the proposed multi-path model achieves state-of-the-art performance in terms of accuracy, model size, and FLOPS against other approaches using MobileNetV3-like architectures. Furthermore, the proposed method improves average accuracy over learning single-domain models individually, and reduces the total number of parameters and FLOPS by 78% and 32% respectively, compared to the approach that simply bundles single-domain models for multi-domain learning.
引用
收藏
页码:3018 / 3027
页数:10
相关论文
共 50 条
  • [1] Learning Multi-Domain Convolutional Neural Networks for Visual Tracking
    Nam, Hyeonseob
    Han, Bohyung
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 4293 - 4302
  • [2] Learning Multi-Domain Adversarial Neural Networks for Text Classification
    Ding, Xiao
    Shi, Qiankun
    Cai, Bibo
    Liu, Ting
    Zhao, Yanyan
    Ye, Qiang
    IEEE ACCESS, 2019, 7 : 40323 - 40332
  • [3] Path Computation in Multi-layer Multi-domain Networks
    Lamali, Mohamed Lamine
    Pouyllau, Helia
    Barth, Dominique
    NETWORKING 2012, PT I, 2012, 7289 : 421 - 433
  • [4] An inter-domain multi-path flow transfer mechanism based on SDN and multi-domain collaboration
    Lu You
    Li Wei
    Luo Junzhou
    Jiang Jian
    Xia Nu
    PROCEEDINGS OF THE 2015 IFIP/IEEE INTERNATIONAL SYMPOSIUM ON INTEGRATED NETWORK MANAGEMENT (IM), 2015, : 758 - 761
  • [5] Heterogeneous Multi-Domain Multi-Path Routing and Resource Sharing Allocation in Hybrid Elastic Fiber-Wireless Networks
    Zhang, Zhan
    Yin, Shan
    Yang, Chen
    Chen, Leiyu
    Chu, Yaqin
    Huang, Shanguo
    2019 ASIA COMMUNICATIONS AND PHOTONICS CONFERENCE (ACP), 2019,
  • [6] Novel Path Protection Scheme for Multi-Domain Networks
    Xu, F.
    Gu, F.
    Alazemi, H.
    Peng, M.
    Ghani, Nasir
    2011 IEEE GLOBECOM WORKSHOPS (GC WKSHPS), 2011, : 130 - 135
  • [7] Pediatric Sleep Stage Classification Using Multi-Domain Hybrid Neural Networks
    Jeon, Yonghoon
    Kim, Siwon
    Choi, Hyun-Soo
    Chung, Yoon Gi
    Choi, Sun Ah
    Kim, Hunmin
    Yoon, Sungroh
    Hwang, Hee
    Kim, Ki Joong
    IEEE ACCESS, 2019, 7 : 96495 - 96505
  • [8] Multi-path x-D recurrent neural networks for collaborative image classification
    Gao, Riqiang
    Huo, Yuankai
    Bao, Shunxing
    Tang, Yucheng
    Antic, Sanja L.
    Epstein, Emily S.
    Deppen, Steve
    Paulson, Alexis B.
    Sandler, Kim L.
    Massion, Pierre P.
    Landman, Bennett A.
    NEUROCOMPUTING, 2020, 397 : 48 - 59
  • [9] Multi-Path Learnable Wavelet Neural Network for Image Classification
    De Silva, D. D. N.
    Vithanage, H. W. M. K.
    Fernando, K. S. D.
    Piyatilake, I. T. S.
    TWELFTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2019), 2020, 11433
  • [10] Multi-Path and Group-Loss-Based Network for Speech Emotion Recognition in Multi-Domain Datasets
    Noh, Kyoung Ju
    Jeong, Chi Yoon
    Lim, Jiyoun
    Chung, Seungeun
    Kim, Gague
    Lim, Jeong Mook
    Jeong, Hyuntae
    SENSORS, 2021, 21 (05) : 1 - 18