A Tree-Structured Multitask Model Architectures Recommendation System

被引:1
|
作者
Zhang, Lijun [1 ]
Liu, Xiao [1 ]
Guan, Hui [1 ]
机构
[1] Univ Massachusetts, Coll Informat & Comp Sci, Amherst, MA 01002 USA
关键词
Task analysis; Computer architecture; Computational modeling; Training; Detectors; Predictive models; Recommender systems; Architecture representation; automated recommendation system; multitask learning (MTL);
D O I
10.1109/TNNLS.2023.3288537
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Neural networks with branched architectures, namely, tree-structured models, have been employed to jointly tackle multiple vision tasks in the context of multitask learning (MTL). Such tree-structured networks typically start with a number of shared layers, after which different tasks branch out into their own sequence of layers. Hence, the major challenge is to determine where to branch out for each task given a backbone model to optimize for both task accuracy and computation efficiency. To address the challenge, this article proposes a recommendation system that, given a set of tasks and a convolutional neural network-based backbone model, automatically suggests tree-structured multitask architectures that could achieve a high task performance while meeting a user-specified computation budget without performing model training. Extensive evaluations on popular MTL benchmarks show that the recommended architectures could achieve competitive task accuracy and computation efficiency compared with state-of-the-art MTL methods. Our tree-structured multitask model recommender is open-sourced and available at <uri>https://github.com/zhanglijun95/TreeMTL</uri>.
引用
收藏
页码:15578 / 15588
页数:11
相关论文
共 50 条
  • [1] A Tree-Structured Multitask Model Architectures Recommendation System
    Zhang, Lijun
    Liu, Xiao
    Guan, Hui
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (11) : 15578 - 15588
  • [2] Analysis of Tree-Structured Architectures for Code Generation
    Dahal, Samip
    Maharana, Adyasha
    Bansal, Mohit
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 4382 - 4391
  • [3] PIPELINING TREE-STRUCTURED ALGORITHMS ON SIMD ARCHITECTURES
    BARNARD, DT
    SKILLICORN, DB
    INFORMATION PROCESSING LETTERS, 1990, 35 (02) : 79 - 84
  • [4] Synthesis of VLSI architectures for tree-structured image coding
    Park, N
    Bae, J
    Prasanna, VK
    INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, PROCEEDINGS - VOL II, 1996, : 999 - 1002
  • [5] Tree-Structured Neural Topic Model
    Isonuma, Masaru
    Mori, Junichiro
    Bollegala, Danushka
    Sakata, Ichiro
    58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 800 - 806
  • [6] Large-Scale Interactive Recommendation With Tree-Structured Reinforcement Learning
    Chen, Haokun
    Zhu, Chenxu
    Tang, Ruiming
    Zhang, Weinan
    He, Xiuqiang
    Yu, Yong
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (04) : 4018 - 4032
  • [7] A Nonparametric Regression Model With Tree-Structured Response
    Wang, Yuan
    Marron, J. S.
    Aydin, Burcu
    Ladha, Alim
    Bullitt, Elizabeth
    Wang, Haonan
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2012, 107 (500) : 1272 - 1285
  • [8] Large-Scale Interactive Recommendation with Tree-Structured Policy Gradient
    Chen, Haokun
    Dai, Xinyi
    Cai, Han
    Zhang, Weinan
    Wang, Xuejian
    Tang, Ruiming
    Zhang, Yuzhou
    Yu, Yong
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 3312 - 3320
  • [9] A multiresolution tree-structured spatial linear model
    Zhu, J
    Yue, W
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2005, 14 (01) : 168 - 184
  • [10] Learning Tree-Structured Data in the Model Space
    Dong, Ya-dong
    Lv, Sheng-fei
    2016 INTERNATIONAL CONFERENCE ON INFORMATION SYSTEM AND ARTIFICIAL INTELLIGENCE (ISAI 2016), 2016, : 258 - 266