Adaptive Multi-Resolution Attention with Linear Complexity

被引:0
|
作者
Zhang, Yao [1 ]
Ma, Yunpu [1 ]
Seidl, Thomas [1 ]
Tresp, Volker [1 ,2 ]
机构
[1] Ludwig Maximilians Univ Munchen, Inst Informat, Munich, Germany
[2] Siemens AG, Corp Technol, Munich, Germany
关键词
D O I
10.1109/IJCNN54540.2023.10191567
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Transformers have improved the state-of-the-art across numerous tasks in sequence modeling. Besides the quadratic computational and memory complexity with respect to the sequence length, the self-attention mechanism only processes information at the same scale, i.e., all attention heads are in the same resolution, resulting in the limited power of the Transformer. To remedy this, we propose a novel and efficient structure named Adaptive Multi-Resolution Attention (AdaMRA for short), which scales linearly to sequence length in terms of time and space. Specifically, we leverage a multi-resolution multihead attention mechanism, enabling attention heads to capture long-range contextual information in a coarse-to-fine fashion. Moreover, to capture the potential relations between query representation and clues of different attention granularities, we leave the decision of which resolution of attention to use to query, which further improves the model's capacity compared to the vanilla Transformer. In an effort to reduce complexity, we adopt kernel attention without degrading the performance. Extensive experiments demonstrate the effectiveness and efficiency of our model by achieving state-of-the-art speed-memory-accuracy trade-off. To facilitate AdaMRA utilization by the scientific community, the implementation will be made publicly available.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] PanoDetNet: Multi-Resolution Panoramic Object Detection With Adaptive Feature Attention
    Liu, Wenhao
    Zhang, Taijie
    Xu, Shiting
    Chang, Qingling
    Cui, Yan
    [J]. IEEE ACCESS, 2024, 12 : 104300 - 104316
  • [2] Adaptive Boundaries in Multi-Resolution Simulations
    Wagoner, Jason A.
    Dill, Ken
    Pande, Vijay
    [J]. BIOPHYSICAL JOURNAL, 2015, 108 (02) : 182A - 182A
  • [3] Multi-Resolution Attention for Personalized Item Search
    Kocayusufoglu, Furkan
    Wu, Tao
    Singh, Anima
    Roumpos, Georgios
    Cheng, Heng-Tze
    Jain, Sagar
    Chi, Ed
    Singh, Ambuj
    [J]. WSDM'22: PROCEEDINGS OF THE FIFTEENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2022, : 508 - 516
  • [4] Adaptive Multi-resolution Halftone Image Embedding
    Wen Shuangshuang
    Chen Guangxue
    Liu Zhen
    Chen Qifeng
    Xu Ruixin
    [J]. 2011 INTERNATIONAL CONFERENCE ON PHOTONICS, 3D-IMAGING, AND VISUALIZATION, 2011, 8205
  • [5] Adaptive multi-resolution labeling in virtual landscapes
    Chen, Chen
    Zhang, Liqiang
    Ma, Jingtao
    Kang, Zhizhong
    Liu, Liu
    Xue, Xiaojuan
    [J]. INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE, 2010, 24 (06) : 949 - 964
  • [6] ADAPTIVE MULTI-RESOLUTION ENCODING FOR ABR STREAMING
    Goswami, Kalyan
    Hariharan, Bhavna
    Ramachandran, Pradeep
    Giladi, Alex
    Grois, Dan
    Sampath, Kavitha
    Matheswaran, Aruna
    Mishra, Ashok Kumar
    Pikus, Kevin
    [J]. 2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 1008 - 1012
  • [7] A MULTI-RESOLUTION APPROACH TO COMPLEXITY REDUCTION IN TOMOGRAPHIC RECONSTRUCTION
    Ma, Boxiao
    Zalmai, Nour
    Loeliger, Hans-Andrea
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 6518 - 6522
  • [8] Multi-resolution character recognition by adaptive classification
    Liu, Chunmei
    Miao, Duoqian
    Wang, Chunheng
    [J]. ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS: WITH ASPECTS OF THEORETICAL AND METHODOLOGICAL ISSUES, 2007, 4681 : 1182 - +
  • [9] IMAGE SUPER-RESOLUTION USING MULTI-RESOLUTION ATTENTION NETWORK
    Liu, Anqi
    Li, Sumei
    Chang, Yongli
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 1610 - 1614
  • [10] MULTI-RESOLUTION MULTI-HEAD ATTENTION IN DEEP SPEAKER EMBEDDING
    Wang, Zhiming
    Yao, Kaisheng
    Li, Xiaolong
    Fang, Shuo
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6464 - 6468