ProFormer: Towards On-Device LSH Projection Based Transformers

被引:0
|
作者
Sankar, Chinnadhurai [1 ]
Ravi, Sujith [2 ]
Kozareva, Zornitsa [3 ]
机构
[1] Univ Montreal, Mila, Montreal, PQ, Canada
[2] Amazon Alexa, Sunnyvale, CA USA
[3] Google, Mountain View, CA 94043 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
At the heart of text based neural models lay word representations, which are powerful but occupy a lot of memory making it challenging to deploy to devices with memory constraints such as mobile phones, watches and IoT. To surmount these challenges, we introduce ProFormer - a projection based transformer architecture that is faster and lighter making it suitable to deploy to memory constraint devices and preserve user privacy. We use LSH projection layer to dynamically generate word representations on-the-fly without embedding lookup tables leading to significant memory footprint reduction from O(V:d) to O(T), where V is the vocabulary size, d is the embedding dimension size and T is the dimension of the LSH projection representation. We also propose a local projection attention (LPA) layer, which uses self-attention to transform the input sequence of N LSH word projections into a sequence of N=K representations reducing the computations quadratically by O(K-2). We evaluate ProFormer on multiple text classification tasks and observed improvements over prior state-of-the-art on-device approaches for short text classification and comparable performance for long text classification tasks. ProFormer is also competitive with other popular but highly resource-intensive approaches like BERT and even outperforms small-sized BERT variants with significant resource savings - reduces the embedding memory footprint from 92.16 MB to 1.7 KB and requires 16x less computation overhead, which is very impressive making it the fastest and smallest on-device model.
引用
收藏
页码:2823 / 2828
页数:6
相关论文
共 50 条
  • [1] OnDev-LCT: On-Device Lightweight Convolutional Transformers towards federated learning
    Thwal, Chu Myaet
    Nguyen, Minh N. H.
    Tun, Ye Lin
    Kim, Seong Tae
    Thai, My T.
    Hong, Choong Seon
    [J]. NEURAL NETWORKS, 2024, 170 : 635 - 649
  • [2] Accelerating Transformers with Fourier-Based Attention for Efficient On-Device Inference
    Jo, Hyeonjin
    Sim, Chaerin
    Park, Jaewoo
    Lee, Jongeun
    [J]. 2023 20TH INTERNATIONAL SOC DESIGN CONFERENCE, ISOCC, 2023, : 203 - 204
  • [3] On-device Structured and Context Partitioned Projection Networks
    Ravi, Sujith
    Kozareva, Zornitsa
    [J]. 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 3784 - 3793
  • [4] Towards Independent On-device Artificial Intelligence
    Wu, Yawen
    Hu, Jingtong
    [J]. 2022 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI (ISVLSI 2022), 2022, : 351 - 351
  • [5] ProSeqo: Projection Sequence Networks for On-Device Text Classification
    Kozareva, Zornitsa
    Ravi, Sujith
    [J]. 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 3894 - 3903
  • [6] PRADO: Projection Attention Networks for Document Classification On-Device
    Kaliamoorthi, Prabhu
    Ravi, Sujith
    Kozareva, Zornitsa
    [J]. 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 5012 - 5021
  • [7] Towards Semantic Management of On-Device Applications in Industrial IoT
    Ren, Haoyu
    Anicic, Darko
    Runkler, Thomas A.
    [J]. ACM TRANSACTIONS ON INTERNET TECHNOLOGY, 2022, 22 (04)
  • [8] Neuromemristive Multi-Layer Random Projection Network with On-Device Learning
    Zyarah, Abdullah M.
    Kudithipudi, Dhireesha
    [J]. 2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [9] Towards Communication-Efficient Model Updating for On-Device Session-Based Recommendation
    Xia, Xin
    Yu, Junliang
    Xu, Guandong
    Yin, Hongzhi
    [J]. PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023, 2023,
  • [10] A First Step Towards On-Device Monitoring of Body Sounds in the Wild
    Tailor, Shyam A.
    Chauhan, Jagmohan
    Mascolo, Cecilia
    [J]. UBICOMP/ISWC '20 ADJUNCT: PROCEEDINGS OF THE 2020 ACM INTERNATIONAL JOINT CONFERENCE ON PERVASIVE AND UBIQUITOUS COMPUTING AND PROCEEDINGS OF THE 2020 ACM INTERNATIONAL SYMPOSIUM ON WEARABLE COMPUTERS, 2020, : 708 - 711