On-device Structured and Context Partitioned Projection Networks

被引:0
|
作者
Ravi, Sujith [1 ]
Kozareva, Zornitsa [2 ]
机构
[1] Google Res, Mountain View, CA 94043 USA
[2] Google, Mountain View, CA USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A challenging problem in on-device text classification is to build highly accurate neural models that can fit in small memory footprint and have low latency. To address this challenge, we propose an on-device neural network SGNN++ which dynamically learns compact projection vectors from raw text using structured and context-dependent partition projections. We show that this results in accelerated inference and performance improvements. We conduct extensive evaluation on multiple conversational tasks and languages such as English, Japanese, Spanish and French. Our SGNN++ model significantly outperforms all baselines, improves upon existing on-device neural models and even surpasses RNN, CNN and BiLSTM models on dialog act and intent prediction. Through a series of ablation studies we show the impact of the partitioned projections and structured information leading to 10% improvement. We study the impact of the model size on accuracy and introduce quantization-aware training for SGNN++ to further reduce the model size while preserving the same quality. Finally, we show fast inference on mobile phones.
引用
收藏
页码:3784 / 3793
页数:10
相关论文
共 50 条
  • [1] ProSeqo: Projection Sequence Networks for On-Device Text Classification
    Kozareva, Zornitsa
    Ravi, Sujith
    [J]. 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 3894 - 3903
  • [2] PRADO: Projection Attention Networks for Document Classification On-Device
    Kaliamoorthi, Prabhu
    Ravi, Sujith
    Kozareva, Zornitsa
    [J]. 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 5012 - 5021
  • [3] ProFormer: Towards On-Device LSH Projection Based Transformers
    Sankar, Chinnadhurai
    Ravi, Sujith
    Kozareva, Zornitsa
    [J]. 16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), 2021, : 2823 - 2828
  • [4] On-Device Learning with Binary Neural Networks
    Vorabbi, Lorenzo
    Maltoni, Davide
    Santi, Stefano
    [J]. IMAGE ANALYSIS AND PROCESSING - ICIAP 2023 WORKSHOPS, PT I, 2024, 14365 : 39 - 50
  • [5] Advancements in On-Device Deep Neural Networks
    Saravanan, Kavya
    Kouzani, Abbas Z.
    [J]. INFORMATION, 2023, 14 (08)
  • [6] Hybrid Neural Networks for On-Device Directional Hearing
    Wang, Anran
    Kim, Maruchi
    Zhang, Hao
    Gollakota, Shyamnath
    [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 11421 - 11430
  • [7] On-Device Learning in Memristor Spiking Neural Networks
    Zyarah, Abdullah M.
    Soures, Nicholas
    Kudithipudi, Dhireesha
    [J]. 2018 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2018,
  • [8] Neuromemristive Multi-Layer Random Projection Network with On-Device Learning
    Zyarah, Abdullah M.
    Kudithipudi, Dhireesha
    [J]. 2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [9] Context-aware Model Selection for On-Device Object Detection
    Kang, Seongju
    Jeong, Chaeeun
    Chung, Kwangue
    [J]. 35TH INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING (ICOIN 2021), 2021, : 662 - 666
  • [10] Secure Distributed On-Device Learning Networks with Byzantine Adversaries
    Dong, Yanjie
    Cheng, Julian
    Hossain, Md Jahangir
    Leung, Victor C. M.
    [J]. IEEE NETWORK, 2019, 33 (06): : 180 - 187