Efficient Batched Inference in Conditional Neural Networks

被引:0
|
作者
Selvam, Surya [1 ]
Nagarajan, Amrit [1 ,2 ]
Raghunathan, Anand [1 ]
机构
[1] Purdue University, Elmore Family School of Electrical and Computer Engineering, West Lafayette,IN,47907, United States
[2] IBM TJ Watson Research Center, Yorktown Heights,NY,10598, United States
关键词
D O I
10.1109/TCAD.2024.3445263
中图分类号
学科分类号
摘要
引用
收藏
页码:4081 / 4092
相关论文
共 50 条
  • [41] Neural networks with circular filters enable data efficient inference of sequence motifs
    Blum, Christopher F.
    Kollmann, Markus
    [J]. BIOINFORMATICS, 2019, 35 (20) : 3937 - 3943
  • [42] Sparsity in Deep Learning: Pruning and growth for efficient inference and training in neural networks
    Hoefler, Torsten
    Alistarh, Dan
    Ben-Nun, Tal
    Dryden, Nikoli
    Peste, Alexandra
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2021, 23
  • [43] OnceNAS: Discovering efficient on-device inference neural networks for edge devices
    Zhang, Yusen
    Qin, Yunchuan
    Zhang, Yufeng
    Zhou, Xu
    Jian, Songlei
    Tan, Yusong
    Li, Kenli
    [J]. INFORMATION SCIENCES, 2024, 669
  • [44] Effective and efficient neural networks for spike inference from in vivo calcium imaging
    Zhou, Zhanhong
    Yip, Hei Matthew
    Tsimring, Katya
    Sur, Mriganka
    Tin, Chung
    Ip, Jacque Pak Kan
    [J]. CELL REPORTS METHODS, 2023, 3 (05):
  • [45] An efficient and flexible inference system for serving heterogeneous ensembles of deep neural networks
    Pochelu, Pierrick
    Petiton, Serge G.
    Conche, Bruno
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 5225 - 5232
  • [46] Sparsity in deep learning: Pruning and growth for efficient inference and training in neural networks
    Hoefler, Torsten
    Alistarh, Dan
    Ben-Nun, Tal
    Dryden, Nikoli
    Peste, Alexandra
    [J]. Journal of Machine Learning Research, 2021, 22
  • [47] Quantized Deep Neural Networks for Energy Efficient Hardware-based Inference
    Ding, Ruizhou
    Liu, Zeye
    Blanton, R. D.
    Marculescu, Diana
    [J]. 2018 23RD ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC), 2018, : 1 - 8
  • [48] Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference
    Jacob, Benoit
    Kligys, Skirmantas
    Chen, Bo
    Zhu, Menglong
    Tang, Matthew
    Howard, Andrew
    Adam, Hartwig
    Kalenichenko, Dmitry
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 2704 - 2713
  • [49] Exploring Fine-Grained Sparsity in Convolutional Neural Networks for Efficient Inference
    Wang, Longguang
    Guo, Yulan
    Dong, Xiaoyu
    Wang, Yingqian
    Ying, Xinyi
    Lin, Zaiping
    An, Wei
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (04) : 4474 - 4493
  • [50] Efficient Inference of Large-Scale and Lightweight Convolutional Neural Networks on FPGA
    Wu, Xiao
    Ma, Yufei
    Wang, Zhongfeng
    [J]. 2020 IEEE 33RD INTERNATIONAL SYSTEM-ON-CHIP CONFERENCE (SOCC), 2020, : 168 - 173