Efficient Batched Inference in Conditional Neural Networks

被引:0
|
作者
Selvam, Surya [1 ]
Nagarajan, Amrit [1 ,2 ]
Raghunathan, Anand [1 ]
机构
[1] Purdue University, Elmore Family School of Electrical and Computer Engineering, West Lafayette,IN,47907, United States
[2] IBM TJ Watson Research Center, Yorktown Heights,NY,10598, United States
关键词
D O I
10.1109/TCAD.2024.3445263
中图分类号
学科分类号
摘要
引用
收藏
页码:4081 / 4092
相关论文
共 50 条
  • [31] Linear Approximation of Deep Neural Networks for Efficient Inference on Video Data
    Rueckauer, Bodo
    Liu, Shih-Chii
    [J]. 2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2019,
  • [32] Inference and Energy Efficient Design of Deep Neural Networks for Embedded Devices
    Galanis, Ioannis
    Anagnostopoulos, Iraklis
    Nguyen, Chinh
    Bares, Guillermo
    Burkard, Dona
    [J]. 2020 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI (ISVLSI 2020), 2020, : 36 - 41
  • [33] Early-Exit with Class Exclusion for Efficient Inference of Neural Networks
    Wang, Jingcun
    Li, Bing
    Zhang, Grace Li
    [J]. 2024 IEEE 6TH INTERNATIONAL CONFERENCE ON AI CIRCUITS AND SYSTEMS, AICAS 2024, 2024, : 263 - 267
  • [34] Efficient inference in large conditional random fields
    Cohn, Trevor
    [J]. MACHINE LEARNING: ECML 2006, PROCEEDINGS, 2006, 4212 : 606 - 613
  • [35] MPE Inference in Conditional Linear Gaussian Networks
    Salmeron, Antonio
    Rumi, Rafael
    Langseth, Helge
    Madsen, Anders L.
    Nielsen, Thomas D.
    [J]. SYMBOLIC AND QUANTITATIVE APPROACHES TO REASONING WITH UNCERTAINTY, ECSQARU 2015, 2015, 9161 : 407 - 416
  • [36] Bayesian inference in neural networks
    Marzban, C
    [J]. FIRST CONFERENCE ON ARTIFICIAL INTELLIGENCE, 1998, : J25 - J30
  • [37] Bayesian inference in neural networks
    Marzban, C
    [J]. 14TH CONFERENCE ON PROBABILITY AND STATISTICS IN THE ATMOSPHERIC SCIENCES, 1998, : J97 - J102
  • [38] Bayesian inference in neural networks
    Paige, RL
    Butler, RW
    [J]. BIOMETRIKA, 2001, 88 (03) : 623 - 641
  • [39] An Efficient Channel-Aware Sparse Binarized Neural Networks Inference Accelerator
    Liu, Qingliang
    Lai, Jinmei
    Gao, Jiabao
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2022, 69 (03) : 1637 - 1641
  • [40] Dynamic Representations Toward Efficient Inference on Deep Neural Networks by Decision Gates
    Shafiee, Mohammad Saeed
    Shafiee, Mohammad Javad
    Wong, Alexander
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 677 - 685