ANIRA: An Architecture for Neural Network Inference in Real-Time Audio Applications

Cited by: 1
Authors
Ackva, Valentin [1]
Schulz, Fares [1]
Affiliations
[1] Tech Univ Berlin, Audio Commun Grp, Berlin, Germany
Keywords
neural network; real-time audio; inference engine; audio effects; deep learning; digital signal processing
DOI
10.1109/IS262782.2024.10704099
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Subject classification codes
081203; 0835
Abstract
Numerous tools for neural network inference are currently available, yet many do not meet the requirements of real-time audio applications. In response, we introduce anira, an efficient cross-platform library. To ensure compatibility with a broad range of neural network architectures and frameworks, anira supports ONNX Runtime, LibTorch, and TensorFlow Lite as backends. Each inference engine exhibits real-time violations, which anira mitigates by decoupling the inference from the audio callback to a static thread pool. The library incorporates built-in latency management and extensive benchmarking capabilities, both crucial to ensure a continuous signal flow. Three different neural network architectures for audio effect emulation are then subjected to benchmarking across various configurations. Statistical modeling is employed to identify the influence of various factors on performance. The findings indicate that for stateless models, ONNX Runtime exhibits the lowest runtimes. For stateful models, LibTorch demonstrates the fastest performance. Our results also indicate that for certain model-engine combinations, the initial inferences take longer, particularly when these inferences exhibit a higher incidence of real-time violations.
Pages: 193-202
Page count: 10
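
The abstract describes decoupling inference from the audio callback into a static thread pool, with a fixed latency to keep the signal continuous. The following is only a minimal sketch of that general idea in Python, not anira's API or implementation: the class name `DecoupledInference`, its parameters, and the silence-on-miss policy are all assumptions made for illustration. A real-time implementation would avoid locks and allocation in the callback; this sketch only shows the data flow.

```python
import queue
import threading


class DecoupledInference:
    """Hypothetical sketch: the audio callback hands blocks to a fixed
    pool of worker threads and never waits for the model to finish."""

    def __init__(self, model, block_size, latency_blocks=1, num_workers=2):
        self.model = model                # any callable: list -> list
        self.block_size = block_size
        self.pending = queue.Queue()      # blocks waiting for inference
        self.results = {}                 # sequence number -> processed block
        self.lock = threading.Lock()      # assumption: a lock is acceptable here
        self.write_seq = 0                # next input block number
        self.read_seq = -latency_blocks   # output lags input by this many blocks
        self.missed = 0                   # blocks that were not ready in time
        # The pool is created once up front and never resized ("static").
        for _ in range(num_workers):
            threading.Thread(target=self._work, daemon=True).start()

    def _work(self):
        while True:
            seq, block = self.pending.get()
            out = self.model(block)
            with self.lock:
                self.results[seq] = out
            self.pending.task_done()

    def process(self, block):
        """Called from the audio callback; returns immediately."""
        self.pending.put_nowait((self.write_seq, list(block)))
        self.write_seq += 1
        seq = self.read_seq
        self.read_seq += 1
        silence = [0.0] * self.block_size
        if seq < 0:
            return silence                # still inside the initial latency
        with self.lock:
            out = self.results.pop(seq, None)
        if out is None:
            self.missed += 1              # result late: output silence instead
            return silence
        return out
```

With `latency_blocks=1`, the first call returns silence and each later call returns the processed version of the previous input block, provided a worker finished in time; `missed` counts the blocks for which it did not.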