ANIRA: An Architecture for Neural Network Inference in Real-Time Audio Applications

Cited by: 1
Authors
Ackva, Valentin [1]
Schulz, Fares [1]
Affiliations
[1] Tech Univ Berlin, Audio Commun Grp, Berlin, Germany
Keywords
neural network; real-time audio; inference engine; audio effects; deep learning; digital signal processing
DOI
10.1109/IS262782.2024.10704099
Chinese Library Classification (CLC)
TP39 [Computer applications]
Discipline classification codes
081203; 0835
Abstract
Numerous tools for neural network inference are currently available, yet many do not meet the requirements of real-time audio applications. In response, we introduce anira, an efficient cross-platform library. To ensure compatibility with a broad range of neural network architectures and frameworks, anira supports ONNX Runtime, LibTorch, and TensorFlow Lite as backends. Each of these inference engines exhibits real-time violations, which anira mitigates by moving inference out of the audio callback and onto a static thread pool. The library incorporates built-in latency management and extensive benchmarking capabilities, both crucial for ensuring a continuous signal flow. Three neural network architectures for audio effect emulation are then benchmarked across various configurations, and statistical modeling is used to identify the influence of various factors on performance. The findings indicate that ONNX Runtime exhibits the lowest runtimes for stateless models, while LibTorch is fastest for stateful models. Our results also indicate that for certain model-engine combinations, the initial inferences take longer, particularly when these inferences exhibit a higher incidence of real-time violations.
Pages: 193-202
Page count: 10