ANIRA: An Architecture for Neural Network Inference in Real-Time Audio Applications

被引:1
|
作者
Ackva, Valentin [1 ]
Schulz, Fares [1 ]
机构
[1] Tech Univ Berlin, Audio Commun Grp, Berlin, Germany
关键词
neural network; real-time audio; inference engine; audio effects; deep learning; digital signal processing;
D O I
10.1109/IS262782.2024.10704099
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Numerous tools for neural network inference are currently available, yet many do not meet the requirements of real-time audio applications. In response, we introduce anira, an efficient cross-platform library. To ensure compatibility with a broad range of neural network architectures and frameworks, anira supports ONNX Runtime, LibTorch, and TensorFlow Lite as backends. Each inference engine exhibits real-time violations, which anira mitigates by decoupling the inference from the audio callback to a static thread pool. The library incorporates builtin latency management and extensive benchmarking capabilities, both crucial to ensure a continuous signal flow. Three different neural network architectures for audio effect emulation are then subjected to benchmarking across various configurations. Statistical modeling is employed to identify the influence of various factors on performance. The findings indicate that for stateless models, ONNX Runtime exhibits the lowest runtimes. For stateful models, LibTorch demonstrates the fastest performance. Our results also indicate that for certain model-engine combinations, the initial inferences take longer, particularly when these inferences exhibit a higher incidence of real-time violations.
引用
收藏
页码:193 / 202
页数:10
相关论文
共 50 条
  • [41] Real-time implementation of the cerebellum neural network
    Hao, Xinyu
    Wang, Jiang
    Yang, Shuangming
    Deng, Bin
    Wei, Xile
    Yi, Guosheng
    PROCEEDINGS OF THE 2019 31ST CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2019), 2019, : 3595 - 3599
  • [42] Supporting real-time applications in an integrated services packet network. Architecture and mechanism
    Clark, David D.
    Shenker, Scott
    Zhang, Lixia
    Computer Communications Review, 1992, 22 (04):
  • [43] REAL-TIME CONTROL USING A NEURAL NETWORK
    WOOD, D
    BT TECHNOLOGY JOURNAL, 1992, 10 (03): : 69 - 76
  • [44] Comparing Optimization Methods of Neural Networks for Real-time Inference
    Khan, Mir
    Lunnikivi, Henri
    Huttunen, Heikki
    Boutellier, Jani
    2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2019,
  • [45] Real-Time GPU Audio
    Hsu, Bill
    Sosnick-Perez, Marc
    COMMUNICATIONS OF THE ACM, 2013, 56 (06) : 54 - 62
  • [46] Designing Real-Time Neural Networks by Efficient Neural Architecture Search
    Bo, Zitong
    Li, Yilin
    Qiao, Ying
    Leng, Chang
    Wang, Hongan
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT IV, ICIC 2024, 2024, 14865 : 62 - 73
  • [47] CROSSPOINT ARCHITECTURE BOOSTS COMPLEX REAL-TIME APPLICATIONS
    BOXER, A
    MEASUREMENTS & CONTROL, 1994, (165): : 96 - 99
  • [48] Parallel correlator architecture for real-time sensing applications
    Dostaler, M
    Petriu, EM
    Al-Dhaher, AH
    Groza, VZ
    PROCEEDINGS OF THE ISA/IEEE SENSORS FOR INDUSTRY CONFERENCE, 2002, : 180 - 185
  • [49] Building Real-Time Collaborative Applications with a Federated Architecture
    Ojanguren-Menendez, Pablo
    Tenorio-Fornes, Antonio
    Hassan, Samer
    INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2015, 3 (05): : 47 - 52
  • [50] An Architecture for Agile Machine Learning in Real-Time Applications
    Schleier-Smith, Johann
    KDD'15: PROCEEDINGS OF THE 21ST ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2015, : 2059 - 2068