Application of Neural Architecture Search to Instrument Recognition in Polyphonic Audio

被引:3
|
作者
Fricke, Leonard [1 ]
Vatolkin, Igor [1 ]
Ostermann, Fabian [1 ]
机构
[1] TU Dortmund Univ, Dept Comp Sci, Dortmund, Germany
关键词
Neural Architecture Search; Instrument Recognition; Music Information Retrieval; Hyperband Search; Bayesian Optimization;
D O I
10.1007/978-3-031-29956-8_8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Instrument recognition in polyphonic audio signals is a very challenging classification task. It helps to improve related application scenarios, like music transcription and recommendation, organization of large music collections, or analysis of historical trends and properties of musical styles. Recently, the classification performance could be improved by the integration of deep convolutional neural networks. However, in to date published studies, the network architectures and parameter settings were usually adopted from image recognition tasks and manually adjusted, without a systematic optimization. In this paper, we show how two different neural architecture search strategies can be successfully applied for improvement of the prediction of nine instrument classes, significantly outperforming the classification performance of three fixed baseline architectures from previous works. Although high computing efforts for model optimization are required, the training of the final architecture is done only once for later prediction of instruments in a possibly unlimited number of musical tracks.
引用
收藏
页码:117 / 131
页数:15
相关论文
共 50 条
  • [31] POLYPHONIC MUSICAL INSTRUMENT RECOGNITION BASED ON A DYNAMIC MODEL OF THE SPECTRAL ENVELOPE
    Burred, Juan Jose
    Roebel, Axel
    Sikora, Thomas
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 173 - +
  • [32] Predominant instrument recognition from polyphonic music using feature fusion
    Ajayakumar, Roshni
    Rajan, Rajeev
    EMERGING TRENDS IN ENGINEERING, SCIENCE AND TECHNOLOGY FOR SOCIETY, ENERGY AND ENVIRONMENT, 2018, : 721 - 726
  • [33] Neural architecture search for energy-efficient always-on audio machine learning
    Daniel T. Speckhard
    Karolis Misiunas
    Sagi Perel
    Tenghui Zhu
    Simon Carlile
    Malcolm Slaney
    Neural Computing and Applications, 2023, 35 : 12133 - 12144
  • [34] Neural architecture search for energy-efficient always-on audio machine learning
    Speckhard, Daniel T.
    Misiunas, Karolis
    Perel, Sagi
    Zhu, Tenghui
    Carlile, Simon
    Slaney, Malcolm
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (16): : 12133 - 12144
  • [35] Neural Architecture Search for Enhancing Action Video Recognition in Compressed Domains
    Lamkowski, Pedro
    Rodrigues, Douglas
    Passos, Leandro A.
    Papa, Joao P.
    Almeida, Jurandy
    2024 31ST INTERNATIONAL CONFERENCE ON SYSTEMS, SIGNALS AND IMAGE PROCESSING, IWSSIP 2024, 2024,
  • [36] AutoGesNet: Auto Gesture Recognition Network Based on Neural Architecture Search
    Li, Yinqi
    Xu, Lu
    Shu, Weihua
    Tao, Ji'an
    Mei, Kuizhi
    2020 12TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTATIONAL INTELLIGENCE (ICACI), 2020, : 257 - 262
  • [37] EEG-Based Emotion Recognition via Neural Architecture Search
    Li, Chang
    Zhang, Zhongzhen
    Song, Rencheng
    Cheng, Juan
    Liu, Yu
    Chen, Xun
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 14 (02) : 957 - 968
  • [38] LATENCY-CONTROLLED NEURAL ARCHITECTURE SEARCH FOR STREAMING SPEECH RECOGNITION
    He, Liqiang
    Feng, Shulin
    Su, Dan
    Yu, Dong
    2021 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2021, : 62 - 67
  • [39] License plate recognition using neural architecture search for edge devices
    Shashirangana, Jithmi
    Padmasiri, Heshan
    Meedeniya, Dulani
    Perera, Charith
    Nayak, Soumya R.
    Nayak, Janmenjoy
    Vimal, Shanmuganthan
    Kadry, Seifidine
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2022, 37 (12) : 10211 - 10248
  • [40] Neural architecture search using genetic algorithm for facial expression recognition
    Deng, Shuchao
    Sun, Yanan
    Galvan, Edgar
    GECCO 2022 Companion - Proceedings of the 2022 Genetic and Evolutionary Computation Conference, 2022, : 423 - 426