Application of Neural Architecture Search to Instrument Recognition in Polyphonic Audio

被引:3
|
作者
Fricke, Leonard [1 ]
Vatolkin, Igor [1 ]
Ostermann, Fabian [1 ]
机构
[1] TU Dortmund Univ, Dept Comp Sci, Dortmund, Germany
关键词
Neural Architecture Search; Instrument Recognition; Music Information Retrieval; Hyperband Search; Bayesian Optimization;
D O I
10.1007/978-3-031-29956-8_8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Instrument recognition in polyphonic audio signals is a very challenging classification task. It helps to improve related application scenarios, like music transcription and recommendation, organization of large music collections, or analysis of historical trends and properties of musical styles. Recently, the classification performance could be improved by the integration of deep convolutional neural networks. However, in to date published studies, the network architectures and parameter settings were usually adopted from image recognition tasks and manually adjusted, without a systematic optimization. In this paper, we show how two different neural architecture search strategies can be successfully applied for improvement of the prediction of nine instrument classes, significantly outperforming the classification performance of three fixed baseline architectures from previous works. Although high computing efforts for model optimization are required, the training of the final architecture is done only once for later prediction of instruments in a possibly unlimited number of musical tracks.
引用
收藏
页码:117 / 131
页数:15
相关论文
共 50 条
  • [1] Exploring Neural Networks for Musical Instrument Identification in Polyphonic Audio
    Blaszke, Maciej
    Korvel, Grazina
    Kostek, Bozena
    IEEE INTELLIGENT SYSTEMS, 2024, 39 (05) : 25 - 36
  • [2] Musical Instrument Recognition in Polyphonic Audio Using Missing Feature Approach
    Giannoulis, Dimitrios
    Klapuri, Anssi
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (09): : 1805 - 1817
  • [3] Instrument recognition in polyphonic music
    Essid, S
    Richard, G
    David, B
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 245 - 248
  • [4] Multi-objective evolutionary feature selection for instrument recognition in polyphonic audio mixtures
    Vatolkin, Igor
    Preuss, Mike
    Rudolph, Guenter
    Eichhoff, Markus
    Weihs, Claus
    SOFT COMPUTING, 2012, 16 (12) : 2027 - 2047
  • [5] Multi-objective evolutionary feature selection for instrument recognition in polyphonic audio mixtures
    Igor Vatolkin
    Mike Preuß
    Günter Rudolph
    Markus Eichhoff
    Claus Weihs
    Soft Computing, 2012, 16 : 2027 - 2047
  • [6] Deep Convolutional Neural Networks for Predominant Instrument Recognition in Polyphonic Music
    Han, Yoonchang
    Kim, Jaehun
    Lee, Kyogu
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (01) : 208 - 221
  • [7] Predicting Key Recognition Difficulty in Polyphonic Audio
    Chuan, Ching-Hua
    Charapko, Aleksey
    2013 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2013, : 421 - 426
  • [8] Predicting key recognition difficulty in polyphonic audio
    Chuan, Ching-Hua
    Charapko, Aleksey
    Proceedings - 2013 IEEE International Symposium on Multimedia, ISM 2013, 2013, : 421 - 426
  • [9] Augmentation Methods on Monophonic Audio for Instrument Classification in Polyphonic Music
    Kratimenos, Agelos
    Avramidis, Kleanthis
    Garoufis, Christos
    Zlatintsi, Athanasia
    Maragos, Petros
    28TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2020), 2021, : 156 - 160
  • [10] Efficient neural architecture search for emotion recognition
    Verma, Monu
    Mandal, Murari
    Reddy, Satish Kumar
    Meedimale, Yashwanth Reddy
    Vipparthi, Santosh Kumar
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 224