Application of Neural Architecture Search to Instrument Recognition in Polyphonic Audio

Cited by: 3
Authors
Fricke, Leonard [1]
Vatolkin, Igor [1]
Ostermann, Fabian [1]
Affiliations
[1] TU Dortmund Univ, Dept Comp Sci, Dortmund, Germany
Keywords
Neural Architecture Search; Instrument Recognition; Music Information Retrieval; Hyperband Search; Bayesian Optimization
DOI
10.1007/978-3-031-29956-8_8
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Instrument recognition in polyphonic audio signals is a very challenging classification task. It supports related applications such as music transcription and recommendation, the organization of large music collections, and the analysis of historical trends and properties of musical styles. Recently, classification performance has been improved by integrating deep convolutional neural networks. However, in studies published to date, the network architectures and parameter settings were usually adopted from image recognition tasks and adjusted manually, without systematic optimization. In this paper, we show how two different neural architecture search strategies can be successfully applied to improve the prediction of nine instrument classes, significantly outperforming the classification performance of three fixed baseline architectures from previous works. Although model optimization requires high computing effort, the final architecture is trained only once and can then predict instruments in a possibly unlimited number of musical tracks.
Pages: 117-131
Number of pages: 15