Application of Neural Architecture Search to Instrument Recognition in Polyphonic Audio

Cited by: 3
Authors
Fricke, Leonard [1]
Vatolkin, Igor [1]
Ostermann, Fabian [1]
Affiliations
[1] TU Dortmund Univ, Dept Comp Sci, Dortmund, Germany
Keywords
Neural Architecture Search; Instrument Recognition; Music Information Retrieval; Hyperband Search; Bayesian Optimization;
DOI
10.1007/978-3-031-29956-8_8
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Instrument recognition in polyphonic audio signals is a very challenging classification task. It supports related applications such as music transcription and recommendation, the organization of large music collections, or the analysis of historical trends and properties of musical styles. Recently, classification performance has been improved by integrating deep convolutional neural networks. However, in studies published to date, the network architectures and parameter settings were usually adopted from image recognition tasks and adjusted manually, without systematic optimization. In this paper, we show how two different neural architecture search strategies can be successfully applied to improve the prediction of nine instrument classes, significantly outperforming three fixed baseline architectures from previous work. Although model optimization requires high computational effort, the final architecture is trained only once and can then predict instruments in a potentially unlimited number of musical tracks.
Pages: 117-131
Number of pages: 15
Related papers
50 in total
  • [11] AutoSpeech: Neural Architecture Search for Speaker Recognition
    Ding, Shaojin
    Chen, Tianlong
    Gong, Xinyu
    Zha, Weiwei
    Wang, Zhangyang
    INTERSPEECH 2020, 2020, : 916 - 920
  • [12] NEURAL ARCHITECTURE SEARCH FOR SPEECH EMOTION RECOGNITION
    Wu, Xixin
    Hu, Shoukang
    Wu, Zhiyong
    Liu, Xunying
    Meng, Helen
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6902 - 6906
  • [13] Reinforcement Learning based Neural Architecture Search for Audio Tagging
    Liu, Haiyang
    Zhang, Cheng
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [14] Neural Architecture Search for Lightweight Neural Network in Food Recognition
    Tan, Ren Zhang
    Chew, XinYing
    Khaw, Khai Wah
    MATHEMATICS, 2021, 9 (11)
  • [15] Instrument recognition in polyphonic music based on automatic taxonomies
    Essid, S
    Richard, G
    David, B
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (01): : 68 - 80
  • [16] Indian Instrument Identification from Polyphonic Audio using KNN Classifier
    Chandan, S., V
    Naik, Mohan R.
    Ashwini
    Krishna, A. Vijay
    2019 INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, SIGNAL PROCESSING AND NETWORKING (WISPNET 2019): ADVANCING WIRELESS AND MOBILE COMMUNICATIONS TECHNOLOGIES FOR 2020 INFORMATION SOCIETY, 2019, : 135 - 139
  • [17] AutoMER: Spatiotemporal Neural Architecture Search for Microexpression Recognition
    Verma, Monu
    Reddy, M. Satish Kumar
    Meedimale, Yashwanth Reddy
    Mandal, Murari
    Vipparthi, Santosh Kumar
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (11) : 6116 - 6128
  • [18] Evolutionary Neural Architecture Search for Facial Expression Recognition
    Deng, Shuchao
    Lv, Zeqiong
    Galvan, Edgar
    Sun, Yanan
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2023, 7 (05): : 1405 - 1419
  • [19] Binarized Neural Architecture Search for Efficient Object Recognition
    Hanlin Chen
    Li’an Zhuo
    Baochang Zhang
    Xiawu Zheng
    Jianzhuang Liu
    Rongrong Ji
    David Doermann
    Guodong Guo
    International Journal of Computer Vision, 2021, 129 : 501 - 516
  • [20] Automatic Modulation Recognition Using Neural Architecture Search
    Wei, Shengyun
    Zou, Shun
    Liao, Feifan
    Lang, Weimin
    Wu, Wenhui
    2019 INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE BIG DATA AND INTELLIGENT SYSTEMS (HPBD&IS), 2019, : 151 - 156