Application of Neural Architecture Search to Instrument Recognition in Polyphonic Audio

被引：3

作者：

Fricke, Leonard ^{[1
]}

Vatolkin, Igor ^{[1
]}

Ostermann, Fabian ^{[1
]}

机构：

[1] TU Dortmund Univ, Dept Comp Sci, Dortmund, Germany

来源：

ARTIFICIAL INTELLIGENCE IN MUSIC, SOUND, ART AND DESIGN, EVOMUSART 2023 | 2023年 / 13988卷

关键词：

Neural Architecture Search; Instrument Recognition; Music Information Retrieval; Hyperband Search; Bayesian Optimization;

D O I：

10.1007/978-3-031-29956-8_8

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Instrument recognition in polyphonic audio signals is a very challenging classification task. It helps to improve related application scenarios, like music transcription and recommendation, organization of large music collections, or analysis of historical trends and properties of musical styles. Recently, the classification performance could be improved by the integration of deep convolutional neural networks. However, in to date published studies, the network architectures and parameter settings were usually adopted from image recognition tasks and manually adjusted, without a systematic optimization. In this paper, we show how two different neural architecture search strategies can be successfully applied for improvement of the prediction of nine instrument classes, significantly outperforming the classification performance of three fixed baseline architectures from previous works. Although high computing efforts for model optimization are required, the training of the final architecture is done only once for later prediction of instruments in a possibly unlimited number of musical tracks.

引用

页码：117 / 131

页数：15

共 50 条

[1] Exploring Neural Networks for Musical Instrument Identification in Polyphonic Audio
Blaszke, Maciej
Korvel, Grazina
Kostek, Bozena
IEEE INTELLIGENT SYSTEMS, 2024, 39 (05) : 25 - 36
[2] Musical Instrument Recognition in Polyphonic Audio Using Missing Feature Approach
Giannoulis, Dimitrios
Klapuri, Anssi
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (09): : 1805 - 1817
[3] Instrument recognition in polyphonic music
Essid, S
Richard, G
David, B
2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 245 - 248
[4] Multi-objective evolutionary feature selection for instrument recognition in polyphonic audio mixtures
Vatolkin, Igor
Preuss, Mike
Rudolph, Guenter
Eichhoff, Markus
Weihs, Claus
SOFT COMPUTING, 2012, 16 (12) : 2027 - 2047
[5] Multi-objective evolutionary feature selection for instrument recognition in polyphonic audio mixtures
Igor Vatolkin
Mike Preuß
Günter Rudolph
Markus Eichhoff
Claus Weihs
Soft Computing, 2012, 16 : 2027 - 2047
[6] Deep Convolutional Neural Networks for Predominant Instrument Recognition in Polyphonic Music
Han, Yoonchang
Kim, Jaehun
Lee, Kyogu
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (01) : 208 - 221
[7] Predicting Key Recognition Difficulty in Polyphonic Audio
Chuan, Ching-Hua
Charapko, Aleksey
2013 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2013, : 421 - 426
[8] Predicting key recognition difficulty in polyphonic audio
Chuan, Ching-Hua
Charapko, Aleksey
Proceedings - 2013 IEEE International Symposium on Multimedia, ISM 2013, 2013, : 421 - 426
[9] Augmentation Methods on Monophonic Audio for Instrument Classification in Polyphonic Music
Kratimenos, Agelos
Avramidis, Kleanthis
Garoufis, Christos
Zlatintsi, Athanasia
Maragos, Petros
28TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2020), 2021, : 156 - 160
[10] Efficient neural architecture search for emotion recognition
Verma, Monu
Mandal, Murari
Reddy, Satish Kumar
Meedimale, Yashwanth Reddy
Vipparthi, Santosh Kumar
EXPERT SYSTEMS WITH APPLICATIONS, 2023, 224

← 1 2 3 4 5 →