Multipitch tracking in music signals using Echo State Networks

被引:0
|
作者
Steiner, Peter [1 ]
Stone, Simon [1 ]
Birkholz, Peter [1 ]
Jalalvand, Azarakhsh [2 ]
机构
[1] Tech Univ Dresden, Inst Acoust & Speech Commun, Dresden, Germany
[2] Univ Ghent, IMEC, IDLab, Ghent, Belgium
关键词
Reservoir Computing; Echo State Network; Multipitch; RNN; MIR;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Currently, convolutional neural networks (CNNs) define the state of the art for multipitch tracking in music signals. Echo State Networks (ESNs), a recently introduced recurrent neural network architecture, achieved similar results as CNNs for various tasks, such as phoneme or digit recognition. However, they have not yet received much attention in the community of Music Information Retrieval. The core of ESNs is a group of unordered, randomly connected neurons, i.e., the reservoir, by which the low-dimensional input space is non-linearly transformed into a high-dimensional feature space. Because only the weights of the connections between the reservoir and the output are trained using linear regression, ESNs are easier to train than deep neural networks. This paper presents a first exploration of ESNs for the challenging task of multipitch tracking in music signals. The best results presented in this paper were achieved with a bidirectional two-layer ESN with 20 000 neurons in each layer. Although the final F -score of 0.7198 still falls below the state of the art (0.7370), the proposed ESN-based approach serves as a baseline for further investigations of ESNs in audio signal processing in the future.
引用
收藏
页码:126 / 130
页数:5
相关论文
共 50 条
  • [1] Multipitch analysis of polyphonic music and speech signals using an auditory model
    Klapuri, Anssi
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (02): : 255 - 266
  • [2] Speaker-dependent multipitch tracking using deep neural networks
    Liu, Yuzhou
    Wang, DeLiang
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2017, 141 (02): : 710 - 721
  • [3] Speaker-dependent Multipitch Tracking Using Deep Neural Networks
    Liu, Yuzhou
    Wang, DeLiang
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 3279 - 3283
  • [4] Denoising of discrete-time chaotic signals using echo state networks
    Duarte, Andre L. O.
    Eisencraft, Marcio
    SIGNAL PROCESSING, 2024, 214
  • [5] Improved Acoustic Modeling for Automatic Piano Music Transcription Using Echo State Networks
    Steiner, Peter
    Jalalvand, Azarakhsh
    Birkholz, Peter
    ADVANCES IN COMPUTATIONAL INTELLIGENCE (IWANN 2021), PT II, 2021, 12862 : 143 - 154
  • [6] Modeling of Cortical Signals Using Optimized Echo State Networks with Leaky Integrator Neurons
    Zhou, Hanying
    Wang, Yongji
    Huang, Jiangshuai
    NEURAL INFORMATION PROCESSING, PT 1, PROCEEDINGS, 2009, 5863 : 10 - 18
  • [7] Multipitch Tracking Using A Factorial Hidden Markov Model
    Wohlmayr, Michael
    Pernkopf, Franz
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 147 - 150
  • [8] Using Echo State Networks for Cryptography
    Ramamurthy, Rajkumar
    Bauckhage, Christian
    Buza, Krisztian
    Wrobel, Stefan
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, PT II, 2017, 10614 : 663 - 671
  • [9] Real-Time Tumor Detection Using Electromagnetic Signals with Memristive Echo State Networks
    Nair, Vineeta V.
    George, Elizabeth
    James, Alex
    IEEE Internet of Things Journal, 2024, 11 (20) : 33712 - 33721
  • [10] Assembly of Echo State Networks Driven by Segregated Low Dimensional Signals
    Iinuma, Takahiro
    Nobukawa, Sou
    Yamaguchi, Satoshi
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,