Estimating Ensemble Location and Width in Binaural Recordings of Music with Convolutional Neural Networks

被引:0
|
作者
Antoniuk, Pawel [1 ]
Zielinski, Slawomir K. [1 ]
机构
[1] Bialystok Tech Univ, Fac Comp Sci, Bialystok, Poland
关键词
ensemble width; ensemble location; binaural; spatial audio; localization; convolutional neural net- work; head-related transfer function; angle of arrival; SPATIAL AUDIO; SOUND SOURCE; ROBUST LOCALIZATION; HEAD MOVEMENTS; MODEL; SPEAKERS; DATABASE; FRONT;
D O I
10.24425/aoa.2025.153648
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Binaural audio technology has been in existence for many years. However, its popularity has significantly increased over the past decade as a consequence of advancements in virtual reality and streaming techniques. Along with its growing popularity, the quantity of publicly accessible binaural audio recordings has also expanded. Consequently, there is now a need for automated and objective retrieval of spatial content information, with ensemble location and width being the most prominent. This study presents a novel method for estimating these ensemble parameters in binaural recordings of music. For this purpose, a dataset of 23 040 binaural recordings was synthesized from 192 publicly-available music recordings using 30 head-related transfer functions. The synthesized excerpts were then used to train a multi-task spectrogram-based convolutional neural network model, aiming to estimate the ensemble location and width for unseen recordings. The results indicate that a model for estimating ensemble parameters can be successfully constructed with low prediction errors: 4.76 circle (+/- 0.10 circle) for ensemble location and 8.57 circle (+/- 0.19 circle) for ensemble width. The method developed in this study outperforms previous spatiogram-based techniques recently published in the literature and shows promise for future development as part of a novel tool for binaural audio recordings analysis.
引用
收藏
页码:81 / 93
页数:13
相关论文
共 50 条
  • [21] MUSIC GENRE CLASSIFICATION USING CONVOLUTIONAL NEURAL NETWORKS
    Subhani, G. M.
    Shravya, Perala
    Kumar, Gorighe Akhil
    Hrithika, Chitumalla
    Shrinivas, Chimalpade Ajay
    INTERNATIONAL JOURNAL OF EARLY CHILDHOOD SPECIAL EDUCATION, 2022, 14 (05) : 1519 - 1526
  • [22] Improved Music Genre Classification with Convolutional Neural Networks
    Zhang, Weibin
    Lei, Wenkang
    Xu, Xiangmin
    Xing, Xiaofeng
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3304 - 3308
  • [23] Convolutional Neural Networks Approach for Music Genre Classification
    Cheng, Yu-Huei
    Chang, Pang-Ching
    Kuo, Che-Nan
    2020 INTERNATIONAL SYMPOSIUM ON COMPUTER, CONSUMER AND CONTROL (IS3C 2020), 2021, : 399 - 403
  • [24] Convolutional neural network ensemble for Parkinson's disease detection from voice recordings
    Hires, Mate
    Gazda, Matej
    Drotar, Peter
    Pah, Nemuel Daniel
    Motin, Mohammod Abdul
    Kumar, Dinesh Kant
    COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 141
  • [25] Convolutional neural network ensemble for Parkinson's disease detection from voice recordings
    Hireš, Máté
    Gazda, Matej
    Drotár, Peter
    Pah, Nemuel Daniel
    Motin, Mohammod Abdul
    Kumar, Dinesh Kant
    Computers in Biology and Medicine, 2022, 141
  • [26] Village Building Identification Based on Ensemble Convolutional Neural Networks
    Guo, Zhiling
    Chen, Qi
    Wu, Guangming
    Xu, Yongwei
    Shibasaki, Ryosuke
    Shao, Xiaowei
    SENSORS, 2017, 17 (11)
  • [27] Ensemble of convolutional neural networks to improve animal audio classification
    Loris Nanni
    Yandre M. G. Costa
    Rafael L. Aguiar
    Rafael B. Mangolin
    Sheryl Brahnam
    Carlos N. Silla
    EURASIP Journal on Audio, Speech, and Music Processing, 2020
  • [28] Segmentation of the uterine wall by an ensemble of fully convolutional neural networks
    Burai, Peter
    Hajdu, Andras
    Edgardo Manuel, Felipe-Riveron
    Harangi, Balazs
    2018 40TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2018, : 49 - 52
  • [29] Building an Ensemble of Convolutional Neural Networks for Classifying Panoramic Images
    P. O. Arkhipov
    S. L. Philippskih
    Pattern Recognition and Image Analysis, 2022, 32 : 511 - 514
  • [30] Ant genera identification using an ensemble of convolutional neural networks
    Marques, Alan Caio R.
    Raimundo, Marcos M.
    Cavalheiro, Ellen Marianne B.
    Salles, Luis F. P.
    Lyra, Christiano
    Von Zuben, Fernando J.
    PLOS ONE, 2018, 13 (01):