Sound Source Localization Inside a Structure Under Semi-Supervised Conditions

被引:2
|
作者
Kita, Shunsuke [1 ]
Kajikawa, Yoshinobu [2 ]
机构
[1] Osaka Res Inst Ind Sci & Technol, Div Elect & Mech Syst, Osaka 594115, Japan
[2] Kansai Univ, Fac Engn Sci, Osaka 5648680, Japan
关键词
Data models; Adaptation models; Acoustics; Speech processing; Predictive models; Location awareness; Training; Sound source localization; domain transfer; acoustic-structure coupling; t-distributed stochastic neighbor embedding;
D O I
10.1109/TASLP.2023.3263776
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We propose a method for applying a sound source localization (SSL) model trained on simulated data in a real-world environment, with a domain transfer (DT) model for the SSL inside a structure. The DT model transfers real data into pseudo-simulation data. The SSL model trained on the simulation data is then adapted to the real data using the DT model. Our method consists of an SSL model and a DT model. The SSL model predicts the position of a sound source inside the structure, whereas the DT model transforms the data. Because our simulation is not perfect, real data are extrapolated for use with the SSL model. However, the data transformed by the DT model are interpolated within the feature space. The outcome is that the performance of the SSL model in the real world is improved. In our study, the frequency spectra of accelerometers observed on the outer surface of the structure are the model input. The goal is to predict the position of the sound source. The SSL model is built using deep and convolutional neural networks, and the DT model is built using either an autoencoder, a deep convolutional autoencoder, or pix2pix. The two-dimensional distributions of the t-distributed Stochastic Neighbor Embedding indicate that using pix2pix as the DT model shows the best performance. Furthermore, our method's performance for SSL is improved by 57% for the classification problem and by 27% for the regression problem when compared to the case where no transformation is applied.
引用
收藏
页码:1397 / 1408
页数:12
相关论文
共 50 条
  • [11] Semi-supervised protein subcellular localization
    Qian Xu
    Derek Hao Hu
    Hong Xue
    Weichuan Yu
    Qiang Yang
    [J]. BMC Bioinformatics, 10
  • [12] Semi-Supervised Multiple Source Localization Using Relative Harmonic Coefficients Under Noisy and Reverberant Environments
    Hu, Yonggang
    Samarasinghe, Prasanga N.
    Gannot, Sharon
    Abhayapala, Thushara D.
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 : 3108 - 3123
  • [13] Semi-supervised underwater acoustic source localization based on residual convolutional autoencoder
    Jin, Pian
    Wang, Biao
    Li, Lebo
    Chao, Peng
    Xie, Fangtong
    [J]. EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2022, 2022 (01)
  • [14] Semi-supervised underwater acoustic source localization based on residual convolutional autoencoder
    Pian Jin
    Biao Wang
    Lebo Li
    Peng Chao
    Fangtong Xie
    [J]. EURASIP Journal on Advances in Signal Processing, 2022
  • [15] Improving Landmark Localization with Semi-Supervised Learning
    Honari, Sina
    Molchanov, Pavlo
    Tyree, Stephen
    Vincent, Pascal
    Pal, Christopher
    Kautz, Jan
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 1546 - 1555
  • [16] Improving Localization for Semi-Supervised Object Detection
    Rossi, Leonardo
    Karimi, Akbar
    Prati, Andrea
    [J]. IMAGE ANALYSIS AND PROCESSING, ICIAP 2022, PT II, 2022, 13232 : 516 - 527
  • [17] Sound source localization for source inside a structure using Ac-CycleGAN
    Kita, Shunsuke
    Park, Choong Sik
    Kajikawa, Yoshinobu
    [J]. JOURNAL OF SOUND AND VIBRATION, 2024, 591
  • [18] SEMI-SUPERVISED LEARNING HELPS IN SOUND EVENT CLASSIFICATION
    Zhang, Zixing
    Schuller, Bjoern
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 333 - 336
  • [19] COUPLE LEARNING FOR SEMI-SUPERVISED SOUND EVENT DETECTION
    Tao, Rui
    Yan, Long
    Ouchi, Kazushige
    Wang, Xiangdong
    [J]. INTERSPEECH 2022, 2022, : 2398 - 2402
  • [20] Semi-supervised Variational Autoencoder for WiFi Indoor Localization
    Chidlovskii, Boris
    Antsfeld, Leonid
    [J]. 2019 INTERNATIONAL CONFERENCE ON INDOOR POSITIONING AND INDOOR NAVIGATION (IPIN), 2019,