ConvLSTM-based Sound Source Localization in a manufacturing workplace

被引:1
|
作者
Jalayer, Reza [1 ]
Jalayer, Masoud [1 ]
Mor, Andrea [1 ]
Orsenigo, Carlotta [1 ]
Vercellis, Carlo [1 ]
机构
[1] Politecn Milan, Dept Management Econ & Ind Engn, Via Lambruschini 4-b, I-20156 Milan, Italy
关键词
Industry; 5.0; Smart manufacturing; Sound source localization; Convolutional LSTM; Multiple sound sources; Moving sound sources; DOA ESTIMATION; CRNN;
D O I
10.1016/j.cie.2024.110213
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In this paper, Sound Source Localization (SSL) is explored as an approach to localize both human operators and machines emitting sound signals in a manufacturing workplace. In particular, a comprehensive analysis of the source localization ability of a state-of-the-art deep learning architecture in environments of increasing complexity is presented. Scenarios including single, dual, and multiple sound sources, in the form of both human and Computerized Numerical Control (CNC) machines, are investigated, as well as configurations with a mix of stationary and moving sources. Our work contributes to the extant literature by enriching previous research findings primarily devoted to single stationary sources. Furthermore, by focusing on the simultaneous and centralized detection of sources of different nature and type, it diverges from traditional SSL studies in manufacturing, which emphasize the localization of humans by robots in human-robot interaction, and presents a localization approach which enables a broader control over the workspace. For the localization task, a Convolutional LSTM architecture able to capture both spatial and temporal sound characteristics is also proposed, with each source assigned a dedicated model. Extensive experiments were carried out for each scenario in a simulated environment, where different levels of noise were also applied. The results showed the remarkable accuracy and robustness of the deep learning models when it comes to localizing single and dual stationary sources, as well as single moving sources. For multiple stationary and moving sources a general decline in the detection performance was observed, alongside a heightened sensitivity to noise.
引用
收藏
页数:15
相关论文
共 50 条
  • [31] Cleansed PHAT GCC based Sound Source Localization
    Lee, Sangmoon
    Park, Youngjin
    Park, Youn-sik
    INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2010), 2010, : 2051 - 2054
  • [32] Efficient and Privacy-Preserving ConvLSTM-Based Detection of Electricity Theft Cyber-Attacks in Smart Grids
    Anin, Johnson
    Khan, Muhammad Jahanzeb
    Abdelsalam, Omar
    Nabil, Mahmoud
    Hu, Fei
    Alsharif, Ahmad
    IEEE ACCESS, 2024, 12 : 153089 - 153104
  • [33] Soil Moisture Prediction Using NDVI and NSMI Satellite Data: ViT-Based Models and ConvLSTM-Based Model
    Habiboullah A.
    Louly M.A.
    SN Computer Science, 4 (2)
  • [34] Poverty Estimation Using a ConvLSTM-Based Model With Multisource Remote Sensing Data: A Case Study in Nigeria
    Tang, Jie
    Zhao, Xizhi
    Zhang, Fuhao
    Qiu, Agen
    Tao, Kunwang
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 3516 - 3529
  • [35] A novel method for predicting fatigue crack propagation path of surface cracks in pipelines with a ConvLSTM-based model
    Yu, Jianxing
    Su, Yefan
    Jin, Zihang
    Tian, Hanxu
    Zhao, Mingren
    INTERNATIONAL JOURNAL OF PRESSURE VESSELS AND PIPING, 2025, 214
  • [36] Poverty Estimation Using a ConvLSTM-Based Model With Multisource Remote Sensing Data: A Case Study in Nigeria
    Tang, Jie
    Zhao, Xizhi
    Zhang, Fuhao
    Qiu, Agen
    Tao, Kunwang
    IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 2024, 17 : 3516 - 3529
  • [37] Probabilistic sound source localization
    Lim, Yoon Seob
    Choi, Jong Suk
    Kim, Mun-Sang
    2007 INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS, VOLS 1-6, 2007, : 767 - 770
  • [38] Sound localization on a horizontal surface: virtual and real sound source localization
    Jonathan Lam
    Bill Kapralos
    Kamen Kanev
    Karen Collins
    Andrew Hogue
    Michael Jenkin
    Virtual Reality, 2015, 19 : 213 - 222
  • [39] Sound localization on a horizontal surface: virtual and real sound source localization
    Lam, Jonathan
    Kapralos, Bill
    Kanev, Kamen
    Collins, Karen
    Hogue, Andrew
    Jenkin, Michael
    VIRTUAL REALITY, 2015, 19 (3-4) : 213 - 222
  • [40] SSLIDE: SOUND SOURCE LOCALIZATION FOR INDOORS BASED ON DEEP LEARNING
    Wu, Yifan
    Ayyalasomayajula, Roshan
    Bianco, Michael J.
    Bharadia, Dinesh
    Gerstoft, Peter
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 4680 - 4684