Multiple-Channel Audio Construction Equipment Dataset Preparation for Sound Detection and Localization to Prevent Collision Hazards

被引:0
|
作者
Elelu, Kehinde [1 ]
Le, Tuyen [1 ]
Le, Chau [2 ]
机构
[1] Clemson Univ, Glenn Civil Engn Dept, Clemson, SC 29634 USA
[2] North Dakota State Univ, Dept Civil Construct & Environm Engn, Fargo, ND USA
关键词
D O I
暂无
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Construction workplaces often face unforeseen struck-by equipment hazards, leading to severe injuries and fatalities for workers. Detecting and localizing equipment sounds using multi-channel audio data has drawn interest in research. However, collecting such data for developing sound detection and localization machine learning models is challenging. Physical recordings on site required for deep learning are often infeasible due to the lack of proper sound attribute labels from heterogeneous construction sounds. This paper introduces a novel method for synthesizing overlapping and non-overlapping sound datasets in a three-dimensional space, utilizing Pyroomacoustics. The approach uses single sound data with attributes like start time, end time, azimuth, and elevation as microphone input to generate multi-channel audio output. The study successfully simulates 5,025 distinct scenario audios for both datasets, utilizing seven single-sound audiotapes. The generated large dataset can train neural network models capable of localizing equipment collision hazards in construction sites.
引用
收藏
页码:487 / 496
页数:10
相关论文
共 1 条