A novel visual representation method for multi-dimensional sound scene analysis in source localization problem

被引:0
|
作者
Jung, In-Jee [1 ]
Cho, Wan-Ho [1 ]
机构
[1] Korea Res Inst Stand & Sci KRISS, Div Phys Metrol, Acoust Ultrasound Vibrat Metrol Grp, Daejeon 34113, South Korea
关键词
Sound source localization; Visualization; Direction of arrival; Human-interpretable image dataset; Data augmentation; Sound scene analysis; NOISE;
D O I
10.1016/j.ymssp.2023.110977
中图分类号
TH [机械、仪表工业];
学科分类号
0802 ;
摘要
A visual representation method, DoAgram, for multi-dimensional sound scene analysis is suggested. The visual representation of the sound source localization result gives intuitive information about the estimated one to the end user. Also, image-based deep learning is popularly used in the acoustic field nowadays, so such a visualized one can be used for data augmentation. To analyze the spatial sound scene for the moving source, the method displays the estimated azimuth angle and elevation angle of the source, and its corresponding time stamp and frequencies as RGB color channels and metadata by mapping the spatial coordinate to color space. Even though the suggested method is human-interpretable, decoding is needed for the quantitative analysis. Therefore, the time and frequency scanning method, and a histogram to estimate the DoA of the source are proposed. An experiment is conducted in an anechoic chamber to localize two quadcopter drones that have a mean angular velocity of 8 degrees/s f 9 degrees/s (95 % CI) and 25 degrees/s f 31 degrees/s (95 % CI), respectively, and the spatial sound scene analysis is implemented using the proposed methods. The test result shows that the trajectories with respect to the time of each source are well separated. Also, an additional test is conducted using an open-access audio dataset for machine learning. The cumulative source mapping method is adopted for the spatial sound scene analysis, and the decoded result shows that the DoAgram is feasible to adopt for machine learning applications.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Multiple Sound Source Localization Based on a Multi-dimensional Assignment Model
    Dang, Xudong
    Zhu, Hongyan
    Cheng, Qi
    [J]. 2018 21ST INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION), 2018, : 1732 - 1737
  • [2] Indoor Multiple Sound Source Localization via Multi-Dimensional Assignment Data Association
    Dang, Xudong
    Cheng, Qi
    Zhu, Hongyan
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (12) : 1944 - 1956
  • [3] An Interactive Visual Analysis Method for Multi-Dimensional Data Deduplication
    Zhu, Haiyang
    Qian, Zhonghao
    Yan, Fan
    Mao, Ketian
    Ying, Haojian
    Wang, Jie
    Chen, Wei
    [J]. Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2022, 34 (06): : 841 - 851
  • [4] Activity Gazer: A multi-dimensional visual representation of project performance
    Lee, Namhun
    Rojas, Eddy M.
    [J]. AUTOMATION IN CONSTRUCTION, 2014, 44 : 25 - 32
  • [5] A Differential Quadrature Method for Multi-Dimensional Inverse Heat Conduction Problem of Heat Source
    Wu, Jiun-Yu
    Chang, Chih-Wen
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2011, 25 (03): : 215 - 237
  • [6] Source Identification of Non-Stationary Sound-Vibration Signals Using Multi-Dimensional Spectral Analysis Method
    Sim, Hyoun-Jin
    Lee, Hae-Jin
    Lee, You-Yub
    Lee, Jung-Youn
    Oh, Jae-Eung
    [J]. TRANSACTIONS OF THE KOREAN SOCIETY OF MECHANICAL ENGINEERS A, 2006, 30 (09) : 1154 - 1159
  • [7] Automatic Problem Localization via Multi-dimensional Metric Profiling
    Laguna, Ignacio
    Mitra, Subrata
    Arshad, Fahad A.
    Theera-Ampornpunt, Nawanol
    Zhu, Zongyang
    Bagchi, Saurabh
    Midkiff, Samuel P.
    Kistler, Mike
    Gheith, Ahmed
    [J]. 2013 IEEE 32ND INTERNATIONAL SYMPOSIUM ON RELIABLE DISTRIBUTED SYSTEMS (SRDS 2013), 2013, : 121 - 132
  • [8] The problem of rape in India: a multi-dimensional analysis
    Sharma, Radha R.
    Pardasani, Rupali
    Nandram, Sharda
    [J]. INTERNATIONAL JOURNAL OF MANAGING PROJECTS IN BUSINESS, 2014, 7 (03) : 362 - 379
  • [9] Unsupervised Visual Representation Learning via Multi-Dimensional Relationship Alignment
    Cheng, Haoyang
    Li, Hongliang
    Qiu, Heqian
    Wu, Qingbo
    Zhang, Xiaoliang
    Meng, Fanman
    Ngan, King Ngi
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 1613 - 1626
  • [10] Representation of Green's function of the Neumann problem for a multi-dimensional ball
    Sadybekov, M. T.
    Torebek, B. T.
    Turmetov, B. Kh.
    [J]. COMPLEX VARIABLES AND ELLIPTIC EQUATIONS, 2016, 61 (01) : 104 - 123