Attention to clapping - A direct method for detecting sound source from video and audio

被引:3
|
作者
Ikeda, T [1 ]
Ishiguro, IE [1 ]
Asada, M [1 ]
机构
[1] Osaka Univ, Grad Sch Engn, Dept Adapt Machine Syst, Suita, Osaka 5650871, Japan
关键词
D O I
10.1109/MFI-2003.2003.1232668
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The research approaches utilizing ubiquitous sensors to support human activities have become of major interest lately. One of the required features of the ubiquitous sensor system is paying its attention to our signals, such as clapping hands and uttering keywords. To detect and localize these signs, it is useful to fuse visual and audio information. The sensor fusion in previous works is performed in the task-level layer through individual representations of the sensors. Therefore, it does not provide new information by fusing sensors. This paper proposes another method that fuses sensory signals based on mutual information maximization in the signal-level layer The fused signal provides us new information that cannot be obtained from individual sensors. As an example, this paper shows two experimental results of a sound source localization by audio-visual fusion.
引用
收藏
页码:264 / 268
页数:5
相关论文
共 50 条
  • [31] Efficient method for detecting targets from remote sensing images based on global attention mechanism
    Gao, Zijun
    Su, Jingwen
    Li, Bo
    Wang, Jue
    Song, Zhankui
    IET IMAGE PROCESSING, 2025, 19 (01)
  • [32] A method for estimating the orientation of a directional sound source from source directivity and multi-microphone recordings: Principles and application
    Guarato, Francesco
    Jakobsen, Lasse
    Vanderelst, Dieter
    Surlykke, Annemarie
    Hallam, John
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2011, 129 (02): : 1046 - 1058
  • [33] Method for measuring the low-frequency sound power from a complex sound source based on sound-field correction in a non-anechoic tank
    徐宏哲
    李琪
    唐锐
    尚大晶
    Chinese Physics B, 2023, 32 (05) : 578 - 593
  • [34] Method for measuring the low-frequency sound power from a complex sound source based on sound-field correction in a non-anechoic tank
    Xu, Hongzhe
    Li, Qi
    Tang, Rui
    Shang, Dajing
    CHINESE PHYSICS B, 2023, 32 (05)
  • [35] A PROBABILISTIC EVALUATION METHOD FOR THE EFFECT OF DIRECT SOUND ON THE DIFFUSENESS OF REVERBERANT SOUND FIELD FROM THE VIEWPOINT OF AN N-DIMENSIONAL SIGNAL SPACE
    OHTA, M
    MIYATA, S
    ACUSTICA, 1985, 58 (02): : 75 - 82
  • [36] Development of the numerical method for calculating sound radiation from a rotating dipole source in an opened thin duct
    Choi, Han-Lim
    Lee, Duck Joo
    JOURNAL OF SOUND AND VIBRATION, 2006, 295 (3-5) : 739 - 752
  • [37] How to Annotate Freezing of Gait from Video: A Standardized Method Using Open-Source Software
    Gilat, Moran
    JOURNAL OF PARKINSONS DISEASE, 2019, 9 (04) : 821 - 824
  • [38] Direct Method for Reconstructing the Radiating Part of a Planar Source from Its Far-Fields
    Xiao, Gaobiao
    Liu, Rui
    ELECTRONICS, 2022, 11 (23)
  • [39] A Detecting and Compensation Method for the Errors from Broken Ground Control Points at the Application of Direct Geo-referencing
    Liu, Tong
    Xu, Guochang
    Yan, Wenlin
    Xu, Tianhe
    2017 FORUM ON COOPERATIVE POSITIONING AND SERVICE (CPGPS), 2017, : 174 - 178
  • [40] On Explainable Closed-Set Source Device Identification Using Log-Mel Spectrograms From Video' Audio: A Grad-CAM Approach
    Korgialas, Christos
    Tzolopoulos, Georgios
    Kotropoulos, Constantine
    IEEE ACCESS, 2024, 12 : 121822 - 121836