Sound Event Detection: A tutorial

被引:74
|
作者
Mesaros, Annamaria [1 ]
Heittola, Toni [2 ]
Virtanen, Tuomas [3 ]
Plumbley, Mark D. [4 ,5 ]
机构
[1] Tampere Univ, Machine Listening Grp, Korkeakoulunkatu 33014, Finland
[2] Tampere Univ, Korkeakoulunkatu 33014, Finland
[3] Tampere Univ, Audio Res Grp, Korkeakoulunkatu 33014, Finland
[4] Univ Surrey, Ctr Vis Speech & Signal Proc, Signal Proc, Guildford GU2 7XH, Surrey, England
[5] Univ Surrey, Sch Comp Sci & Elect Engn, Guildford GU2 7XH, Surrey, England
基金
欧洲研究理事会; 芬兰科学院; 英国工程与自然科学研究理事会;
关键词
AUDIO;
D O I
10.1109/MSP.2021.3090678
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Imagine standing on a street corner in the city. With your eyes closed you can hear and recognize a succession of sounds: cars passing by, people speaking, their footsteps when they walk by, and the continuous falling of rain. The recognition of all these sounds and interpretation of the perceived scene as a city street soundscape comes naturally to humans. It is, however, the result of years of "training": encountering and learning associations among the vast varieties of sounds in everyday life, the sources producing these sounds, and the names given to them.
引用
收藏
页码:67 / 83
页数:17
相关论文
共 50 条
  • [1] Event Specific Attention for Polyphonic Sound Event Detection
    Sundar, Harshavardhan
    Sun, Ming
    Wang, Chao
    INTERSPEECH 2021, 2021, : 566 - 570
  • [2] Metrics for Polyphonic Sound Event Detection
    Mesaros, Annamaria
    Heittola, Toni
    Virtanen, Tuomas
    APPLIED SCIENCES-BASEL, 2016, 6 (06):
  • [3] Improving sound event detection with ontologies
    Raj, Bhiksha
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2023, 153 (03):
  • [4] Active Learning for Sound Event Detection
    Shuyang Zhao
    Heittola, Toni
    Virtanen, Tuomas
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 : 2895 - 2905
  • [5] A Mobile Application for Sound Event Detection
    Fu, Yingwei
    Xu, Kele
    Mi, Haibo
    Wang, Huaimin
    Wang, Dezhi
    Zhu, Boqing
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 6515 - 6517
  • [6] Capsule Routing for Sound Event Detection
    Iqbal, Turab
    Xu, Yong
    Kong, Qiuqiang
    Wang, Wenwu
    2018 26TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2018, : 2255 - 2259
  • [7] Sound-Event Partitioning and Feature Normalization for Robust Sound-Event Detection
    Lei, Baiying
    Mak, Man-Wai
    2014 19TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2014, : 389 - 394
  • [8] Robust scream sound detection via sound event partitioning
    Baiying Lei
    Man-Wai Mak
    Multimedia Tools and Applications, 2016, 75 : 6071 - 6089
  • [9] Robust scream sound detection via sound event partitioning
    Lei, Baiying
    Mak, Man-Wai
    MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (11) : 6071 - 6089
  • [10] Real-time Event Detection for Emergency Response Tutorial
    Jaimes, Alejandro
    Tetreault, Joel
    KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, : 4042 - 4043