A Dataset and Taxonomy for Urban Sound Research

被引:614
|
作者
Salamon, Justin [1 ,2 ]
Jacoby, Christopher [1 ]
Bello, Juan Pablo [1 ]
机构
[1] NYU, Mus & Audio Res Lab, New York, NY 10003 USA
[2] NYU, Ctr Urban Sci & Progress, New York, NY 10003 USA
关键词
Urban sound; dataset; taxonomy; classification;
D O I
10.1145/2647868.2655045
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Automatic urban sound classification is a growing area of research with applications in multimedia retrieval and urban informatics. In this paper we identify two main barriers to research in this area - the lack of a common taxonomy and the scarceness of large, real-world, annotated data. To address these issues we present a taxonomy of urban sounds and a new dataset, UrbanSound, containing 27 hours of audio with 18.5 hours of annotated sound event occurrences across 10 sound classes. The challenges presented by the new dataset are studied through a series of experiments using a baseline classification system.
引用
收藏
页码:1041 / 1044
页数:4
相关论文
共 50 条
  • [1] Dataset for polyphonic sound event detection tasks in urban soundscapes: The synthetic polyphonic ambient sound source (SPASS) dataset
    Viveros-Munoz, Rhoddy
    Huijse, Pablo
    Vargas, Victor
    Espejo, Diego
    Poblete, Victor
    Arenas, Jorge P.
    Vernier, Matthieu
    Vergara, Diego
    Suarez, Enrique
    DATA IN BRIEF, 2023, 50
  • [2] URBAN SOUND & SIGHT: DATASET AND BENCHMARK FOR AUDIO-VISUAL URBAN SCENE UNDERSTANDING
    Fuentes, Magdalena
    Steers, Bea
    Zinemanas, Pablo
    Rocamora, Martin
    Bondi, Luca
    Wilkins, Julia
    Shi, Qianyi
    Hou, Yao
    Das, Samarjit
    Serra, Xavier
    Bello, Juan Pablo
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 141 - 145
  • [3] BCNDataset: Description and Analysis of an Annotated Night Urban Leisure Sound Dataset
    Vidana-Vila, Ester
    Duboc, Leticia
    Alsina-Pages, Rosa Ma
    Polls, Francesc
    Vargas, Harold
    SUSTAINABILITY, 2020, 12 (19)
  • [4] The Audio-Visual BatVision Dataset for Research on Sight and Sound
    Brunetto, Amandine
    Hornauer, Sascha
    Yu, Stella X.
    Moutarde, Fabien
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 5812 - 5819
  • [5] A Taxonomy and Dataset for 360° Videos
    Nasrabadi, Afshin Taghavi
    Samiei, Aliehsan
    Mahzari, Anahita
    McMahan, Ryan P.
    Prakash, Ravi
    Farias, Mylene C. Q.
    Carvalho, Marcelo M.
    PROCEEDINGS OF THE 10TH ACM MULTIMEDIA SYSTEMS CONFERENCE (ACM MMSYS'19), 2019, : 273 - 278
  • [6] Sound and music computing taxonomy
    McGee, W
    COMPUTER MUSIC JOURNAL, 1997, 21 (01) : 9 - 10
  • [7] Sound Reflections The taxonomy of echoes
    Cox, Trevor
    NATURAL HISTORY, 2013, 121 (10) : 24 - 31
  • [8] A taxonomy of sound sources in restaurants
    Lindborg, PerMagnus
    APPLIED ACOUSTICS, 2016, 110 : 297 - 310
  • [9] A TAXONOMY FOR SOUND AND MUSIC COMPUTING
    CAMURRI, A
    DEPOLI, G
    ROCCHESSO, D
    COMPUTER MUSIC JOURNAL, 1995, 19 (02) : 4 - 5
  • [10] Research on Sound Field Evaluation of Urban Walking Street Through Sound Environment Perception
    School of Architecture, Feng Chia University, Taichung, Taiwan
    不详
    Signals Commun. Technol., (63-74):