A Dataset and Taxonomy for Urban Sound Research

被引:614
|
作者
Salamon, Justin [1 ,2 ]
Jacoby, Christopher [1 ]
Bello, Juan Pablo [1 ]
机构
[1] NYU, Mus & Audio Res Lab, New York, NY 10003 USA
[2] NYU, Ctr Urban Sci & Progress, New York, NY 10003 USA
关键词
Urban sound; dataset; taxonomy; classification;
D O I
10.1145/2647868.2655045
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Automatic urban sound classification is a growing area of research with applications in multimedia retrieval and urban informatics. In this paper we identify two main barriers to research in this area - the lack of a common taxonomy and the scarceness of large, real-world, annotated data. To address these issues we present a taxonomy of urban sounds and a new dataset, UrbanSound, containing 27 hours of audio with 18.5 hours of annotated sound event occurrences across 10 sound classes. The challenges presented by the new dataset are studied through a series of experiments using a baseline classification system.
引用
收藏
页码:1041 / 1044
页数:4
相关论文
共 50 条
  • [41] HAASD: A dataset of Household Appliances Abnormal Sound Detection
    Jiang, Yong
    Li, Chunyang
    Li, Nan
    Feng, Tao
    Liu, Meilian
    PROCEEDINGS OF 2018 THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ARTIFICIAL INTELLIGENCE (CSAI 2018) / 2018 THE 10TH INTERNATIONAL CONFERENCE ON INFORMATION AND MULTIMEDIA TECHNOLOGY (ICIMT 2018), 2018, : 6 - 10
  • [42] A dataset of Solicited Cough Sound for Tuberculosis Triage Testing
    Sophie Huddart
    Vijay Yadav
    Solveig K. Sieberts
    Larson Omberg
    Mihaja Raberahona
    Rivo Rakotoarivelo
    Issa N. Lyimo
    Omar Lweno
    Devasahayam J. Christopher
    Nguyen Viet Nhung
    Grant Theron
    William Worodria
    Charles Y. Yu
    Christine M. Bachman
    Stephen Burkot
    Puneet Dewan
    Sourabh Kulhare
    Peter M. Small
    Adithya Cattamanchi
    Devan Jaganath
    Simon Grandjean Lapierre
    Scientific Data, 11 (1)
  • [43] Forest Sound Classification Dataset: FSC22
    Bandara, Meelan
    Jayasundara, Roshinie
    Ariyarathne, Isuru
    Meedeniya, Dulani
    Perera, Charith
    SENSORS, 2023, 23 (04)
  • [44] An empirically sound telemedicine taxonomy – applying the CAFE methodology
    Lorenz Harst
    Lena Otto
    Patrick Timpel
    Peggy Richter
    Hendrikje Lantzsch
    Bastian Wollschlaeger
    Katja Winkler
    Hannes Schlieter
    Journal of Public Health, 2022, 30 : 2729 - 2740
  • [45] REALIMPACT: A Dataset of Impact Sound Fields for Real Objects
    Clarke, Samuel
    Xu, Julia
    Gao, Ruohan
    Wang, Jui-Hsien
    Wang, Mason
    James, Doug L.
    Rau, Mark
    Wu, Jiajun
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 1516 - 1525
  • [46] An Indoor Sound Source Localization Dataset for Machine Learning
    Wu, Tao
    Jiang, Yong
    Li, Nan
    Feng, Tao
    PROCEEDINGS OF 2018 THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ARTIFICIAL INTELLIGENCE (CSAI 2018) / 2018 THE 10TH INTERNATIONAL CONFERENCE ON INFORMATION AND MULTIMEDIA TECHNOLOGY (ICIMT 2018), 2018, : 28 - 32
  • [47] CONDUCT: An Expressive Conducting Gesture Dataset for Sound Control
    Chen, Lei
    Gibet, Sylvie
    Marteau, Camille
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 1719 - 1725
  • [48] Urban Sound: An Analysis of Discourses Constructed by Sound Studies
    Idrisova, Sevindzh
    LOGOS-VILNIUS, 2019, (101): : 199 - 206
  • [49] A Taxonomy of Classification Approaches in IS Research
    Gerber, Aurona
    Baskerville, Richard
    van der Merwe, Alta
    AMCIS 2017 PROCEEDINGS, 2017,
  • [50] A Taxonomy and Survey of SCTP Research
    Budzisz, Lukasz
    Garcia, Johan
    Brunstrom, Anna
    Ferrus, Ramon
    ACM COMPUTING SURVEYS, 2012, 44 (04)