Human-machine collaboration based sound event detection

被引:1
|
作者
Ge, Shengtong [1 ]
Yu, Zhiwen [1 ]
Yang, Fan [2 ]
Liu, Jiaqi [1 ]
Wang, Liang [2 ]
机构
[1] Northwestern Polytech Univ, Xian 710072, Peoples R China
[2] Northwestern Polytech Univ, Sch Comp Sci, Xian 710072, Peoples R China
基金
中国国家自然科学基金;
关键词
Sound event detection; Human-machine collaboration; Deep learning; Semi-supervised learning;
D O I
10.1007/s42486-022-00091-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sound Event Detection (SED) is the task of detecting and demarcating the segments with specific semantics in audio recording. It has a promising application prospect in security monitoring, intelligent medical treatment, industrial production and so on. However, SED is still in the early stage of development and it faces many challenges, including the lack of accurately annotated data and the poor performance on detection due to the overlap of sound events. In view of the above problems, considering the intelligence of human beings and their flexibility and adaptability in the face of complex problems and changing environment, this paper proposes an approach of human-machine collaboration based SED (HMSED). In order to reduce the cost of labeling data, we first employ two CNN models with embedding-level attention pool module for weakly-labeled SED. Second, in order to improve the abilities of these two models alternately, we propose an end-to-end guided learning process for semi-supervised learning. Third, we use a group of median filters with adaptive window size in the post-processing of output probabilities of the model. Fourth, the model is adjusted and optimized by combining the results of machine recognition and manual annotation feedback. Based on HTML and JavaScript, an interactive annotation interface for HMSED is developed. And we do extensive exploratory experiments on the effects of human workload, model structure, hyperparameter and adaptive post-processing. The result shows that the HMSED is superior to some classical SED approaches.
引用
收藏
页码:158 / 171
页数:14
相关论文
共 50 条
  • [41] Intelligent Information Design Based on Human-Machine Collaboration in Lane Change Overtaking Scenarios
    Wang, Jianmin
    Cui, Xinyi
    Fu, Qianwen
    Wang, Yuchen
    You, Fang
    HUMAN INTERFACE AND THE MANAGEMENT OF INFORMATION, PT I, HIMI 2024, 2024, 14689 : 81 - 96
  • [42] KGMM - A Maturity Model for Scholarly Knowledge Graphs Based on Intertwined Human-Machine Collaboration
    Hussein, Hassan
    Oelen, Allard
    Karras, Oliver
    Auer, Soeren
    FROM BORN-PHYSICAL TO BORN-VIRTUAL: AUGMENTING INTELLIGENCE IN DIGITAL LIBRARIES, ICADL 2022, 2022, 13636 : 253 - 269
  • [43] Towards Human-Machine Collaboration: Multimodal Group Potency Estimation
    Corbellini, Nicola
    PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, ICMI 2022, 2022, : 685 - 689
  • [44] Human-machine collaboration to disambiguate entities in unstructured text datasets
    Davenport, Jack H.
    NEXT-GENERATION ANALYST VI, 2018, 10653
  • [45] Intermediate deep feature coding for human-machine vision collaboration
    Wang, Weiqian
    An, Ping
    Huang, Xinpeng
    Huang, Kunqiang
    Yang, Chao
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2023, 95
  • [46] Modelling cognitive and affective load for the design of human-machine collaboration
    Neerincx, Mark A.
    ENGINEERING PSYCHOLOGY AND COGNITIVE ERGONOMICS, PROCEEDINGS, 2007, 4562 : 568 - 574
  • [47] Human-machine collaboration for enhanced decision-making in governance
    Van Rooy, Dirk
    DATA & POLICY, 2024, 6
  • [48] Best of both worlds: human-machine collaboration for object annotation
    Russakovsky, Olga
    Lie, Li-Jia
    Li Fei-Fei
    2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 2121 - 2131
  • [49] A Human-Machine Collaboration Model for Urban Planning in Smart Cities
    Meza, Jaime
    Vaca-Cardenas, Leticia
    Elva Vaca-Cardenas, Monica
    Teran, Luis
    Portmann, Edy
    COMPUTER, 2021, 54 (06) : 24 - 35
  • [50] A Vision for Human-Machine Mutual Understanding, Trust Establishment, and Collaboration
    Azevedo, Carlos R. B.
    Raizer, Klaus
    Souza, Ricardo
    2017 IEEE CONFERENCE ON COGNITIVE AND COMPUTATIONAL ASPECTS OF SITUATION MANAGEMENT (COGSIMA), 2017,