Human-machine collaboration based sound event detection

Cited by: 1
Authors
Ge, Shengtong [1]
Yu, Zhiwen [1]
Yang, Fan [2]
Liu, Jiaqi [1]
Wang, Liang [2]
Affiliations
[1] Northwestern Polytechnical University, Xi'an 710072, People's Republic of China
[2] Northwestern Polytechnical University, School of Computer Science, Xi'an 710072, People's Republic of China
Funding
National Natural Science Foundation of China
Keywords
Sound event detection; Human-machine collaboration; Deep learning; Semi-supervised learning;
DOI
10.1007/s42486-022-00091-9
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Sound Event Detection (SED) is the task of detecting and demarcating segments with specific semantics in audio recordings. It has promising applications in security monitoring, intelligent medical treatment, industrial production, and other areas. However, SED is still at an early stage of development and faces many challenges, including the lack of accurately annotated data and poor detection performance caused by overlapping sound events. To address these problems, and considering the intelligence of human beings and their flexibility and adaptability in the face of complex problems and changing environments, this paper proposes a human-machine collaboration based SED approach (HMSED). First, to reduce the cost of labeling data, we employ two CNN models with an embedding-level attention pooling module for weakly-labeled SED. Second, to improve the abilities of these two models alternately, we propose an end-to-end guided learning process for semi-supervised learning. Third, we apply a group of median filters with adaptive window sizes when post-processing the output probabilities of the model. Fourth, the model is adjusted and optimized by combining the results of machine recognition with feedback from manual annotation. An interactive annotation interface for HMSED is developed using HTML and JavaScript. We also conduct extensive exploratory experiments on the effects of human workload, model structure, hyperparameters, and adaptive post-processing. The results show that HMSED outperforms some classical SED approaches.
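The post-processing step named in the abstract (a group of median filters with adaptive window size) can be illustrated with a minimal sketch. The abstract does not say how the window is chosen, so the rule below is an assumption: each class gets a window proportional to its average event length in frames. The function names, the `beta` factor, and the 0.5 threshold are hypothetical and are not taken from the paper.

```python
from statistics import median


def median_filter(probs, window):
    """1-D median filter over frame probabilities, replicating edge values as padding."""
    if window < 1 or window % 2 == 0:
        raise ValueError("window must be a positive odd integer")
    if not probs:
        return []
    half = window // 2
    # Pad with copies of the first/last frame so every window is full-sized.
    padded = [probs[0]] * half + list(probs) + [probs[-1]] * half
    return [median(padded[i:i + window]) for i in range(len(probs))]


def adaptive_window(avg_event_frames, beta=1 / 3):
    """Assumed rule: window = beta * average event length, at least 1, forced odd."""
    w = max(1, int(round(avg_event_frames * beta)))
    return w if w % 2 == 1 else w + 1


def smooth_per_class(frame_probs, avg_event_frames, threshold=0.5):
    """Smooth each class with its own window, then threshold to 0/1 frame decisions.

    frame_probs: dict mapping class label -> list of per-frame probabilities.
    avg_event_frames: dict mapping class label -> average event length in frames.
    """
    decisions = {}
    for label, probs in frame_probs.items():
        window = adaptive_window(avg_event_frames[label])
        smoothed = median_filter(probs, window)
        decisions[label] = [1 if p >= threshold else 0 for p in smoothed]
    return decisions
```

Under this assumed rule, classes with long events get wider windows that suppress short spurious activations, while classes with short events keep narrow windows so that brief events are not filtered out.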
Pages: 158-171
Page count: 14
Source: CCF Transactions on Pervasive Computing and Interaction, 2022, 4: 158-171