Computational auditory scene analysis and its application to robot audition: Five years experience

被引:0
|
作者
Okuno, Hiroshi G. [1 ]
Ogata, Tetsuya [1 ]
Komatani, Kazunori [1 ]
机构
[1] Kyoto Univ, Grad Sch Informat, Kyoto 6068501, Japan
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We have been engaged in research on computational auditory scene analysis to attain sophisticated robot/computer human interaction by manipulating real-world sound signals. The objective of our research is the understanding of an arbitrary sound mixture including non-speech sounds and music as well as voiced speech, obtained by robot's ears, that is, microphones embedded in the robot. We have coped with three main issues in computational auditory scene analysis, that is, sound source localization, separation, and recognition of separated sounds for a mixture of speech signals as well as polyphonic music signals. This paper overviews our results in robot audition, in particular, Missing Feature Theory based integration of sound source separation and automatic speech recognition, and those in music information processing, in particular, drum sound equalizer.
引用
收藏
页码:69 / +
页数:3
相关论文
共 50 条
  • [1] Computational auditory scene analysis and its application to robot audition
    Okuno, Hiroshi G.
    Nakadai, Kazuhiro
    2008 HANDS-FREE SPEECH COMMUNICATION AND MICROPHONE ARRAYS, 2008, : 125 - +
  • [2] Computational auditory scene analysis and its application to robot audition
    Okuno, HG
    Ogata, T
    Komatani, K
    Nakadai, K
    INTERNATIONAL CONFERENCE ON INFORMATICS RESEARCH FOR DEVELOPMENT OF KNOWLEDGE SOCIETY INFRASTRUCTURE, PROCEEDINGS, 2004, : 73 - 80
  • [3] Robot Audition and Computational Auditory Scene Analysis
    Nakadai, Kazuhiro
    Okuno, Hiroshi G.
    ADVANCED INTELLIGENT SYSTEMS, 2020, 2 (09)
  • [4] Robot audition from the viewpoint of computational auditory scene analysis
    Okuno, Hiroshi G.
    Ogata, Tetsuya
    Komatani, Kazunori
    INTERNATIONAL CONFERENCE ON INFORMATICS EDUCATION AND RESEARCH FOR KNOWLEDGE-CIRCULATING SOCIETY, PROCEEDINGS, 2008, : 35 - 40
  • [5] COMPUTATIONAL AUDITORY SCENE ANALYSIS
    BROWN, GJ
    COOKE, M
    COMPUTER SPEECH AND LANGUAGE, 1994, 8 (04): : 297 - 336
  • [6] DEVELOPMENT OF ZONAL BEAMFORMER AND ITS APPLICATION TO ROBOT AUDITION
    Tanaka, Nobuaki
    Ogawa, Tetsuji
    Akagiri, Kenzo
    Kobayashi, Tetsunori
    18TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO-2010), 2010, : 1529 - 1533
  • [7] A blackboard architecture for computational auditory scene analysis
    Godsmark, D
    Brown, GJ
    SPEECH COMMUNICATION, 1999, 27 (3-4) : 351 - 366
  • [8] Computational Models of Auditory Scene Analysis: A Review
    Szabo, Beata T.
    Denham, Susan L.
    Winkler, Istvan
    FRONTIERS IN NEUROSCIENCE, 2016, 10
  • [9] Sound ontology for computational auditory scene analysis
    Nakatani, T
    Okuno, HG
    FIFTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-98) AND TENTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICAL INTELLIGENCE (IAAI-98) - PROCEEDINGS, 1998, : 1004 - 1010
  • [10] Blackboard architecture for computational auditory scene analysis
    Godsmark, Darryl
    Brown, Guy J.
    Speech Communication, 1999, 27 (03): : 351 - 366