Computational auditory scene analysis and its application to robot audition: Five years experience

被引:0
|
作者
Okuno, Hiroshi G. [1 ]
Ogata, Tetsuya [1 ]
Komatani, Kazunori [1 ]
机构
[1] Kyoto Univ, Grad Sch Informat, Kyoto 6068501, Japan
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We have been engaged in research on computational auditory scene analysis to attain sophisticated robot/computer human interaction by manipulating real-world sound signals. The objective of our research is the understanding of an arbitrary sound mixture including non-speech sounds and music as well as voiced speech, obtained by robot's ears, that is, microphones embedded in the robot. We have coped with three main issues in computational auditory scene analysis, that is, sound source localization, separation, and recognition of separated sounds for a mixture of speech signals as well as polyphonic music signals. This paper overviews our results in robot audition, in particular, Missing Feature Theory based integration of sound source separation and automatic speech recognition, and those in music information processing, in particular, drum sound equalizer.
引用
收藏
页码:69 / +
页数:3
相关论文
共 50 条
  • [21] On ideal binary mask as the computational goal of auditory scene analysis
    Wang, DL
    SPEECH SEPARATION BY HUMANS AND MACHINES, 2005, : 181 - 197
  • [22] A Computational Approach to the Dynamic Aspects of Primitive Auditory Scene Analysis
    Kashino, Makio
    Adachi, Eisuke
    Hirose, Haruto
    BASIC ASPECTS OF HEARING: PHYSIOLOGY AND PERCEPTION, 2013, 787 : 519 - 526
  • [23] Linking computational auditory scene analysis to automatic speech recognition
    Cooke, M
    Morris, A
    Green, P
    ACUSTICA, 1996, 82 : S87 - S87
  • [24] Building Health Monitoring Using Computational Auditory Scene Analysis
    Kawamoto, Mitsuru
    Hamamoto, Takuji
    16TH ANNUAL INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING IN SENSOR SYSTEMS (DCOSS 2020), 2020, : 144 - 146
  • [25] Computational Auditory Scene Analysis Based Voice Activity Detection
    Tu, Ming
    Xie, Xiang
    Na, Xingyu
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 797 - 802
  • [26] Computational auditory scene analysis in cellular wave computing framework
    Fodroczi, Zoltan
    Radvanyi, Andras
    INTERNATIONAL JOURNAL OF CIRCUIT THEORY AND APPLICATIONS, 2006, 34 (04) : 489 - 515
  • [27] Using knowledge to organize sound: The prediction-driven approach to computational auditory scene analysis and its application to speech/nonspeech mixtures
    Ellis, Daniel P.W.
    Speech Communication, 1999, 27 (03): : 281 - 298
  • [28] Using knowledge to organize sound: The prediction-driven approach to computational auditory scene analysis and its application to speech/nonspeech mixtures
    Ellis, DPW
    SPEECH COMMUNICATION, 1999, 27 (3-4) : 281 - 298
  • [29] Improved monaural speech segregation based on computational auditory scene analysis
    Wang Yu
    Lin Jiajun
    Chen Ning
    Yuan Wenhao
    EURASIP Journal on Audio, Speech, and Music Processing, 2013
  • [30] Improved monaural speech segregation based on computational auditory scene analysis
    Wang Yu
    Lin Jiajun
    Chen Ning
    Yuan Wenhao
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2013,