Computational auditory scene analysis and its application to robot audition: Five years experience

被引:0
|
作者
Okuno, Hiroshi G. [1 ]
Ogata, Tetsuya [1 ]
Komatani, Kazunori [1 ]
机构
[1] Kyoto Univ, Grad Sch Informat, Kyoto 6068501, Japan
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We have been engaged in research on computational auditory scene analysis to attain sophisticated robot/computer human interaction by manipulating real-world sound signals. The objective of our research is the understanding of an arbitrary sound mixture including non-speech sounds and music as well as voiced speech, obtained by robot's ears, that is, microphones embedded in the robot. We have coped with three main issues in computational auditory scene analysis, that is, sound source localization, separation, and recognition of separated sounds for a mixture of speech signals as well as polyphonic music signals. This paper overviews our results in robot audition, in particular, Missing Feature Theory based integration of sound source separation and automatic speech recognition, and those in music information processing, in particular, drum sound equalizer.
引用
收藏
页码:69 / +
页数:3
相关论文
共 50 条
  • [41] SOURCE SEPARATION WITH WEAKLY LABELLED DATA: AN APPROACH TO COMPUTATIONAL AUDITORY SCENE ANALYSIS
    Kong, Qiuglang
    Wang, Yuxuan
    Song, Xuchen
    Cao, Yin
    Wang, Wenwu
    Plumbley, Mark D.
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 101 - 105
  • [42] Application of loudness/pitch/timbre decomposition operators to auditory scene analysis
    Abe, M
    Ando, S
    1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 2646 - 2649
  • [43] Sound source separation via computational auditory scene analysis-enhanced beamforming
    Drake, L
    Katsaggelos, AK
    Rutledge, JC
    Zhang, J
    SAM2002: IEEE SENSOR ARRAY AND MULTICHANNEL SIGNAL PROCESSING WORKSHOP PROCEEDINGS, 2002, : 259 - 263
  • [44] A comparison of several computational auditory scene analysis (CASA) techniques for monaural speech segregation
    Zeremdini J.
    Ben Messaoud M.A.
    Bouzid A.
    Brain Informatics, 2015, 2 (3) : 155 - 166
  • [45] A Computational Auditory Scene Analysis-Enhanced Beamforming Approach for Sound Source Separation
    L. A. Drake
    J. C. Rutledge
    J. Zhang
    A. Katsaggelos (EURASIP Member)
    EURASIP Journal on Advances in Signal Processing, 2009
  • [46] A Computational Auditory Scene Analysis-Enhanced Beamforming Approach for Sound Source Separation
    Drake, L. A.
    Rutledge, J. C.
    Zhang, J.
    Katsaggelos, A.
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2009,
  • [47] Auditory scene analysis via application of ICA in a time-frequency domain
    Janku, L
    TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2004, 3206 : 347 - 353
  • [48] Monaural speech separation based on computational auditory scene analysis and objective quality assessment of speech
    Li, Peng
    Guan, Yong
    Xu, Bo
    Liu, Wenju
    ICICIC 2006: FIRST INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING, INFORMATION AND CONTROL, VOL 2, PROCEEDINGS, 2006, : 742 - +
  • [49] Monaural speech separation based on computational auditory scene analysis and objective quality assessment of speech
    Li, Peng
    Guan, Yong
    Xu, Bo
    Liu, Wenju
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (06): : 2014 - 2023
  • [50] Cytogenetic Analysis for Suspected Chromosomal Abnormalities; A Five Years Experience
    Polipalli, Sunil Kumar
    Karra, Vijay Kumar
    Jindal, Ankur
    Puppala, Madhavi
    Singh, Pratiksha
    Rawat, Kanchan
    Kapoor, Seema
    JOURNAL OF CLINICAL AND DIAGNOSTIC RESEARCH, 2016, 10 (09) : GC1 - GC5