Computational auditory scene analysis and its application to robot audition: Five years experience

Cited by: 0

Authors:

Okuno, Hiroshi G. [1]

Ogata, Tetsuya [1]

Komatani, Kazunori [1]

Affiliations:

[1] Kyoto Univ, Grad Sch Informat, Kyoto 6068501, Japan

Source:

ICKS 2007: SECOND INTERNATIONAL CONFERENCE ON INFORMATICS RESEARCH FOR DEVELOPMENT OF KNOWLEDGE SOCIETY INFRASTRUCTURE, PROCEEDINGS | 2007

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

We have been engaged in research on computational auditory scene analysis to attain sophisticated human-robot and human-computer interaction by manipulating real-world sound signals. The objective of our research is to understand an arbitrary sound mixture, including non-speech sounds and music as well as voiced speech, captured by the robot's ears, that is, microphones embedded in the robot. We have addressed three main issues in computational auditory scene analysis: sound source localization, sound source separation, and recognition of the separated sounds, for mixtures of speech signals as well as polyphonic music signals. This paper gives an overview of our results in robot audition, in particular the integration of sound source separation and automatic speech recognition based on Missing Feature Theory, and of our results in music information processing, in particular a drum sound equalizer.

Pages: 69 / +

Page count: 3
