Computational auditory scene analysis and its application to robot audition: Five years experience

被引：0

作者：

Okuno, Hiroshi G. ^{[1
]}

Ogata, Tetsuya ^{[1
]}

Komatani, Kazunori ^{[1
]}

机构：

[1] Kyoto Univ, Grad Sch Informat, Kyoto 6068501, Japan

来源：

ICKS 2007: SECOND INTERNATIONAL CONFERENCE ON INFORMATICS RESEARCH FOR DEVELOPMENT OF KNOWLEDGE SOCIETY INFRASTRUCTURE, PROCEEDINGS | 2007年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We have been engaged in research on computational auditory scene analysis to attain sophisticated robot/computer human interaction by manipulating real-world sound signals. The objective of our research is the understanding of an arbitrary sound mixture including non-speech sounds and music as well as voiced speech, obtained by robot's ears, that is, microphones embedded in the robot. We have coped with three main issues in computational auditory scene analysis, that is, sound source localization, separation, and recognition of separated sounds for a mixture of speech signals as well as polyphonic music signals. This paper overviews our results in robot audition, in particular, Missing Feature Theory based integration of sound source separation and automatic speech recognition, and those in music information processing, in particular, drum sound equalizer.

引用

页码：69 / +

页数：3

共 50 条

[21] On ideal binary mask as the computational goal of auditory scene analysis
Wang, DL
SPEECH SEPARATION BY HUMANS AND MACHINES, 2005, : 181 - 197
[22] A Computational Approach to the Dynamic Aspects of Primitive Auditory Scene Analysis
Kashino, Makio
Adachi, Eisuke
Hirose, Haruto
BASIC ASPECTS OF HEARING: PHYSIOLOGY AND PERCEPTION, 2013, 787 : 519 - 526
[23] Linking computational auditory scene analysis to automatic speech recognition
Cooke, M
Morris, A
Green, P
ACUSTICA, 1996, 82 : S87 - S87
[24] Building Health Monitoring Using Computational Auditory Scene Analysis
Kawamoto, Mitsuru
Hamamoto, Takuji
16TH ANNUAL INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING IN SENSOR SYSTEMS (DCOSS 2020), 2020, : 144 - 146
[25] Computational Auditory Scene Analysis Based Voice Activity Detection
Tu, Ming
Xie, Xiang
Na, Xingyu
2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 797 - 802
[26] Computational auditory scene analysis in cellular wave computing framework
Fodroczi, Zoltan
Radvanyi, Andras
INTERNATIONAL JOURNAL OF CIRCUIT THEORY AND APPLICATIONS, 2006, 34 (04) : 489 - 515
[27] Using knowledge to organize sound: The prediction-driven approach to computational auditory scene analysis and its application to speech/nonspeech mixtures
Ellis, Daniel P.W.
Speech Communication, 1999, 27 (03): : 281 - 298
[28] Using knowledge to organize sound: The prediction-driven approach to computational auditory scene analysis and its application to speech/nonspeech mixtures
Ellis, DPW
SPEECH COMMUNICATION, 1999, 27 (3-4) : 281 - 298
[29] Improved monaural speech segregation based on computational auditory scene analysis
Wang Yu
Lin Jiajun
Chen Ning
Yuan Wenhao
EURASIP Journal on Audio, Speech, and Music Processing, 2013
[30] Improved monaural speech segregation based on computational auditory scene analysis
Wang Yu
Lin Jiajun
Chen Ning
Yuan Wenhao
EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2013,

← 1 2 3 4 5 →