Varying microphone patterns for meeting speech segmentation using spatial audio cues

Cited by: 0
Authors
Cheng, Eva [1]
Burnett, Ian [1]
Ritz, Christian [1]
Affiliations
[1] Univ Wollongong, Whisper Labs, Sch Elect Comp & Telecommun Engn, Wollongong, NSW 2522, Australia
Keywords
spatial audio cues; meeting audio analysis; microphone arrays;
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Meetings, common to many business environments, generally involve stationary participants. Thus, participant location information can be used to segment meeting speech recordings into each speaker's 'turn'. The authors' previous work proposed the use of spatial audio cues to represent the speaker locations. This paper studies the validity of using spatial audio cues for meeting speech segmentation by investigating the effect of varying microphone pattern on the spatial cues. Experiments conducted on recordings of a real acoustic environment indicate that the relationship between speaker location and spatial audio cues strongly depends on the microphone pattern.
Pages: 221 / +
Page count: 2