Using spatial cues for meeting speech segmentation

Authors
Cheng, E [1]
Lukasiak, J [1]
Burnett, IS [1]
Stirling, D [1]
Affiliations
[1] Univ Wollongong, Sch Elect Comp & Telecommun Engn, Wollongong, NSW 2500, Australia
DOI
10.1109/ICME.2005.1521432
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This work investigates the validity and accuracy of using spatial cues with Time-Delay Estimation (TDE) as a method of segmenting multichannel recorded speech by speaker location. In environments such as meetings where speakers do not significantly alter position, segmentation by speaker location essentially leads to segmentation by speaker 'turn'. The proposed system calculates location information using TDEs and spatial cues extracted from multichannel meeting audio recordings. This location information is then input into a simple segmentation algorithm. Experiments have been performed on both theoretical and real meeting recordings with non-overlapping speakers, and theoretical recordings with overlapping speakers. Segmentation results reveal the most robust cue to be a combination of spatial information and TDEs. This cue combination leads to greater segmentation accuracy for classifying individual speakers and detecting overlapping sections than using spatial cues or time-delay information alone.
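As a rough illustration of the pipeline the abstract describes (time delays estimated from multichannel audio, then passed to a simple segmentation step), the following is a minimal sketch only. The record does not say which estimator or segmentation rule the authors used, so GCC-PHAT, the frame length, the tolerance, and the names `gcc_phat_delay`, `frame_delays`, and `segment_by_delay` are all assumptions made for this example. The spatial cues that the paper combines with TDEs are not modelled here.

```python
import numpy as np

def gcc_phat_delay(x, y, max_lag):
    """Delay of y relative to x, in samples, via GCC-PHAT (assumed estimator).

    max_lag must be at least 1.
    """
    n = len(x) + len(y)                      # zero-pad to avoid circular wrap-around
    X = np.fft.rfft(x, n=n)
    Y = np.fft.rfft(y, n=n)
    cross = Y * np.conj(X)
    cross /= np.abs(cross) + 1e-12           # PHAT weighting: keep phase only
    cc = np.fft.irfft(cross, n=n)
    # Reorder so the array covers lags -max_lag .. +max_lag
    cc = np.concatenate((cc[-max_lag:], cc[:max_lag + 1]))
    return int(np.argmax(cc)) - max_lag

def frame_delays(ch1, ch2, frame_len, max_lag):
    """Estimate one inter-channel delay per non-overlapping frame."""
    n_frames = len(ch1) // frame_len
    return [
        gcc_phat_delay(ch1[i * frame_len:(i + 1) * frame_len],
                       ch2[i * frame_len:(i + 1) * frame_len],
                       max_lag)
        for i in range(n_frames)
    ]

def segment_by_delay(delays, tol=1):
    """Group consecutive frames whose delay stays within tol of the segment's first frame.

    Returns (start_frame, end_frame_exclusive, delay) tuples. This is a
    hypothetical stand-in for the paper's "simple segmentation algorithm".
    """
    segments, start = [], 0
    for i in range(1, len(delays) + 1):
        if i == len(delays) or abs(delays[i] - delays[start]) > tol:
            segments.append((start, i, delays[start]))
            start = i
    return segments

# Synthetic two-microphone recording: two non-overlapping "speakers" at
# different positions, modelled as white noise with different delays.
rng = np.random.default_rng(0)
frame_len, max_lag = 1024, 8
src_a = rng.standard_normal(4 * frame_len)
src_b = rng.standard_normal(4 * frame_len)
ch1 = np.concatenate((src_a, src_b))
ch2 = np.concatenate((np.roll(src_a, 3), np.roll(src_b, -4)))  # delays +3 and -4 samples

delays = frame_delays(ch1, ch2, frame_len, max_lag)
segments = segment_by_delay(delays)
print(segments)
```

Each tuple in `segments` marks a run of frames with a stable delay, which under the abstract's assumption of stationary speakers corresponds to one speaker turn. Real meeting audio would also need voice activity detection and handling of overlapping speech, neither of which this sketch attempts.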
Pages: 350-353
Number of pages: 4
Related papers
50 in total
  • [31] Prosodic cues enhance rule learning by changing speech segmentation mechanisms
    de Diego-Balaguer, Ruth
    Rodriguez-Fornells, Antoni
    Bachoud-Levi, Anne-Catherine
    [J]. FRONTIERS IN PSYCHOLOGY, 2015, 6
  • [32] F0 slope and mean: cues to speech segmentation in French
    Cordero, Maria del Mar
    Meunier, Fanny
    Grimault, Nicolas
    Pota, Stephane
    Spinelli, Elsa
    [J]. INTERSPEECH 2020, 2020, : 1610 - 1614
  • [33] Neural correlates of acoustic and semantic cues during speech segmentation in French
    Cordero, Maria del Mar
    Denis-Noel, Ambre
    Spinelli, Elsa
    Meunier, Fanny
    [J]. INTERSPEECH 2022, 2022, : 4058 - 4062
  • [34] LEXICAL SEGMENTATION CUES IN SPONTANEOUS SPEECH - MORE QUESTIONS THAN ANSWERS
    SCHU, J
    STEIN, S
    [J]. DEUTSCHE SPRACHE, 1994, 22 (03): : 241 - 260
  • [35] Rhythmic cues and possible-word constraints in Japanese speech segmentation
    McQueen, JM
    Otake, T
    Cutler, A
    [J]. JOURNAL OF MEMORY AND LANGUAGE, 2001, 45 (01) : 103 - 132
  • [36] Syntactic Cues Take Precedence Over Distributional Cues in Native and Non-Native Speech Segmentation
    Tremblay, Annie
    Spinelli, Elsa
    Coughlin, Caitlin E.
    Namjoshi, Jui
    [J]. LANGUAGE AND SPEECH, 2018, 61 (04) : 615 - 631
  • [37] Binaural Speech Separation of Moving Speakers With Preserved Spatial Cues
    Han, Cong
    Luo, Yi
    Mesgarani, Nima
    [J]. INTERSPEECH 2021, 2021, : 3505 - 3509
  • [38] Harmonic cues for speech segmentation: a cross-linguistic corpus study on child-directed speech
    Ketrez, F. Nihan
    [J]. JOURNAL OF CHILD LANGUAGE, 2014, 41 (02) : 439 - 461
  • [39] Jordanian EFL Students’ Perception of Noncontrastive Allophonic Cues in English Speech Segmentation
    Ghaleb Rabab’ah
    Sara Kessar
    Nimer Abusalim
    [J]. Journal of Psycholinguistic Research, 2023, 52 : 1455 - 1469
  • [40] The impact of attention load on the use of statistical information and coarticulation as speech segmentation cues
    Fernandes, Tania
    Kolinsky, Regine
    Ventura, Paulo
    [J]. ATTENTION PERCEPTION & PSYCHOPHYSICS, 2010, 72 (06) : 1522 - 1532