Syllable Segmentation of Continuous Speech Using Auditory Attention Cues

被引:0
|
作者
Kalinli, Ozlem [1 ]
机构
[1] Sony Comp Entertainment, US R&D, Foster City, CA USA
关键词
syllabification; syllable boundary prediction; syllable nuclei detection; auditory attention; auditory gist;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Segmentation of speech into syllables is beneficial for many spoken language processing applications since it provides information about phonological and rhythmic aspects of speech. Traditional methods usually detect syllable nuclei using features such as energies in critical bands, linear predictive coding spectra, pitch, voicing, etc. Here, a novel system that uses auditory attention cues is proposed for predicting syllable boundaries. The auditory attention cues are biologically inspired and capture changes in sound characteristic by using 2D spectro-temporal receptive filters. When tested on TIMIT, it is shown that the proposed method successfully predicts syllable boundaries and performs as good as or better than the state-of-the art syllable nucleus detection methods.
引用
收藏
页码:432 / 435
页数:4
相关论文
共 50 条
  • [1] Visual speech segmentation: using facial cues to locate word boundaries in continuous speech
    Mitchel, Aaron D.
    Weiss, Daniel J.
    [J]. LANGUAGE COGNITION AND NEUROSCIENCE, 2014, 29 (07) : 771 - 780
  • [2] SYLLABLE DETECTION IN CONTINUOUS SPEECH
    SARGENT, DC
    LI, KP
    FU, KS
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1974, 55 (02): : 410 - 410
  • [3] Newborns are sensitive to multiple cues for word segmentation in continuous speech
    Flo, Ana
    Brusini, Perrine
    Macagno, Francesco
    Nespor, Marina
    Mehler, Jacques
    Ferry, Alissa L.
    [J]. DEVELOPMENTAL SCIENCE, 2019, 22 (04)
  • [4] Using spatial cues for meeting speech segmentation
    Cheng, E
    Lukasiak, J
    Burnett, IS
    Stirling, D
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), VOLS 1 AND 2, 2005, : 350 - 353
  • [5] MORA OR SYLLABLE - SPEECH SEGMENTATION IN JAPANESE
    OTAKE, T
    HATANO, G
    CUTLER, A
    MEHLER, J
    [J]. JOURNAL OF MEMORY AND LANGUAGE, 1993, 32 (02) : 258 - 278
  • [6] THE CAPTURE OF VISUAL ATTENTION USING AUDITORY CUES IN SCHIZOPHRENIA
    Kean, Matthew
    Crawford, Trevor
    Wolohan, Felicity
    Kumari, Veena
    Ettinger, Ulrich
    [J]. SCHIZOPHRENIA RESEARCH, 2010, 117 (2-3) : 250 - 250
  • [7] Ear-EEG Measures of Auditory Attention to Continuous Speech
    Holtze, Bjoern
    Rosenkranz, Marc
    Jaeger, Manuela
    Debener, Stefan
    Mirkovic, Bojana
    [J]. FRONTIERS IN NEUROSCIENCE, 2022, 16
  • [8] A Saliency-Based Auditory Attention Model with Applications to Unsupervised Prominent Syllable Detection in Speech
    Kalinli, Ozlem
    Narayanan, Shrikanth
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2452 - 2455
  • [9] Syllable Based Continuous Speech Recognizer With Varied Length Maximum Likelihood Character Segmentation
    Ganesh, Akila A.
    Ravichandran, Chandra
    [J]. 2013 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2013, : 935 - 940
  • [10] Segmentation of continuous speech using phonotactics
    McQueen, JM
    [J]. JOURNAL OF MEMORY AND LANGUAGE, 1998, 39 (01) : 21 - 46