Modelling representations in speech normalization of prosodic cues

被引:0
|
作者
Chen Si
Caicai Zhang
Puiyin Lau
Yike Yang
Bei Li
机构
[1] The Hong Kong Polytechnic University,Department of Chinese and Bilingual Studies
[2] Hong Kong Polytechnic University-Peking University Research Centre on Chinese Linguistics,Research Centre for Language, Cognition, and Neuroscience
[3] University of Hong Kong,Department of Statistics and Actuarial Science
[4] University of Hong Kong,Department of Chinese Language and Literature
[5] Hong Kong Shue Yan University,undefined
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
The lack of invariance problem in speech perception refers to a fundamental problem of how listeners deal with differences of speech sounds produced by various speakers. The current study is the first to test the contributions of mentally stored distributional information in normalization of prosodic cues. This study starts out by modelling distributions of acoustic cues from a speech corpus. We proceeded to conduct three experiments using both naturally produced lexical tones with estimated distributions and manipulated lexical tones with f0 values generated from simulated distributions. State of the art statistical techniques have been used to examine the effects of distribution parameters in normalization and identification curves with respect to each parameter. Based on the significant effects of distribution parameters, we proposed a probabilistic parametric representation (PPR), integrating knowledge from previously established distributions of speakers with their indexical information. PPR is still accessed during speech perception even when contextual information is present. We also discussed the procedure of normalization of speech signals produced by unfamiliar talker with and without contexts and the access of long-term stored representations.
引用
收藏
相关论文
共 50 条
  • [31] A Comparison of Normalization Techniques Applied to Latent Space Representations for Speech Analytics
    Morchid, Mohamed
    Dufour, Richard
    Matrouf, Driss
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 145 - 149
  • [32] The Effects of Age and Infant Hearing Status on Maternal Use of Prosodic Cues for Clause Boundaries in Speech
    Kondaurova, Maria V.
    Bergeson, Tonya R.
    JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 2011, 54 (03): : 740 - 754
  • [33] Exploring Prosodic Features Modelling for Secondary Emotions Needed for Empathetic Speech Synthesis
    James, Jesin
    Balamurali, B. T.
    Watson, Catherine
    Mixdorff, Hansjoerg
    SENSORS, 2023, 23 (06)
  • [34] Evaluating acoustic representations and normalization for rhoticity classification in children with speech sound disorders
    Benway, Nina R.
    Preston, Jonathan L.
    Salekin, Asif
    Hitchcock, Elaine
    McAllister, Tara
    JASA EXPRESS LETTERS, 2024, 4 (02):
  • [35] Word segmentation with universal prosodic cues
    Endress, Ansgar D.
    Hauser, Marc D.
    COGNITIVE PSYCHOLOGY, 2010, 61 (02) : 177 - 199
  • [36] Interpreting prosodic cues in discourse context
    Brown, Meredith
    Salverda, Anne Pier
    Gunlogson, Christine
    Tanenhaus, Michael K.
    LANGUAGE COGNITION AND NEUROSCIENCE, 2015, 30 (1-2) : 149 - 166
  • [37] Developmental changes in the weighting of prosodic cues
    Seidl, Amanda
    Cristia, Alejandrina
    DEVELOPMENTAL SCIENCE, 2008, 11 (04) : 596 - 606
  • [38] Interactivity in prosodic representations in children
    Goffman, Lisa
    Westover, Stefanie
    JOURNAL OF CHILD LANGUAGE, 2013, 40 (05) : 1032 - 1056
  • [39] Processing Prosodic Cues with Cochlear Implants
    Meister, H.
    SPRACHE-STIMME-GEHOR, 2011, 35 (03): : E99 - E104
  • [40] PROSODIC AND NON-PROSODIC COHESION IN SPEECH AND WRITING
    DAVIES, M
    WORD-JOURNAL OF THE INTERNATIONAL LINGUISTIC ASSOCIATION, 1989, 40 (1-2): : 255 - 262