Modelling representations in speech normalization of prosodic cues

被引:0
|
作者
Chen Si
Caicai Zhang
Puiyin Lau
Yike Yang
Bei Li
机构
[1] The Hong Kong Polytechnic University,Department of Chinese and Bilingual Studies
[2] Hong Kong Polytechnic University-Peking University Research Centre on Chinese Linguistics,Research Centre for Language, Cognition, and Neuroscience
[3] University of Hong Kong,Department of Statistics and Actuarial Science
[4] University of Hong Kong,Department of Chinese Language and Literature
[5] Hong Kong Shue Yan University,undefined
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
The lack of invariance problem in speech perception refers to a fundamental problem of how listeners deal with differences of speech sounds produced by various speakers. The current study is the first to test the contributions of mentally stored distributional information in normalization of prosodic cues. This study starts out by modelling distributions of acoustic cues from a speech corpus. We proceeded to conduct three experiments using both naturally produced lexical tones with estimated distributions and manipulated lexical tones with f0 values generated from simulated distributions. State of the art statistical techniques have been used to examine the effects of distribution parameters in normalization and identification curves with respect to each parameter. Based on the significant effects of distribution parameters, we proposed a probabilistic parametric representation (PPR), integrating knowledge from previously established distributions of speakers with their indexical information. PPR is still accessed during speech perception even when contextual information is present. We also discussed the procedure of normalization of speech signals produced by unfamiliar talker with and without contexts and the access of long-term stored representations.
引用
收藏
相关论文
共 50 条
  • [21] Prosodic cues of an onomatopoetic word for agent size in infant-directed speech
    Kajikawa, Sachiyo
    Yoshimura, Asami
    Kumasaka, Yoshitaka
    INTERNATIONAL JOURNAL OF PSYCHOLOGY, 2023, 58 : 251 - 252
  • [22] Brain potentials indicate immediate use of prosodic cues in natural speech processing
    Steinhauer, K
    Alter, K
    Friederici, AD
    NATURE NEUROSCIENCE, 1999, 2 (02) : 191 - 196
  • [23] Automatic Evaluation of Parkinson's Speech - Acoustic, Prosodic and Voice Related Cues
    Bocklet, Tobias
    Steidl, Stefan
    Noeth, Elmar
    Skodda, Sabine
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 1148 - 1152
  • [24] Brain potentials indicate immediate use of prosodic cues in natural speech processing
    Karsten Steinhauer
    Kai Alter
    Angela D. Friederici
    Nature Neuroscience, 1999, 2 : 191 - 196
  • [25] Controllable speech synthesis by learning discrete phoneme-level prosodic representations
    Ellinas, Nikolaos
    Christidou, Myrsini
    Vioni, Alexandra
    Sung, June Sig
    Chalamandaris, Aimilios
    Tsiakoulis, Pirros
    Mastorocostas, Paris
    SPEECH COMMUNICATION, 2023, 146 : 22 - 31
  • [26] Decoupling segmental and prosodic cues of non-native speech through vector quantization
    Quamer, Waris
    Das, Anurag
    Gutierrez-Osuna, Ricardo
    INTERSPEECH 2023, 2023, : 2083 - 2087
  • [27] Prosodic cues to syntactic boundaries
    Inst of Psychology, The Chinese Acad of Sciences, Beijing, China
    Shengxue Xuebao, 5 (414-421):
  • [28] Prosodic phrase and cues to parse it
    Huang, XJ
    Yang, YF
    INTERNATIONAL JOURNAL OF PSYCHOLOGY, 2004, 39 (5-6) : 101 - 102
  • [29] PROSODIC CUES FOR SEGMENTS - PREFACE
    KOHLER, KJ
    PHONETICA, 1986, 43 (1-3) : 7 - 10
  • [30] Analysis of prosodic features: towards modelling of emotional and pragmatic attributes of speech
    Adell, Jordi
    Bonafonte, Antonio
    Escudero, David
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2005, (35): : 277 - 283