An Automatic System for Detecting Prosodic Prominence in American English Continuous Speech

Cited by: 0
Authors
Tamburini, F. [1, 2]
Caini, C. [3]
Affiliations
[1] Univ Bologna, Ctr Interfacolta Linguist Teor & Appl, Bologna, Italy
[2] Dipartimento Elettr Informat & Sistemist, Bologna, Italy
[3] Univ Bologna, Dipartimento Elettr Informat & Sistemist, Bologna, Italy
Keywords
prosody; automatic feature extraction; prominence; stress accent; pitch accent
DOI
10.1007/s10772-005-4760-z
Chinese Library Classification (CLC) number
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology]
Discipline classification code
0808; 0809
Abstract
A precise identification of prosodic phenomena and the construction of tools able to properly manage such phenomena are essential steps to disambiguate the meaning of certain utterances. In particular, they are useful for a wide variety of tasks: automatic recognition of spontaneous speech, automatic enhancement of speech generation systems, solving ambiguities in natural language interpretation, the construction of large annotated language resources, such as prosodically tagged speech corpora, and teaching languages to foreign students using Computer Aided Language Learning (CALL) systems. This paper presents a study on the automatic detection of prosodic prominence in continuous speech, with particular reference to American English, but with good prospects of application to other languages. Prosodic prominence involves two different prosodic features: pitch accent and stress accent. Pitch accent is acoustically connected with fundamental frequency (F0) movements and overall syllable energy, whereas stress exhibits a strong correlation with syllable nuclei duration and mid-to-high-frequency emphasis. This paper shows that a careful measurement of these acoustic parameters, as well as the identification of their connection to prosodic parameters, makes it possible to build an automatic system capable of identifying prominent syllables in utterances with performance comparable to the inter-human agreement reported in the literature. Two different prominence detectors were studied and developed: the first uses a training corpus to set its thresholds, while the second uses a purely unsupervised method. In both cases, it is worth stressing that only acoustic parameters derived directly from speech waveforms are exploited.
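The two detector styles mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the feature keys (`duration`, `emphasis`, `f0_move`, `energy`), the equal-weight sum of z-scores, the accuracy-based threshold fit, and the local-maximum rule for the unsupervised case are all assumptions chosen only to show the general idea of combining the four acoustic cues per syllable.

```python
from statistics import mean, pstdev


def zscores(values):
    """Normalize values to zero mean and unit variance within one utterance."""
    mu = mean(values)
    sigma = pstdev(values)
    if sigma == 0:
        return [0.0 for _ in values]
    return [(v - mu) / sigma for v in values]


def prominence_scores(syllables):
    """Combine four acoustic cues into one score per syllable.

    Each syllable is a dict with hypothetical keys: 'duration' (nucleus
    duration), 'emphasis' (mid-to-high-frequency emphasis), 'f0_move'
    (size of the F0 movement) and 'energy' (overall syllable energy).
    The equal-weight sum is an assumption, not the paper's formula.
    """
    keys = ("duration", "emphasis", "f0_move", "energy")
    norm = {k: zscores([s[k] for s in syllables]) for k in keys}
    # Stress accent cue: nucleus duration plus spectral emphasis.
    stress = [d + e for d, e in zip(norm["duration"], norm["emphasis"])]
    # Pitch accent cue: F0 movement plus overall energy.
    pitch = [f + e for f, e in zip(norm["f0_move"], norm["energy"])]
    return [s + p for s, p in zip(stress, pitch)]


def detect_with_threshold(scores, threshold):
    """Flag syllables whose score exceeds a fixed threshold."""
    return [score > threshold for score in scores]


def fit_threshold(scores, labels):
    """Supervised variant: pick the threshold that best matches the labels."""
    candidates = [min(scores) - 1.0] + sorted(set(scores))

    def n_correct(t):
        return sum((s > t) == lab for s, lab in zip(scores, labels))

    return max(candidates, key=n_correct)


def detect_local_maxima(scores):
    """Unsupervised variant: flag syllables that outscore both neighbours."""
    flags = []
    for i, s in enumerate(scores):
        left = scores[i - 1] if i > 0 else float("-inf")
        right = scores[i + 1] if i < len(scores) - 1 else float("-inf")
        flags.append(s > left and s > right)
    return flags


# Toy utterance with made-up feature values for four syllables.
utterance = [
    {"duration": 0.05, "emphasis": 0.10, "f0_move": 1.0, "energy": 0.2},
    {"duration": 0.15, "emphasis": 0.60, "f0_move": 8.0, "energy": 0.9},
    {"duration": 0.06, "emphasis": 0.15, "f0_move": 1.5, "energy": 0.3},
    {"duration": 0.14, "emphasis": 0.50, "f0_move": 7.0, "energy": 0.8},
]
scores = prominence_scores(utterance)
```

In this sketch the supervised path would call `fit_threshold` on labelled syllables and then `detect_with_threshold` on new ones, while `detect_local_maxima` needs no labels. Either way, the inputs are limited to acoustic measurements, in line with the abstract's closing remark.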
Pages: 33-44
Page count: 12
Related papers
50 in total
  • [21] Automatic detection of a prosodic hierarchy in a journalistic speech corpus
    Gendrot, Cedric
    Gerdes, Kim
    Adda-Decker, Martine
    LANGUE FRANCAISE, 2016, (191): : 123 - +
  • [22] Prosodic Processing for the Automatic Synthesis of Emotional Russian Speech
    Kaliyev, Arman
    Matveev, Yuri N.
    Lyakso, Elena E.
    Rybin, Sergey V.
    2018 IEEE INTERNATIONAL CONFERENCE QUALITY MANAGEMENT, TRANSPORT AND INFORMATION SECURITY, INFORMATION TECHNOLOGIES (IT&QM&IS), 2018, : 653 - 655
  • [23] Coarticulatory vowel nasalization in American English: Data of individual differences in acoustic realization of vowel nasalization as a function of prosodic prominence and boundary
    Kim, Daejin
    Kim, Sahyang
    DATA IN BRIEF, 2019, 27
  • [24] The relationship between the prosodic prominence of speech and the degree of intelligibility in preadolescent asperger boys' interaction
    Wiklund, Mari
    EUROPEAN CHILD & ADOLESCENT PSYCHIATRY, 2013, 22 : S219 - S220
  • [25] A SYSTEM OF RECORDING AMERICAN-ENGLISH SPEECH SOUNDS
    LORE, JI
    VOLTA REVIEW, 1961, 63 (09) : 433 - 434
  • [26] Levodopa-Based Changes on Vocalic Speech Movements during Prosodic Prominence Marking
    Thies, Tabea
    Muecke, Doris
    Dano, Richard
    Barbe, Michael T.
    BRAIN SCIENCES, 2021, 11 (05)
  • [27] Cortical processing of discrete prosodic patterns in continuous speech
    G. Nike Gnanateja
    Kyle Rupp
    Fernando Llanos
    Jasmine Hect
    James S. German
    Tobias Teichert
    Taylor J. Abel
    Bharath Chandrasekaran
    Nature Communications, 16 (1)
  • [28] PARENTHETICAL - A SPECIAL TYPE OF PROSODIC REDUCTION IN CONTINUOUS SPEECH
    Tseng, Chiu-yu
    Chen, Helen Kai-yun
    Chen, Yen-Hsing
    2018 ORIENTAL COCOSDA - INTERNATIONAL CONFERENCE ON SPEECH DATABASE AND ASSESSMENTS, 2018, : 21 - 26
  • [29] DETECTING NASALS IN CONTINUOUS SPEECH
    MERMELSTEIN, P
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1977, 61 (02): : 581 - 587
  • [30] DETECTING NASALS IN CONTINUOUS SPEECH
    MERMELSTEIN, P
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1975, 58 : S97 - S97