Combining Acoustic, Lexical, and Syntactic Evidence for Automatic Unsupervised Prosody Labeling

Authors
Ananthakrishnan, Sankaranarayanan [1]
Narayanan, Shrikanth [1]
Affiliations
[1] Univ So Calif, Dept Elect Engn, Speech Anal & Interpretat Lab, Los Angeles, CA 90089 USA
Keywords
prosody recognition; accent; stress; prominence; prosodic boundary; spoken language processing
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Automatic labeling of prosodic events in speech has potentially significant implications for spoken language processing applications, and has received much attention over the years, especially after the introduction of annotation standards such as ToBI. Current labeling techniques are based on supervised learning: training the system requires a corpus annotated with the prosodic labels of interest. However, creating such resources is expensive and time-consuming. In this paper, we examine an unsupervised labeling algorithm for accent (prominence) and prosodic phrase boundary detection at the linguistic syllable level, and evaluate its performance on a standard, manually annotated corpus. We obtain labeling accuracies of 77.8% and 88.5% for the accent and boundary labeling tasks, respectively. These figures compare well against previously reported performance levels for supervised labelers.
Pages: 297-300
Page count: 4
Related papers
  • [1] Acoustic-syntactic maximum entropy model for automatic prosody labeling
    Rangarajan, Vivek
    Narayanan, Shrikanth
    Bangalore, Srinivas
    [J]. 2006 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, 2006, : 74 - +
  • [2] Exploiting acoustic and syntactic features for automatic prosody labeling in a maximum entropy framework
    Sridhar, Vivek Kumar Rangarajan
    Bangalore, Srinivas
    Narayanan, Shrikanth S.
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (04): : 797 - 811
  • [3] Automatic prosodic event detection using acoustic, lexical, and syntactic evidence
    Ananthakrishnan, Sankaranarayanan
    Narayanan, Shrikanth S.
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (01): : 216 - 228
  • [4] Automatic prosody labeling using both text and acoustic information
    Ma, XJ
    Zhang, W
    Shi, Q
    Zhu, WB
    Shen, LQ
    [J]. 2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 516 - 519
  • [5] AUTOMATIC PROSODY BOUNDARY LABELING OF MANDARIN USING BOTH TEXT AND ACOUSTIC INFORMATION
    Ni, Chongjia
    Liu, Wenju
    Xu, Bo
    [J]. 2008 6TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2008, : 354 - 357
  • [6] Unsupervised Adaptation of Categorical Prosody Models for Prosody Labeling and Speech Recognition
    Ananthakrishnan, Sankaranarayanan
    Narayanan, Shrikanth
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2009, 17 (01): : 138 - 149
  • [7] An automatic prosody labeling system using ANN-based syntactic-prosodic model and GMM-based acoustic-prosodic model
    Chen, K
    Hasegawa-Johnson, M
    Cohen, A
    [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 509 - 512
  • [8] Unsupervised joint prosody labeling and modeling for Mandarin speech
    Chiang, Chen-Yu
    Chen, Sin-Horng
    Yu, Hsiu-Min
    Wang, Yih-Ru
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2009, 125 (02): : 1164 - 1183
  • [9] Combining Acoustic-Prosodic, Lexical, and Phonotactic Features for Automatic Deception Detection
    Levitan, Sarah Ita
    An, Guozhen
    Ma, Min
    Levitan, Rivka
    Rosenberg, Andrew
    Hirschberg, Julia
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2006 - 2010
  • [10] Prosody as syntactic evidence: The view from Mayan
    Royer, Justin
    [J]. NATURAL LANGUAGE & LINGUISTIC THEORY, 2022, 40 (01) : 239 - 284