Analysing fundamental frequency contours and local speech rate in map task dialogs

被引:17
|
作者
Mixdorff, H
Pfitzinger, HR
机构
[1] TFH Berlin Univ Appl Sci, Dept Comp Sci & Media, D-13353 Berlin, Germany
[2] Univ Munich, Dept Phonet & Speech Commun, D-80799 Munich, Germany
关键词
Fujisaki model; perceptual local speech rate; F0; contours; map task;
D O I
10.1016/j.specom.2005.02.019
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The current paper reports first results from the analysis of task-oriented dialogs using a Fujisaki model-based parameterization of F0 contours, as well as a model of the perceptual local speech rate. Two versions of map task style dialogs were examined: (1) the recordings made during the map task proper, (2) readings from scripts of the original dialogs by the same subjects. The first part of this paper presents an analysis of phrase boundaries with respect to form and function. A second issue is the problem of processing fillers, hesitations and repairs within the framework of the Fujisaki model-based analysis. The second part of the paper describes the comparative analysis of spontaneous and read versions of the same dialog fragments with respect to Fujisaki model parameters, contours of the perceptual local speech rate, and other features. In a perception test we asked listeners to identify the speaking style of dialog fragments. Apparently this was possible only for part of the data. Analysis of accent commands and perceptual local speech rate contours still suggested differences between the two speaking styles. The number of accented syllables, the associated accent commands' amplitudes, and the perceptual local speech rate were generally higher in the read than in the spontaneous utterances. These results were almost significant despite the fact that the read version had been well re-enacted by the subjects and therefore did not exactly exhibit typical reading style characteristics. Despite this drawback, the methodology presented here has strong potential for further comparative prosodic studies of speaking styles. (c) 2005 Elsevier B.V. All rights reserved.
引用
收藏
页码:310 / 325
页数:16
相关论文
共 45 条
  • [1] ANALYSIS OF FUNDAMENTAL FREQUENCY CONTOURS IN SPEECH
    LEVITT, H
    RABINER, LR
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1971, 49 (02): : 569 - &
  • [2] CHARACTERIZATION OF FUNDAMENTAL-FREQUENCY CONTOURS OF SPEECH
    MAEDA, S
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1974, 56 : S33 - S33
  • [3] Modeling of Fundamental Frequency Contours for HMM-based Speech Synthesis Representation of fundamental frequency contours for statistical speech synthesis
    Hirose, Keikichi
    [J]. PROCEEDINGS OF 2016 IEEE 13TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP 2016), 2016, : 171 - 176
  • [4] Generating fundamental frequency contours for speech synthesis in Yoruba
    van Niekerk, Daniel R.
    Barnard, Etienne
    [J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 1026 - 1030
  • [5] The role of fundamental frequency contours in the perception of speech against interfering speech
    Binns, Christine
    Culling, John F.
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2007, 122 (03): : 1765 - 1776
  • [6] The Effect of Fundamental Frequency on the Intelligibility of Speech With Flattened Intonation Contours
    Watson, Peter J.
    Schlauch, Robert S.
    [J]. AMERICAN JOURNAL OF SPEECH-LANGUAGE PATHOLOGY, 2008, 17 (04) : 348 - 355
  • [7] Quantitative and structural modeling of voice fundamental frequency contours of speech in Mandarin
    Ni, Jinfu
    Hirose, Keikichi
    [J]. SPEECH COMMUNICATION, 2006, 48 (08) : 989 - 1008
  • [8] A method for automatic extraction of model parameters from fundamental frequency contours of speech
    Narusawa, S
    Minematsu, N
    Hirose, K
    Fujisaki, H
    [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 509 - 512
  • [9] The roles of fundamental frequency contours and sentence context in Mandarin Chinese speech intelligibility
    Wang, Jiuju
    Shu, Hua
    Zhang, Linjun
    Liu, Zhaoxing
    Zhang, Yang
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2013, 134 (01): : EL91 - EL97
  • [10] DISCRIMINATION OF FUNDAMENTAL FREQUENCY CONTOURS IN SYNTHETIC SPEECH - IMPLICATIONS FOR MODELS OF PITCH PERCEPTION
    KLATT, DH
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1973, 53 (01): : 8 - 16