The statistical analysis of acoustic phonetic data: exploring differences between spoken Romance languages

被引:14
|
作者
Pigoli, Davide [1 ]
Hadjipantelis, Pantelis Z. [2 ]
Coleman, John S. [3 ]
Aston, John A. D. [4 ]
机构
[1] Kings Coll London, London, England
[2] Univ Calif Davis, Davis, CA 95616 USA
[3] Univ Oxford, Oxford, England
[4] Univ Cambridge, Cambridge, England
基金
英国工程与自然科学研究理事会; 英国艺术与人文研究理事会;
关键词
Functional data analysis; Object data; Quantitative linguistics; Spectrograms; FUNCTIONAL DATA; SPECTRAL-ANALYSIS; FREQUENCY; SEPARABILITY; DIFFUSION; INFERENCE; MODEL; JOINT; JIVE;
D O I
10.1111/rssc.12258
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The historical and geographical spread from older to more modern languages has long been studied by examining textual changes and in terms of changes in phonetic transcriptions. However, it is more difficult to analyse language change from an acoustic point of view, although this is usually the dominant mode of transmission. We propose a novel analysis approach for acoustic phonetic data, where the aim will be to model the acoustic properties of spoken words statistically. We explore phonetic variation and change by using a time-frequency representation, namely the log-spectrograms of speech recordings. We identify time and frequency covariance functions as a feature of the language; in contrast, mean spectrograms depend mostly on the particular word that has been uttered. We build models for the mean and covariances (taking into account the restrictions placed on the statistical analysis of such objects) and use these to define a phonetic transformation that models how an individual speaker would sound in a different language, allowing the exploration of phonetic differences between languages. Finally, we map back these transformations to the domain of sound recordings, enabling us to listen to the output of the statistical analysis. The approach proposed is demonstrated by using recordings of the words corresponding to the numbers from 1 to 10 as pronounced by speakers from five different Romance languages.
引用
收藏
页码:1103 / 1145
页数:43
相关论文
共 50 条
  • [31] Exploring Spatial Differences Between 2 US Firearm Mortality Data Sets in 2017
    Herring, Meghan K.
    Kersten, Cassandra A.
    [J]. PREVENTING CHRONIC DISEASE, 2020, 17
  • [32] Active crack evaluation in concrete beams using statistical analysis of acoustic emission data
    Shahidan, S.
    Pullin, R.
    Bunnori, N. M.
    Zuki, S. S. M.
    [J]. INSIGHT, 2017, 59 (01) : 24 - 31
  • [33] Statistical Mapping between Articulatory and Acoustic Data for an Ultrasound-based Silent Speech Interface
    Hueber, Thomas
    Benaroya, Elie-Laurent
    Denby, Bruce
    Chollet, Gerard
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 600 - +
  • [34] Talk Time Differences Between Interregional and IntraregionalCalls to a Crisis Helpline: Statistical Analysis
    Turkington, Robin
    Potts, Courtney
    Mulvenna, Maurice
    Bond, Raymond
    O'Neill, Siobhan
    Ennis, Edel
    Hardcastle, Katie
    Scowcroft, Elizabeth
    Moore, Ciaran
    Hamra, Louise
    [J]. JMIR MENTAL HEALTH, 2024, 11
  • [35] An alternative statistical test on the differences between two means in real biochemical analysis
    Hu, YZ
    Karnes, HT
    [J]. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 1999, 45 (1-2) : 257 - 266
  • [36] Exploring uses of persistent homology for statistical analysis of landmark-based shape data
    Gamble, Jennifer
    Heo, Giseon
    [J]. JOURNAL OF MULTIVARIATE ANALYSIS, 2010, 101 (09) : 2184 - 2199
  • [37] Statistical approaches to identifying significant differences in predictive performance between machine learning and classical statistical models for survival data
    Nasejje, Justine B.
    Whata, Albert
    Chimedza, Charles
    [J]. PLOS ONE, 2022, 17 (12):
  • [39] Exploring the tradeoff between data privacy and utility with a clinical data analysis use case
    Im, Eunyoung
    Kim, Hyeoneui
    Lee, Hyungbok
    Jiang, Xiaoqian
    Kim, Ju Han
    [J]. BMC MEDICAL INFORMATICS AND DECISION MAKING, 2024, 24 (01)
  • [40] Multivariate Statistical and Correlation Analysis between Acoustic and Geotechnical Variables in Soil Compression Tests Monitored by the Acoustic Emission Technique
    Garcia-Ros, Gonzalo
    Villalva-Leon, Danny Xavier
    Castro, Enrique
    Sanchez-Perez, Juan Francisco
    Valenzuela, Julio
    Conesa, Manuel
    [J]. MATHEMATICS, 2023, 11 (19)