Factors influencing automatic segmental alignment of sociophonetic corpora

被引:8
|
作者
Fromont, Robert [1 ]
Watson, Kevin [1 ,2 ]
机构
[1] Univ Canterbury, New Zealand Inst Language Brain & Behav, Private Bag 4800, Christchurch 8140, New Zealand
[2] Univ Canterbury, Dept Linguist, Private Bag 4800, Christchurch 8140, New Zealand
基金
英国经济与社会研究理事会;
关键词
Alignment; American English; Liverpool English; New Zealand English; sociophonetics;
D O I
10.3366/cor.2016.0101
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
Automatically time-aligning utterances at the segmental level is increasingly common practice in phonetic and sociophonetic work because of the obvious benefits it brings in allowing the efficient scaling up of the amount of speech data that can be analysed. The field is arriving at a set of recommended practices for improving alignment accuracy, but methodological differences across studies (e.g., the use of different languages and different measures of accuracy) often mean that direct comparison of the factors which facilitate or hinder alignment can be difficult. In this paper, following a review of the state of the art in automatic segmental alignment, we test the effects of a number of factors on its accuracy. Namely, we test the effects of: (1) the presence or absence of pause markers in the training data, (2) the presence of overlapping speech or other noise, (3) using training data from single or multiple speakers, (4) using different sampling rates, (5) using pre-trained acoustic models versus models trained 'from scratch', and (6) using different amounts of training data. For each test, we examine three different varieties of English, from New Zealand, the USA and the UK. The paper concludes with some recommendations for automatic segmental alignment in general.
引用
收藏
页码:401 / 431
页数:31
相关论文
共 50 条
  • [41] Fully Automatic Segmentation for Prosodic Speech Corpora
    Hoffmann, Sarah
    Pfister, Beat
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 1389 - 1392
  • [42] Automatic creation of WordNets from parallel corpora
    Oliver, Antoni
    Climent, Salvador
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014, : 1112 - 1116
  • [43] Automatic Computation of Poetic Creativity in Parallel Corpora
    Zuniga, Daniel F.
    Amido, Teresa
    Camargo, Jorge E.
    ADVANCES IN COMPUTING, CCC 2017, 2017, 735 : 710 - 720
  • [44] Automatic Construction of Discourse Corpora for Dialogue Translation
    Wang, Longyue
    Zhang, Xiaojun
    Tu, Zhaopeng
    Way, Andy
    Liu, Qun
    LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2016, : 2748 - 2754
  • [45] An automatic method for generating sense tagged corpora
    Mihalcea, R
    Moldovan, DI
    SIXTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-99)/ELEVENTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE (IAAI-99), 1999, : 461 - 466
  • [46] Robust Automatic Transcription of English Speech Corpora
    Kabir, Ahsanul
    Giurgiu, Mircea
    Barker, Jon
    PROCEEDINGS OF THE 2010 8TH INTERNATIONAL CONFERENCE ON COMMUNICATIONS (COMM), 2010, : 79 - 82
  • [47] Specialized Corpora Processing with Automatic Extraction Tools
    Goncharova, Yuliya
    Sanchez Cardenas, Beatriz
    CORPUS RESOURCES FOR DESCRIPTIVE AND APPLIED STUDIES. CURRENT CHALLENGES AND FUTURE DIRECTIONS: SELECTED PAPERS FROM THE 5TH INTERNATIONAL CONFERENCE ON CORPUS LINGUISTICS (CILC2013), 2013, 95 : 293 - 297
  • [48] Automatic phonetic transcription of large speech corpora
    Van Bael, Christophe
    Boves, Lou
    van den Heuvel, Henk
    Strik, Helmer
    COMPUTER SPEECH AND LANGUAGE, 2007, 21 (04): : 652 - 668
  • [49] Derivations of Factors Influencing Segmental Consumer Behaviors Using the RST Combined with Flow Graph and FCA
    Huang, Chi-Yo
    Yang, Ya-Lan
    Tzeng, Gwo-Hshiung
    Yu, Hsiao-Cheng
    Lee, Hong-Yuh
    Cheng, Shih-Tsunsg
    Lo, Sang-Yeng
    ADVANCES IN INTELLIGENT DECISION TECHNOLOGIES, 2010, 4 : 687 - +
  • [50] Exploring the quality of ecosystem services and the segmental impact of influencing factors in resource-based cities
    Deng, Fan
    Zhu, Shichao
    Guo, Jiaxin
    Sun, Xialing
    JOURNAL OF ENVIRONMENTAL MANAGEMENT, 2025, 375