Factors influencing automatic segmental alignment of sociophonetic corpora

被引:8
|
作者
Fromont, Robert [1 ]
Watson, Kevin [1 ,2 ]
机构
[1] Univ Canterbury, New Zealand Inst Language Brain & Behav, Private Bag 4800, Christchurch 8140, New Zealand
[2] Univ Canterbury, Dept Linguist, Private Bag 4800, Christchurch 8140, New Zealand
基金
英国经济与社会研究理事会;
关键词
Alignment; American English; Liverpool English; New Zealand English; sociophonetics;
D O I
10.3366/cor.2016.0101
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
Automatically time-aligning utterances at the segmental level is increasingly common practice in phonetic and sociophonetic work because of the obvious benefits it brings in allowing the efficient scaling up of the amount of speech data that can be analysed. The field is arriving at a set of recommended practices for improving alignment accuracy, but methodological differences across studies (e.g., the use of different languages and different measures of accuracy) often mean that direct comparison of the factors which facilitate or hinder alignment can be difficult. In this paper, following a review of the state of the art in automatic segmental alignment, we test the effects of a number of factors on its accuracy. Namely, we test the effects of: (1) the presence or absence of pause markers in the training data, (2) the presence of overlapping speech or other noise, (3) using training data from single or multiple speakers, (4) using different sampling rates, (5) using pre-trained acoustic models versus models trained 'from scratch', and (6) using different amounts of training data. For each test, we examine three different varieties of English, from New Zealand, the USA and the UK. The paper concludes with some recommendations for automatic segmental alignment in general.
引用
收藏
页码:401 / 431
页数:31
相关论文
共 50 条
  • [21] Discussion on quality-influencing factors in continuous automatic production line
    Lai, Zhicheng
    Li, Feng
    Baopo Qicai/Explosive Materials, 1999, 28 (02): : 11 - 12
  • [22] Examining Factors Influencing the Viability of Automatic Acoustic Analysis of Child Speech
    Knowles, Thea
    Clayards, Meghan
    Sonderegger, Morgan
    JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 2018, 61 (10): : 2487 - 2501
  • [23] A new Alignment algorithm for Parallel Corpora of Japanese and Chinese
    Quan, Yuhua
    Jin, Ying-hao
    Quan, Jingji
    2011 INTERNATIONAL CONFERENCE ON ELECTRONICS, COMMUNICATIONS AND CONTROL (ICECC), 2011, : 3498 - 3501
  • [24] CATS: A Tool for Customized Alignment of Text Simplification Corpora
    Stajner, Sanja
    Franco-Salvador, Marc
    Rosso, Paolo
    Ponzetto, Simone Paolo
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 3895 - 3903
  • [25] Citation Matching in Sanskrit Corpora Using Local Alignment
    Prasad, Abhinandan S.
    Rao, Shrisha
    SANSKRIT COMPUTATIONAL LINGUISTICS, 2010, 6465 : 124 - 136
  • [26] Parallel Sentence Alignment from Biomedical Comparable Corpora
    Cardon, Remi
    Grabar, Natalia
    DIGITAL PERSONALIZED HEALTH AND MEDICINE, 2020, 270 : 362 - 366
  • [27] Studying the Factors Influencing Automatic User Task Detection on the Computer Desktop
    Rath, Andreas S.
    Devaurs, Didier
    Lindstaedt, Stefanie N.
    SUSTAINING TEL: FROM INNOVATION TO LEARNING AND PRACTICE, 2010, 6383 : 292 - +
  • [28] A Quantitative Analysis and Sentence Alignment for Parallel Corpora of ShiJi
    Liu, Ying
    Wang, Nan
    Yuan, Bo
    JOURNAL OF QUANTITATIVE LINGUISTICS, 2016, 23 (01) : 71 - 108
  • [29] Context-based sentence alignment in parallel corpora
    Bicici, Ergun
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, 2008, 4919 : 434 - 444
  • [30] Classification and Selection of Translation Candidates for Parallel Corpora Alignment
    Kavitha, K. M.
    Gomes, Luis
    Aires, Jose
    Lopes, Jose Gabriel P.
    PROGRESS IN ARTIFICIAL INTELLIGENCE-BK, 2015, 9273 : 723 - 734