Factors influencing automatic segmental alignment of sociophonetic corpora

被引:8
|
作者
Fromont, Robert [1 ]
Watson, Kevin [1 ,2 ]
机构
[1] Univ Canterbury, New Zealand Inst Language Brain & Behav, Private Bag 4800, Christchurch 8140, New Zealand
[2] Univ Canterbury, Dept Linguist, Private Bag 4800, Christchurch 8140, New Zealand
基金
英国经济与社会研究理事会;
关键词
Alignment; American English; Liverpool English; New Zealand English; sociophonetics;
D O I
10.3366/cor.2016.0101
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
Automatically time-aligning utterances at the segmental level is increasingly common practice in phonetic and sociophonetic work because of the obvious benefits it brings in allowing the efficient scaling up of the amount of speech data that can be analysed. The field is arriving at a set of recommended practices for improving alignment accuracy, but methodological differences across studies (e.g., the use of different languages and different measures of accuracy) often mean that direct comparison of the factors which facilitate or hinder alignment can be difficult. In this paper, following a review of the state of the art in automatic segmental alignment, we test the effects of a number of factors on its accuracy. Namely, we test the effects of: (1) the presence or absence of pause markers in the training data, (2) the presence of overlapping speech or other noise, (3) using training data from single or multiple speakers, (4) using different sampling rates, (5) using pre-trained acoustic models versus models trained 'from scratch', and (6) using different amounts of training data. For each test, we examine three different varieties of English, from New Zealand, the USA and the UK. The paper concludes with some recommendations for automatic segmental alignment in general.
引用
收藏
页码:401 / 431
页数:31
相关论文
共 50 条
  • [1] Building parallel corpora by automatic title alignment
    Yang, CC
    Li, KW
    DIGITAL LIBRARIES: PEOPLE, KNOWLEDGE, AND TECHNOLOGY, PROCEEDINGS, 2002, 2555 : 328 - 339
  • [2] FACTORS INFLUENCING THE SEGMENTAL DEPOSITION OF ATHEROMATOUS MATERIAL
    STEPHENSON, SE
    MANN, GV
    YOUNGER, R
    SCOTT, HW
    ARCHIVES OF SURGERY, 1962, 84 (01) : 49 - 55
  • [3] Analysis of factors influencing alignment accuracy in coaxial alignment system of grating diffraction
    Chen, Weiming
    Hu, Song
    Liu, Yeyi
    Guo, Li
    Guo, Jingping
    Weixi Jiagong Jishu/Microfabrication Technology, 2000, (01): : 45 - 50
  • [4] NP alignment in bilingual corpora
    Recski, Gabor
    Rung, Andras
    Zsedar, Atila
    Kornai, Andras
    LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010, : 3379 - 3382
  • [5] FACTORS INFLUENCING THE DEVELOPMENT AND TRANSFER OF AUTOMATIC PROCESSING
    KRAMER, AF
    STRAYER, DL
    BUCKLEY, J
    PROCEEDINGS OF THE HUMAN FACTORS SOCIETY 33RD ANNUAL MEETING, VOL 2, 1989, : 1248 - 1252
  • [6] A sociophonetic account of onset /s/ weakening in Salvadoran Spanish: Instrumental and segmental analyses
    Brogan, Franny D.
    Bolyanatz, Mariska A.
    LANGUAGE VARIATION AND CHANGE, 2018, 30 (02) : 203 - 230
  • [7] Factors influencing FRS 114 segmental reporting: evidence from Malaysia
    Talha, Mohammad
    Salim, Abdullah Sallehhuddin Abdullah
    Fallatah, Yaser Ahmad
    INTERNATIONAL JOURNAL OF MANAGERIAL AND FINANCIAL ACCOUNTING, 2008, 1 (02) : 184 - 198
  • [8] FACTORS INFLUENCING THE PENETRATION OF WIRES INTO THE NEURAL CANAL DURING SEGMENTAL WIRING
    ZINDRICK, MR
    KNIGHT, GW
    BUNCH, WH
    MILLER, MC
    BUTLER, DM
    LORENZ, M
    BEHAL, R
    JOURNAL OF BONE AND JOINT SURGERY-AMERICAN VOLUME, 1989, 71A (05): : 742 - 750
  • [9] Factors influencing the course and the response to treatment in primary focal segmental glomerulosclerosis
    Alexopoulos, E
    Stangou, M
    Papagianni, A
    Pantzaki, A
    Papadimitriou, M
    NEPHROLOGY DIALYSIS TRANSPLANTATION, 2000, 15 (09) : 1348 - 1356
  • [10] Sentence alignment for monolingual comparable corpora
    Barzilay, R
    Elhadad, N
    PROCEEDINGS OF THE 2003 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, 2003, : 25 - 32