Preparing a Corpus of Dutch Spontaneous Dialogues for Automatic Phonetic Analysis

被引:0
|
作者
Schuppler, Barbara [1 ]
Ernestus, Mirjam [1 ]
Scharenborg, Odette [1 ]
Boves, Lou [1 ]
机构
[1] Radboud Univ Nijmegen, Ctr Language & Speech Technol, Nijmegen, Netherlands
关键词
corpus creation; conversational speech; spontaneous dialogues; reductions; pronunciation variants; automatic phonemic transcription;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents the steps needed to make a corpus of Dutch spontaneous dialogues accessible for automatic phonetic research aimed at increasing our understanding of reduction phenomena and the role of fine phonetic detail. Since the corpus was not created with automatic processing in mind, it needed to be reshaped. The first part of this paper describes the actions needed for this reshaping in some detail. The second part reports the results of a preliminary analysis of the reduction phenomena in the corpus. For this purpose a phonemic transcription of the corpus was created by means of a forced alignment, first with a lexicon of canonical pronunciations and then with multiple pronunciation variants per word. In this study pronunciation variants were generated by applying a large set of phonetic processes that have been implicated in reduction to the canonical pronunciations of the words. This relatively straightforward procedure allows us to produce plausible pronunciation variants and to verify and extend the results of previous reduction studies reported in the literature.
引用
收藏
页码:1638 / 1641
页数:4
相关论文
共 50 条
  • [41] Towards Automatic Recognition of the Negotiation Strategies: Analysis of Human-Human Dialogues
    Koit, Mare
    2014 IEEE INTERNATIONAL SYMPOSIUM ON INNOVATIONS IN INTELLIGENT SYSTEMS AND APPLICATIONS (INISTA 2014), 2014, : 170 - 176
  • [42] Corpus Construction for Deaf Speakers and Analysis by Automatic Speech Recognition
    Kobayashi, Akio
    Yasu, Keiichi
    2023 ASIA PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE, APSIPA ASC, 2023, : 2294 - 2298
  • [43] CORPUS LINGUISTICS AND THE AUTOMATIC-ANALYSIS OF ENGLISH - OOSTDIJK,N
    JAGER, S
    ENGLISH STUDIES, 1993, 74 (03) : 289 - 290
  • [44] Corpus Analysis and Automatic Detection of Emotion-inducing Keywords
    Yuan, Bo
    He, Xiangqing
    Liu, Ying
    SIXTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2013), 2013, 9067
  • [45] A fully automatic method for human corpus callosum MRI analysis
    Liu, H
    Blumenthal, J
    Jeffries, N
    Vaituzis, C
    Zijdenbos, A
    Rapoport, J
    Giedd, J
    NEUROIMAGE, 2001, 13 (06) : S188 - S188
  • [46] Search for lexicosyntactic segmentation and linking indices by an automatic corpus analysis
    Bestgen, Yves
    DISCOURS-REVUE DE LINGUISTIQUE PSYCHOLINGUISTIQUE ET INFORMATIQUE, 2019, (25):
  • [47] Analysis on individual differences in automatic transcription of spontaneous presentations
    Shinozaki, T
    Furui, S
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 729 - 732
  • [48] Semantic Analysis and Automatic Corpus Construction for Entailment Recognition in Medical Texts
    Ben Abacha, Asma
    Duy Dinh
    Mrabet, Yassine
    ARTIFICIAL INTELLIGENCE IN MEDICINE (AIME 2015), 2015, 9105 : 238 - 242
  • [49] Toward Identifying Features for Automatic Gender Detection: A Corpus Creation and Analysis
    Alanazi, Saad Awadh
    IEEE ACCESS, 2019, 7 : 111931 - 111943
  • [50] Analysis of the ORTHOTEL corpus:: the contribution of automatic treatment to the classification of spelling errors
    Aubergé, V
    Ghneim, N
    Belrhali, R
    LANGUE FRANCAISE, 1999, (124): : 90 - 103