Automatic utterance segmentation tool for speech corpus

被引:0
|
作者
Ozawa, Mitsuhiro [1 ]
Tsuge, Satoru [2 ]
Shishibori, Masami [2 ]
Kita, Kenji [3 ]
Fukumi, Minoru [2 ]
Ren, Fuji [4 ]
Kuroiwa, Shingo [2 ]
机构
[1] Univ Tokushima, Grad Sch Adv Technol & Sci, Tokushima, Japan
[2] Univ Tokushima, Inst Technol & Sci, Tokushima, Japan
[3] Univ Tokushima, Ctr Adv Informat Technol, Tokushima, Japan
[4] Univ Tokushima, Beijing Univ Posts & Telecommun, Inst Technol & Sci, Tokushima, Japan
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, we collect the speech data for investigating an intra-speakers' speech variability over a short and long time. In general, to reduce the load of speakers, the speech data are collected as one file from collecting start to collecting end. Hence, there are some noises, non-speech sections and mistaken sections in this file. Consequently, we must segment this file into individual utterances and select the useful utterances. This process requires a lot of time and efforts. In this paper, we propose an automatic utterance segmentation tool for dividing the collected speech data. The proposed tool is composed of four processes, which are a voice activity detection, speech recognition, a DP matching, and a correct of speech section. For evaluating the proposed tool, we conduct the evaluation experiments using a female speaker's speech data in our corpus. Experimental results show that the proposed method can reduce a filing time by 90% compared to a manual filing. In This paper, first, we introduced the large speech corpus. This speech corpus contains is the speech data collected by specific speaker over long and short time periods. And, we explained the automatic utterance segmentation tool which we made in the case of corpus build. And inspected the validity. As a result, it was demonstrated that the automatic utterance segmentation tool was high-performance. Furthermore, it was demonstrated that speech corpus build became simple by using the automatic utterance segmentation tool.
引用
收藏
页码:401 / +
页数:2
相关论文
共 50 条
  • [31] On the Influence of Automatic Segmentation and Clustering in Automatic Speech Recognition
    Lopez-Otero, Paula
    Docio-Fernandez, Laura
    Garcia-Mateo, Carmen
    Cardenal-Lopez, Antonio
    ADVANCES IN SPEECH AND LANGUAGE TECHNOLOGIES FOR IBERIAN LANGUAGES, 2012, 328 : 49 - 58
  • [32] EXPERIMENTS IN AUTOMATIC SEGMENTATION OF CONTINUOUS SPEECH
    DEMORI, R
    JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 1974, 22 (04): : 286 - 286
  • [33] AUTOMATIC PHOEMIC SEGMENTATION OF SPEECH BY RULE
    DIXON, NR
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1977, 62 : S37 - S37
  • [34] Automatic linguistic segmentation of conversational speech
    Stolcke, A
    Shriberg, E
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1005 - 1008
  • [35] Automatic speech segmentation based on HMM
    Kroul, Martin
    RADIOENGINEERING, 2007, 16 (02) : 56 - 61
  • [36] EXPERIMENTS IN AUTOMATIC SEGMENTATION OF CONTINUOUS SPEECH
    DEMORI, R
    ACUSTICA, 1976, 34 (03): : 158 - 166
  • [37] AUTOMATIC SEGMENTATION OF SPEECH INTO SYLLABIC UNITS
    MERMELSTEIN, P
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1975, 58 (04): : 880 - 883
  • [38] On the robust automatic segmentation of spontaneous speech
    Petek, B
    Andersen, O
    Dalsgaard, P
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 913 - 916
  • [39] Robust parameters for automatic segmentation of speech
    SaiJayram, AKV
    Ramasubramanian, V
    Sreenivas, TV
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 513 - 516
  • [40] Automatic phone segmentation of expressive speech
    Charonnat, Laure
    Vidal, Gaelle
    Boeffard, Olivier
    SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008, : 2376 - 2379