On the Automatic Validation of Speech Alignment

Authors
Athanasopoulos, Georgios [1]
Macq, Benoit [1]
Affiliations
[1] Catholic Univ Louvain, ICTEAM, ELEN, Louvain, Belgium
Source
2018 26TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO) | 2018
Keywords
speech alignment; HMM-based forced alignment; dynamic time warping; alignment assessment;
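DOI
Not available
Chinese Library Classification
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology];
Discipline classification codes
0808 ; 0809 ;
Abstract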
The alignment of two utterances is the basis of many speech processing applications. The acoustic user interface of such applications should be capable of detecting insufficient alignment results and identifying the responsible input utterances. In this paper, we discuss the automatic validation of speech alignment and propose two new validation algorithms. The first method relies on locating and matching the syllable nuclei of the aligned utterances. The second method performs syllable-level comparison of the speech signal envelopes in accordance with the alignment time-warping path. Experimental results show that the proposed algorithms perform consistently well and can be effectively applied for the validation of different speech alignment methods.
Pages: 2105-2109
Page count: 5