Empty speech pause detection in spontaneous speech

Authors
Stejskal, Vojtech
Bourbakis, Nikolaos
Esposito, Anna
Keywords
MODELS;
DOI
10.1109/ICTAI.2009.90
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This work describes two new pause detection algorithms and compares their performance with four standard Voice Activity Detection (VAD) methods, represented by the adaptive Long Term Spectral Divergence (LTSD) algorithm, the Likelihood Ratio Test (LRT) algorithm, neural network thresholding, and G.729. The proposed algorithms exploit the concept of adaptation in order to handle adverse conditions and the properties of spontaneous speech. The test data are recordings of spontaneous speech made in noisy environments. The experimental results show that the performance of the proposed algorithms on noisy and even artificially cleaned speech is superior to that achieved by the standard methods reported in the literature.
Pages: 237-242
Page count: 6