Empty speech pause detection in spontaneous speech

被引:0
|
作者
Stejskal, Vojtech
Bourbakis, Nikolaos
Esposito, Anna
机构
关键词
MODELS;
D O I
10.1109/ICTAI.2009.90
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This work describes two new pause detection algorithms and compare their performance with four standard Voice Activity Detection (VAD) methods represented by the adaptive Long Term Spectral Divergence (LTSD) algorithm, the Likelihood Ratio Test (LRT) algorithm, the Neural Network thresholding and G.729. The proposed algorithms exploit the concept of adaptation in order to handle adverse conditions and spontaneous speech properties. The lest data are recordings of spontaneous speech made in noisy environments. The experimental results show that the pet:fill-mance of proposed algorithms on noisy and even artificially cleaned speech are superior than that achieved by standard methods reported in literature.
引用
收藏
页码:237 / 242
页数:6
相关论文
共 50 条
  • [21] Speech Recognition with Word Fragment Detection Using Prosody Features for Spontaneous Speech
    Yeh, Jui-Feng
    Yen, Ming-Chi
    APPLIED MATHEMATICS & INFORMATION SCIENCES, 2012, 6 (02): : 669S - 675S
  • [22] Pause Insertion in Assamese Synthesized Speech Using Speech Specific Features
    Sharma, Bidisha
    Prasanna, S. R. Mahadeva
    2017 TWENTY-THIRD NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2017,
  • [23] Speech and pause characteristics following speech rate reduction in hypokinetic dysarthria
    Hammen, VL
    Yorkston, KM
    JOURNAL OF COMMUNICATION DISORDERS, 1996, 29 (06) : 429 - 445
  • [24] PAUSE FREQUENCY IN FLUENT AND NONFLUENT SPEECH
    LOVE, LR
    CHRISTEN.JM
    STARBUCK, HB
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1972, 51 (01): : 122 - &
  • [25] PAUSE LOCI IN STUTTERED AND NORMAL SPEECH
    WINGATE, ME
    JOURNAL OF FLUENCY DISORDERS, 1984, 9 (03) : 227 - 235
  • [26] Automatic Pause Marking for Speech Synthesis
    Singh, Loitongbam Gyanendro
    Adiga, Nagaraj
    Sharma, Bidisha
    Singh, Sanasam Ranbir
    Prasanna, S. R. M.
    TENCON 2017 - 2017 IEEE REGION 10 CONFERENCE, 2017, : 1790 - 1794
  • [27] Modeling pause for the synthesis of Kazakh speech
    Kaliyev, Arman
    Rybin, Sergey, V
    Matveev, Yuri N.
    Kaziyeva, Nazym
    Burambayeva, Nursaule
    ICEMIS'18: PROCEEDINGS OF THE FOURTH INTERNATIONAL CONFERENCE ON ENGINEERING AND MIS, 2018,
  • [28] A NEW DEVICE FOR SPEECH PAUSE ANALYSIS
    REICH, B
    SHARMA, C
    JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 1970, 18 (01): : 68 - &
  • [29] Speech-to-text and speech-to-speech summarization of spontaneous speech
    Furui, S
    Kikuchi, T
    Shinnaka, Y
    Hori, C
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2004, 12 (04): : 401 - 408
  • [30] Dementia Detection by Analyzing Spontaneous Mandarin Speech
    Liu, Zhaoci
    Guo, Zhiqiang
    Ling, Zhenhua
    Wang, Shijin
    Jin, Lingjing
    Li, Yunxia
    2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 289 - 296