Filled Pauses and Lengthenings Detection Based on the Acoustic Features for the Spontaneous Russian Speech

被引:0
|
作者
Verkhodanova, Vasilisa [1 ]
Shapranov, Vladimir [2 ]
机构
[1] SPIIRAS, 39 14th Line, St Petersburg 199178, Russia
[2] Betria Syst Inc, St Petersburg, Russia
来源
SPEECH AND COMPUTER | 2014年 / 8773卷
关键词
speech disfluencies; filled pauses; lengthenings; hesitation; speech corpus; spontaneous speech processing; speech recognition; RECOGNITION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The spontaneous speech processing has a number of problems. Among them there are speech disfluencies. Although most of them are easily treated by speakers and usually do not cause any difficulties for understanding, for Automatic Speech Recognition (ASR) systems their appearance lead to many recognition mistakes. Our paper deals with the most frequent of them (filled pauses and sound lengthenings) basing on the analysis of their acoustical parameters. The method based on the autocorrelation function was used to detect voiced hesitation phenomena and a method of band-filtering was used to detect unvoiced hesitation phenomena. For the experiments on filled pauses and lengthenings detection an especially collected corpus of spontaneous Russian map-task and appointment-task dialogs was used. The accuracy of voiced filled pauses and lengthening detection was 80%. And accuracy of detection of unvoiced fricative lengthening was 66%.
引用
收藏
页码:227 / 234
页数:8
相关论文
共 50 条
  • [41] Excitation Source and Vocal Tract System based Acoustic Features for Detection of Nasals in Continuous Speech
    Nellore, Bhanu Teja
    Dumpala, Sri Harsha
    Nathwani, Karan
    Gangashetty, Suryakanth, V
    INTERSPEECH 2019, 2019, : 166 - 170
  • [42] SVM based Voice Activity Detection by fusing a new acoustic feature PLMS with some existing acoustic features of speech
    Bharti, Shambhu Shankar
    Gupta, Manish
    Agarwal, Suneeta
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2018, 35 (02) : 1519 - 1530
  • [43] Acoustic and Language Based Deep Learning Approaches for Alzheimer's Dementia Detection From Spontaneous Speech
    Mahajan, Pranav
    Baths, Veeky
    FRONTIERS IN AGING NEUROSCIENCE, 2021, 13
  • [44] Robust speech detection in real acoustic backgrounds with perceptually motivated features
    Bach, Joerg-Hendrik
    Anemueller, Joern
    Kollmeier, Birger
    SPEECH COMMUNICATION, 2011, 53 (05) : 690 - 706
  • [45] Speech spoofing detection using SVM and ELM technique with acoustic features
    Rahmeni, Raoudha
    Ben Aicha, Anis
    Ben Ayed, Yassine
    2020 5TH INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR SIGNAL AND IMAGE PROCESSING (ATSIP'2020), 2020,
  • [46] Acoustic and Data-driven Features for Robust Speech Activity Detection
    Thomas, Samuel
    Mallidi, Sri Harish
    Janu, Thomas
    Hermansky, Hynek
    Mesgarani, Nima
    Zhou, Xinhui
    Shamma, Shihab
    Ng, Tim
    Zhang, Bing
    Long Nguyen
    Matsoukas, Spyros
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1983 - 1986
  • [47] Performance Estimation of Spontaneous Speech Recognition Using Non-Reference Acoustic Features
    Guo, Ling
    Yamada, Takeshi
    Makino, Shoji
    2016 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2016,
  • [48] Spontaneous-Speech Acoustic-Prosodic Features of Children with Autism and the Interacting Psychologist
    Bone, Daniel
    Black, Matthew P.
    Lee, Chi-Chun
    Williams, Marian E.
    Levitt, Pat
    Lee, Sungbok
    Narayanan, Shrikanth
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1042 - 1045
  • [49] Comparison of machine learning algorithms and acoustic features in emotion recognition from spontaneous speech
    Iizuka, Takahisa
    Mori, Hiroki
    ACOUSTICAL SCIENCE AND TECHNOLOGY, 2022, 43 (04) : 228 - 231
  • [50] Detecting Alzheimer's Disease using Interactional and Acoustic features from spontaneous speech
    Nasreen, Shamila
    Hough, Julian
    Purver, Matthew
    INTERSPEECH 2021, 2021, : 1962 - 1966