Filled Pauses and Lengthenings Detection Based on the Acoustic Features for the Spontaneous Russian Speech

被引：0

作者：

Verkhodanova, Vasilisa ^{[1
]}

Shapranov, Vladimir ^{[2
]}

机构：

[1] SPIIRAS, 39 14th Line, St Petersburg 199178, Russia

[2] Betria Syst Inc, St Petersburg, Russia

来源：

SPEECH AND COMPUTER | 2014年 / 8773卷

关键词：

speech disfluencies; filled pauses; lengthenings; hesitation; speech corpus; spontaneous speech processing; speech recognition; RECOGNITION;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The spontaneous speech processing has a number of problems. Among them there are speech disfluencies. Although most of them are easily treated by speakers and usually do not cause any difficulties for understanding, for Automatic Speech Recognition (ASR) systems their appearance lead to many recognition mistakes. Our paper deals with the most frequent of them (filled pauses and sound lengthenings) basing on the analysis of their acoustical parameters. The method based on the autocorrelation function was used to detect voiced hesitation phenomena and a method of band-filtering was used to detect unvoiced hesitation phenomena. For the experiments on filled pauses and lengthenings detection an especially collected corpus of spontaneous Russian map-task and appointment-task dialogs was used. The accuracy of voiced filled pauses and lengthening detection was 80%. And accuracy of detection of unvoiced fricative lengthening was 66%.

引用

页码：227 / 234

页数：8

共 50 条

[41] Excitation Source and Vocal Tract System based Acoustic Features for Detection of Nasals in Continuous Speech
Nellore, Bhanu Teja
Dumpala, Sri Harsha
Nathwani, Karan
Gangashetty, Suryakanth, V
INTERSPEECH 2019, 2019, : 166 - 170
[42] SVM based Voice Activity Detection by fusing a new acoustic feature PLMS with some existing acoustic features of speech
Bharti, Shambhu Shankar
Gupta, Manish
Agarwal, Suneeta
JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2018, 35 (02) : 1519 - 1530
[43] Acoustic and Language Based Deep Learning Approaches for Alzheimer's Dementia Detection From Spontaneous Speech
Mahajan, Pranav
Baths, Veeky
FRONTIERS IN AGING NEUROSCIENCE, 2021, 13
[44] Robust speech detection in real acoustic backgrounds with perceptually motivated features
Bach, Joerg-Hendrik
Anemueller, Joern
Kollmeier, Birger
SPEECH COMMUNICATION, 2011, 53 (05) : 690 - 706
[45] Speech spoofing detection using SVM and ELM technique with acoustic features
Rahmeni, Raoudha
Ben Aicha, Anis
Ben Ayed, Yassine
2020 5TH INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR SIGNAL AND IMAGE PROCESSING (ATSIP'2020), 2020,
[46] Acoustic and Data-driven Features for Robust Speech Activity Detection
Thomas, Samuel
Mallidi, Sri Harish
Janu, Thomas
Hermansky, Hynek
Mesgarani, Nima
Zhou, Xinhui
Shamma, Shihab
Ng, Tim
Zhang, Bing
Long Nguyen
Matsoukas, Spyros
13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1983 - 1986
[47] Performance Estimation of Spontaneous Speech Recognition Using Non-Reference Acoustic Features
Guo, Ling
Yamada, Takeshi
Makino, Shoji
2016 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2016,
[48] Spontaneous-Speech Acoustic-Prosodic Features of Children with Autism and the Interacting Psychologist
Bone, Daniel
Black, Matthew P.
Lee, Chi-Chun
Williams, Marian E.
Levitt, Pat
Lee, Sungbok
Narayanan, Shrikanth
13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1042 - 1045
[49] Comparison of machine learning algorithms and acoustic features in emotion recognition from spontaneous speech
Iizuka, Takahisa
Mori, Hiroki
ACOUSTICAL SCIENCE AND TECHNOLOGY, 2022, 43 (04) : 228 - 231
[50] Detecting Alzheimer's Disease using Interactional and Acoustic features from spontaneous speech
Nasreen, Shamila
Hough, Julian
Purver, Matthew
INTERSPEECH 2021, 2021, : 1962 - 1966

← 1 2 3 4 5 →