Identification and Automatic Detection of Parasitic Speech Sounds

被引:0
|
作者
Matousek, Jindrich [1 ]
Skarnitzl, Radek [2 ]
Machac, Pavel [2 ]
Trmal, Jan [1 ]
机构
[1] Univ West Bohemia, Fac Sci Appl, Dept Cybernet, Bohemia, Czech Republic
[2] Charles Univ Prague, Inst Phonet, Fac Arts & Philosophy, Prague, Czech Republic
关键词
parasitic speech sound; linguistic naturalness; speech synthesis; unit selection; HMM; BVM;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents initial experiments with the identification and automatic detection of parasitic sounds in speech signals. The main goal of this study is to identify such sounds in the source recordings for unit-selection-based speech synthesis systems and thus to avoid their unintended usage in synthesised speech. The first part of the paper describes the phonetic analysis and identification of parasitic phenomena in recordings of two Czech speakers. In the second part, experiments with the automatic detection of parasitic sounds using HMM-based and BVM classifiers are presented. The results are encouraging, especially those for glottalization phenomena.
引用
收藏
页码:840 / +
页数:2
相关论文
共 50 条
  • [1] Automatic Segmentation of Parasitic Sounds in Speech Corpora for TTS Synthesis
    Matousek, Jindrich
    [J]. TEXT, SPEECH AND DIALOGUE, 2010, 6231 : 369 - 376
  • [2] DETECTION AND IDENTIFICATION OF SPEECH SOUNDS USING CORTICAL ACTIVITY PATTERNS
    Centanni, T. M.
    Sloan, A. M.
    Reed, A. C.
    Engineer, C. T.
    Rennaker, R. L., II
    Kilgard, M. P.
    [J]. NEUROSCIENCE, 2014, 258 : 292 - 306
  • [3] IDENTIFICATION OF TURBULENT SPEECH SOUNDS
    WIREN, J
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1957, 29 (11): : 1255 - 1255
  • [4] Delayed automatic detection of change in speech sounds in adults with autism: A magnetoencephalographic study
    Kasai, K
    Hashimoto, O
    Kawakubo, Y
    Yumoto, M
    Kamio, S
    Itoh, K
    Koshida, I
    Iwanami, A
    Nakagome, K
    Fukuda, M
    Yamasue, H
    Yamada, H
    Abe, O
    Aoki, S
    Kato, N
    [J]. CLINICAL NEUROPHYSIOLOGY, 2005, 116 (07) : 1655 - 1664
  • [5] Automatic Speech Codec Identification with Applications to Tampering Detection of Speech Recordings
    Zhou, Jingting
    Garcia-Romero, Daniel
    Espy-Wilson, Carol
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2544 - 2547
  • [6] Automatic Speech Intelligibility Detection for Speakers with Speech Impairments: The Identification of Significant Speech Features
    Rosdi, Fadhilah
    Mustafa, Mumtaz Begum
    Salim, Siti Salwah
    Zin, Nor Azan Mat
    [J]. SAINS MALAYSIANA, 2019, 48 (12): : 2737 - 2747
  • [7] INFRASONIC CUES FOR AUTOMATIC RECOGNITION OF SPEECH SOUNDS
    MYASNIKO.
    MYASNIKO.EN
    PEKELNYI, MY
    TRILESNIK, A
    [J]. SOVIET PHYSICS ACOUSTICS-USSR, 1969, 14 (04): : 522 - +
  • [8] Automatic detection of obstructive sleep apnea based on speech or snoring sounds: a narrative review
    Cao, Shuang
    Rosenzweig, Ivana
    Bilotta, Federico
    Jiang, Hong
    Xia, Ming
    [J]. JOURNAL OF THORACIC DISEASE, 2024, 16 (04) : 2654 - 2667
  • [9] Response Advantage for the Identification of Speech Sounds
    Moskowitz, Howard S.
    Lee, Wei Wei
    Sussman, Elyse S.
    [J]. FRONTIERS IN PSYCHOLOGY, 2020, 11
  • [10] IDENTIFICATION OF STATIONARY PARTS OF SPEECH SOUNDS
    MULLER, FA
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1956, 28 (04): : 767 - 767