Classification of non-speech acoustic signals using structure models

被引:0
|
作者
Tschöpe, C [1 ]
Hentschel, D [1 ]
Wolff, M [1 ]
Eichner, M [1 ]
Hoffmann, R [1 ]
机构
[1] Fraunhofer Inst Nondestruct Testing, Dresden, Germany
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Non-speech acoustic signals are widely used as the input of systems for non-destructive testing. In this rapidly growing field, the signals have an increasing complexity leading to the fact that powerful models are required. Methods like DTW and HMM, which are established in speech recognition, have been successfully used but are not sufficient in all cases. We propose the application of generalized structured Markov graphs (SMG). We describe a task independent structure learning technique which automatically adapts the models to the structure of the test signals. We demonstrate that our solution outperforms hand-tuned HMM structures in terms of class discrimination by two case studies using data from real applications.
引用
收藏
页码:653 / 656
页数:4
相关论文
共 50 条
  • [21] Optimizing speech/non-speech classifier design using AdaBoost
    Kwon, OW
    Lee, TW
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 436 - 439
  • [22] Effect of continuous speech and non-speech signals on stuttering frequency in adults who stutter
    Dayalu, Vikram N.
    Guntupalli, Vijaya K.
    Kalinowski, Joseph
    Stuart, Andrew
    Saltuklaroglu, Tim
    Rastatter, Michael P.
    LOGOPEDICS PHONIATRICS VOCOLOGY, 2011, 36 (03) : 121 - 127
  • [23] The effect of situation-specific non-speech acoustic cues on the intelligibility of speech in noise
    Ward, Lauren
    Shirley, Ben
    Tang, Yan
    Davies, William J.
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 2958 - 2962
  • [24] Correlation between acoustic speech characteristics and non-speech motor performance in Parkinson Disease
    Goberman, AM
    MEDICAL SCIENCE MONITOR, 2005, 11 (03): : CR109 - CR116
  • [25] VOWELS, CONSONANTS, SPEECH, AND NON-SPEECH
    ADES, AE
    PSYCHOLOGICAL REVIEW, 1977, 84 (06) : 524 - 530
  • [26] LOCALIZATION OF SPEECH AND NON-SPEECH SOUNDS
    SHIGENO, S
    OYAMA, T
    JAPANESE PSYCHOLOGICAL RESEARCH, 1983, 25 (02) : 112 - 117
  • [27] Robust speech and non-speech detection
    Tian, Y
    Wang, ZY
    Lu, DJ
    CHINESE JOURNAL OF ELECTRONICS, 2002, 11 (01): : 79 - 82
  • [28] Discriminating between Imagined Speech and Non-Speech Tasks using EEG
    AlSaleh, Mashael
    Moore, Roger
    Christensen, Heidi
    Arvaneh, Mahnaz
    2018 40TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2018, : 1952 - 1955
  • [29] On the Applicability of Speaker Diarization to Audio Indexing of Non-Speech and Mixed Non-Speech/Speech Video Soundtracks
    Mertens, Robert
    Huang, Po-Sen
    Gottlieb, Luke
    Friedland, Gerald
    Divakaran, Ajay
    Hasegawa-Johnson, Mark
    INTERNATIONAL JOURNAL OF MULTIMEDIA DATA ENGINEERING & MANAGEMENT, 2012, 3 (03): : 1 - 19
  • [30] Robust speech/non-speech detection using LDA applied to MFCC
    Martin, A
    Charlet, D
    Mauuary, L
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING, 2001, : 237 - 240