Classification of non-speech acoustic signals using structure models

Cited by: 0
Authors
Tschöpe, C [1]
Hentschel, D [1]
Wolff, M [1]
Eichner, M [1]
Hoffmann, R [1]
Affiliation
[1] Fraunhofer Inst Nondestruct Testing, Dresden, Germany
Keywords
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Non-speech acoustic signals are widely used as input to non-destructive testing systems. In this rapidly growing field, the signals are becoming increasingly complex, so more powerful models are required. Methods established in speech recognition, such as dynamic time warping (DTW) and hidden Markov models (HMM), have been used successfully but are not sufficient in all cases. We propose the application of generalized structured Markov graphs (SMG). We describe a task-independent structure learning technique that automatically adapts the models to the structure of the test signals. In two case studies using data from real applications, we demonstrate that our solution outperforms hand-tuned HMM structures in terms of class discrimination.
Pages: 653-656
Page count: 4