Classification of non-speech acoustic signals using structure models

Cited by: 0

Authors:

Tschöpe, C [1]

Hentschel, D [1]

Wolff, M [1]

Eichner, M [1]

Hoffmann, R [1]

Affiliation:

[1] Fraunhofer Inst Nondestruct Testing, Dresden, Germany

Source:

2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL V, PROCEEDINGS: DESIGN AND IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS INDUSTRY TECHNOLOGY TRACKS MACHINE LEARNING FOR SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING SIGNAL PROCESSING FOR EDUCATION | 2004

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Non-speech acoustic signals are widely used as the input of systems for non-destructive testing. In this rapidly growing field, the signals are becoming increasingly complex, so more powerful models are required. Methods such as dynamic time warping (DTW) and hidden Markov models (HMM), which are established in speech recognition, have been used successfully but are not sufficient in all cases. We propose the application of generalized structured Markov graphs (SMG). We describe a task-independent structure learning technique that automatically adapts the models to the structure of the test signals. In two case studies using data from real applications, we demonstrate that our solution outperforms hand-tuned HMM structures in terms of class discrimination.


Pages: 653-656

Page count: 4
