Speech transmission index from running speech: A neural network approach

被引:27
|
作者
Li, FF [1 ]
Cox, TJ [1 ]
机构
[1] Univ Salford, Sch Acoust & Elect Engn, Salford M5 4WT, Lancs, England
来源
关键词
D O I
10.1121/1.1558373
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Speech transmission index (STI) is an important objective parameter concerning speech intelligibility for sound transmission channels. It is normally measured with specific test signals to ensure high accuracy and good repeatability. Measurement with running speech was previously proposed, but accuracy is compromised and hence applications limited. A new approach that uses artificial neural networks to accurately extract the STI from received running speech is developed in this paper. Neural networks are trained on a large set of transmitted speech examples with prior knowledge of the transmission channels' STIs. The networks perform complicated nonlinear function mappings and spectral feature. memorization to enable accurate objective parameter extraction from transmitted speech. Validations via simulations demonstrate the feasibility of this new method on a one-net-one-speech extract basis. In this case, accuracy is comparable with normal measurement methods. This provides an,alternative to standard, measurement techniques, and it is intended that the neural network method can facilitate occupied room acoustic measurements. (C) 2003 Acoustical Society of America.
引用
收藏
页码:1999 / 2008
页数:10
相关论文
共 50 条
  • [1] A neural network for blind identification of speech transmission index
    Li, FF
    Cox, TJ
    [J]. 2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PROCEEDINGS: SPEECH II; INDUSTRY TECHNOLOGY TRACKS; DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS; NEURAL NETWORKS FOR SIGNAL PROCESSING, 2003, : 757 - 760
  • [2] Experimental comparison between speech transmission index, rapid speech transmission index, and speech intelligibility index
    Larm, P
    Hongisto, V
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2006, 119 (02): : 1106 - 1117
  • [3] A method to determine the speech transmission index from speech waveforms
    Payton, KL
    Braida, LD
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1999, 106 (06): : 3637 - 3648
  • [4] Neural network approach to speech pathology
    Salvatore, AP
    Thorne, NA
    Gross, CM
    Cannito, MP
    [J]. 42ND MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS, PROCEEDINGS, VOLS 1 AND 2, 1999, : 439 - 442
  • [5] BLIND METHOD OF ESTIMATING SPEECH TRANSMISSION INDEX FROM REVERBERANT SPEECH SIGNALS
    Unoki, Masashi
    Sasaki, Kyohei
    Miyauchi, Ryota
    Akagi, Masato
    Kim, Nam Soo
    [J]. 2013 PROCEEDINGS OF THE 21ST EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2013,
  • [6] A Neural Network Based Approach for Recognition of Basic Emotions from Speech
    Sham-E-Ansari, Md
    Disha, Shaminaj Towfika
    Chowdhury, Atiqul Islam
    Hasan, Md Khairul
    [J]. 2020 IEEE REGION 10 SYMPOSIUM (TENSYMP) - TECHNOLOGY FOR IMPACTFUL SUSTAINABLE DEVELOPMENT, 2020, : 807 - 810
  • [7] Speech transmission index or rapid speech transmission index for classrooms? A designer's point of view
    Tang, SK
    Yeung, MH
    [J]. JOURNAL OF SOUND AND VIBRATION, 2004, 276 (1-2) : 431 - 439
  • [8] A neural network approach for human emotion recognition in speech
    Bhatti, MW
    Wang, YJ
    Guan, L
    [J]. 2004 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL 2, PROCEEDINGS, 2004, : 181 - 184
  • [9] Speech recognition for emotions with neural network: A design approach
    Giripunje, S
    Panat, A
    [J]. KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 2, PROCEEDINGS, 2004, 3214 : 640 - 645
  • [10] An artificial neural network approach to automatic speech processing
    Siniscalchi, Sabato Marco
    Svendsen, Torbjorn
    Lee, Chin-Hui
    [J]. NEUROCOMPUTING, 2014, 140 : 326 - 338