Recognition of coded speech transmitted over wireless channels

被引:0
|
作者
Gomez, Angel M. [1 ]
Peinado, Antonio M. [1 ]
Sanchez, Victoria [1 ]
Rubio, Antonio J. [1 ]
机构
[1] Univ Granada, Dept Teoria Senal Telemat & Comunicac, Fac Ciencias, E-18071 Granada, Spain
关键词
speech recognition; remote speech recognition; cellular radio; speech codecs; transmission errors; decoding; decoded speech signal; error compensation; transcoding;
D O I
10.1109/TWC.2006.1687779
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Network-based speech recognition (NSR) and distributed speech recognition (DSR) have been proposed as solutions to translate speech recognition technologies to mobile environments. NSR is the most straightforward solution since it does not require any modification in the mobile phone, however DSR offers higher robustness against codec compression and transmission channel degradation. This paper explores an alternative approach for remote speech recognition which combines the advantages of NSR and DSR. In this scheme, a standard speech codec is used for speech transmission but the recognition is performed from the received codec parameters. In particular, we focus on the effect of transmission channel errors, which can cause a more severe performance reduction on speech recognition than codec distortion. First, we show that an NSR solution can approach DSR through a reconstruction technique along with an adapted noise reduction technique originally proposed for acoustic noise. Then, these results are improved by working with recognition features directly extracted from the codec bitstream by means of parameter transcoding. Required modifications on current networks in order to access the bitstream are described. The network upgrading with the tandem free operation (TF) protocol is an attractive solution. This upgrade not only offers an overall improvement on the end-to-end speech quality, but would also allow a recognition performance similar, and even higher in poor channel conditions, to that obtained by DSR when parameter transcoding along with the proposed mitigation techniques are applied.
引用
收藏
页码:2555 / 2562
页数:8
相关论文
共 50 条
  • [21] Guessing Random Additive Noise Decoding of Network Coded Data Transmitted Over Burst Error Channels
    Chatzigeorgiou, Ioannis
    Savostyanov, Dmitry
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (09) : 12842 - 12857
  • [22] Robust decoding of H.264 encoded video transmitted over wireless channels
    Sabeva, Galina
    Ben Jamaa, Salma
    Kieffer, Michel
    Duhamel, Pierre
    2006 IEEE WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 2006, : 9 - +
  • [23] Method for filtering the input transmitted by proportional ILC controllers over wireless delay channels
    Huang, Lixun
    Chen, Hui
    Sun, Lijun
    Chen, Tianfei
    Zhang, Qiuwen
    Zhang, Zhe
    Liu, Weihua
    COMMUNICATIONS IN NONLINEAR SCIENCE AND NUMERICAL SIMULATION, 2024, 128
  • [24] Automatic speech recognition over error-prone wireless networks
    Tan, ZH
    Dalsgaard, P
    Lindberg, B
    SPEECH COMMUNICATION, 2005, 47 (1-2) : 220 - 242
  • [25] Speaker Recognition and Speaker Characterization over Landline, VoIP and Wireless Channels
    Gallardo, Laura Fernandez
    2013 HUMAINE ASSOCIATION CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII), 2013, : 665 - 670
  • [26] Efficient MMSE-Based channel error mitigation techniques.: Application to distributed speech recognition over wireless channels
    Peinado, AM
    Sánchez, V
    Pérez-Córdoba, JL
    Rubio, AJ
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2005, 4 (01) : 14 - 19
  • [27] Speech recognition for wireless applications
    Weerackody, V
    Reichl, W
    Potamianos, A
    2001 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, VOLS 1-10, CONFERENCE RECORD, 2001, : 1047 - 1051
  • [28] Adaptive Coded Modulation for Stabilization of Wireless Networked Control Systems over Binary Erasure Channels
    Royyan, Muhammad
    Vehkapera, Mikko
    Charalambous, Themistoklis
    Wichman, Risto
    2019 57TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2019, : 48 - 55
  • [29] Path diversity with a new coded cooperation scheme over multi-hop wireless channels
    Shen, Gang
    Wu, Keyin
    Liu, Erwu
    Wang, Dongyao
    Jin, Shan
    2007 IEEE 65TH VEHICULAR TECHNOLOGY CONFERENCE, VOLS 1-6, 2007, : 140 - 144
  • [30] Coded Cooperative Diversity with Convolutional Codes over Nakagami-m Wireless Fading Channels
    Gebremedhin, Lebanos W.
    Jiang, Fan
    Chen, Chuan-Chiang
    PROCEEDINGS SSST 2011: 43RD IEEE SOUTHEASTERN SYMPOSIUM ON SYSTEM THEORY, 2011, : 160 - 162