Multi-source neural networks for speech recognition

被引:0
|
作者
Albesano, D. [1 ]
Gemello, R. [1 ]
Mana, F. [1 ]
机构
[1] CSELT
来源
CSELT Technical Reports | 2000年 / 28卷 / 03期
关键词
Algorithms - Fast Fourier transforms - Feature extraction - Markov processes - Mathematical models - Speech recognition - Telephone;
D O I
暂无
中图分类号
学科分类号
摘要
In speech recognition the most diffused technology (Hidden Markov Models) is constrained by the condition of stochastic independence of its input features. That limits the simultaneous use of features derived from the speech signal with different processing algorithms. On the contrary Artificial Neural Networks (ANN) are capable of incorporating multiple heterogeneous input features, which do not need to be treated as independent, finding the optimal combination of these features for classification. The purpose of this work is the exploitation of this characteristic of ANN to improve the speech recognition accuracy through the combined use of input features coming from different sources (different feature extraction algorithms). In this work we integrate two input sources: the Mel based Cepstral Coefficients (MFCC) derived from FFT and the RASTA-PLP Cepstral Coefficients. The results show that this integration leads to an error reduction of 26% on a telephone quality test set.
引用
收藏
页码:301 / 309
相关论文
共 50 条
  • [41] The multi-source model for dimensioning data networks
    Bonald, T.
    Comte, C.
    [J]. COMPUTER NETWORKS, 2016, 109 : 225 - 233
  • [42] Degrees of Freedom of Multi-Source Relay Networks
    Jeon, Sang-Woon
    Chung, Sae-Young
    Jafar, Syed A.
    [J]. 2009 47TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING, VOLS 1 AND 2, 2009, : 388 - +
  • [43] Cooperative Communication in Multi-Source Line Networks
    Girnyk, Maksym A.
    Xiao, Ming
    Rasmussen, Lars K.
    [J]. 2012 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2012,
  • [44] Multi-Source Energy Networks and the ICT Revolution
    Knell, Mark
    [J]. EUROPEAN PLANNING STUDIES, 2013, 21 (11) : 1838 - 1852
  • [45] Multi-source image fusion algorithm based on cellular neural networks with genetic algorithm
    Li, Jiangyang
    Peng, Zhenming
    [J]. OPTIK, 2015, 126 (24): : 5230 - 5236
  • [46] Multi-Source Domain Adaptation with Fuzzy-Rule based Deep Neural Networks
    Li, Keqiuyin
    Lu, Jie
    Zuo, Hua
    Zhang, Guangquan
    [J]. IEEE CIS INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS 2021 (FUZZ-IEEE), 2021,
  • [47] Capacity of a Class of Multi-source Relay Networks
    Jeon, Sang-Woon
    Chung, Sae-Young
    [J]. 2009 INFORMATION THEORY AND APPLICATIONS WORKSHOP, 2009, : 62 - 66
  • [48] Power Bargaining in Multi-Source Relay Networks
    Cao, Qian
    Jing, Yindi
    Zhao, H. Vicky
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2012,
  • [49] DISTRIBUTED SOURCE CODING FOR MULTI-SOURCE ESTIMATION IN SENSOR NETWORKS
    Gang, Ren
    Vosoughi, Azadeh
    [J]. 2008 IEEE 9TH WORKSHOP ON SIGNAL PROCESSING ADVANCES IN WIRELESS COMMUNICATIONS, VOLS 1 AND 2, 2008, : 56 - 60
  • [50] Unsupervised Cross-Corpus Speech Emotion Recognition Using a Multi-Source Cycle-GAN
    Su, Bo-Hao
    Lee, Chi-Chun
    [J]. IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 14 (03) : 1991 - 2004