An artificial neural network approach to automatic speech processing

Cited by: 58
Authors
Siniscalchi, Sabato Marco [1, 2]
Svendsen, Torbjorn [3]
Lee, Chin-Hui [2]
Affiliations
[1] Kore Univ Enna, Fac Engn & Architecture, Enna, Sicily, Italy
[2] Georgia Inst Technol, Sch Elect & Comp Engn, Atlanta, GA 30332 USA
[3] NTNU, Dept Elect & Telecommun, Trondheim, Norway
Keywords
Artificial neural networks; Deep neural networks; Acoustic feature modeling; Connectionist automatic speech recognition; Automatic language recognition; TEMPORAL PATTERNS; RECOGNITION; INFORMATION; PREDICTION; ASR;
DOI
10.1016/j.neucom.2014.03.005
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
An artificial neural network (ANN) is a powerful mathematical framework used to either model complex relationships between inputs and outputs or find patterns in data. It is based on an interconnected group of artificial neurons, and it employs a connectionist approach to computation when processing information. ANNs have been successfully used for a great variety of applications, such as decision making, quantum chemistry, radar systems, face identification, gesture recognition, handwritten text recognition, medical diagnosis, financial applications, robotics, data mining, and e-mail spam filtering. In the speech community, neural architectures have been used since the beginning of the 1980s, and ANNs have been proven useful to accomplish several speech processing tasks, e.g., to extract linguistically motivated features, to perform speech detection, and to generate local scores to be used for different goals. In recent years, there has been a renewed interest in the use of ANNs for speech applications due to a major advance made in pre-training the weights in deep neural networks (DNNs). It seems that a new trend to move the speech technology forward through the use of NNs has begun, and it can therefore be instructive to review key ANN applications to automatic speech processing. In this paper, several ANN-based applications for speech processing will be presented, ranging from speech attribute extraction to phoneme estimation and/or classification. Furthermore, it will be shown that ANNs play a key role in several important speech applications, such as large vocabulary continuous speech recognition (LVCSR) and automatic language recognition. The goal of the paper is to summarize chief ANN approaches to speech processing using the experience gathered in the last seven years in our laboratories. © 2014 Elsevier B.V. All rights reserved.
Pages: 326-338
Number of pages: 13