Reducing the Computational Complexity of Multimicrophone Acoustic Models with Integrated Feature Extraction

被引:10
|
作者
Sainath, Tara N. [1 ]
Narayanan, Arun [1 ]
Weiss, Ron J. [1 ]
Variani, Ehsan [1 ]
Wilson, Kevin W. [1 ]
Bacchiani, Michiel [1 ]
Shafran, Izhak [1 ]
机构
[1] Google Inc, New York, NY 10011 USA
关键词
D O I
10.21437/Interspeech.2016-92
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Recently, we presented a multichannel neural network model trained to perform speech enhancement jointly with acoustic modeling [1], directly from raw waveform input signals. While this model achieved over a 10% relative improvement compared to a single channel model, it came at a large cost in computational complexity, particularly in the convolutions used to implement a time-domain filterbank. In this paper we present several different approaches to reduce the complexity of this model by reducing the stride of the convolution operation and by implementing filters in the frequency domain. These optimizations reduce the computational complexity of the model by a factor of 3 with no loss in accuracy on a 2,000 hour Voice Search task.
引用
收藏
页码:1971 / 1975
页数:5
相关论文
共 50 条
  • [1] Reducing the computational complexity for whole word models
    Soltau, Hagen
    Liao, Hank
    Sak, Hasim
    [J]. 2017 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2017, : 63 - 68
  • [2] Reducing Data Complexity in Feature Extraction and Feature Selection for Big Data Security Analytics
    Sisiaridis, Dimitrios
    Markowitch, Olivier
    [J]. 2018 1ST INTERNATIONAL CONFERENCE ON DATA INTELLIGENCE AND SECURITY (ICDIS 2018), 2018, : 43 - 48
  • [3] REDUCING THE COMPLEXITY OF COMPUTATIONAL MODELS OF NEURONS USING BIFURCATION DIAGRAMS
    Oprisan, Sorinel A.
    [J]. REVUE ROUMAINE DE CHIMIE, 2009, 54 (06) : 465 - 475
  • [4] Feature Extraction of Underwater Acoustic Signal Using Mode Decomposition and Measuring Complexity
    Li, Yaan
    Li, Yuxing
    [J]. PROCEEDINGS OF 2018 15TH INTERNATIONAL BHURBAN CONFERENCE ON APPLIED SCIENCES AND TECHNOLOGY (IBCAST), 2018, : 757 - 763
  • [5] Reducing computational complexity in hyperspectral anomaly detection: a feature level fusion approach.
    Acito, N.
    Corsini, G.
    Diani, M.
    Greco, M.
    [J]. 2006 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, VOLS 1-8, 2006, : 1804 - 1807
  • [6] Articulatory Feature Extraction from Voice and Their Impact on Hybrid Acoustic Models
    Llombart, Jorge
    Miguel, Antonio
    Lleida, Eduardo
    [J]. ADVANCES IN SPEECH AND LANGUAGE TECHNOLOGIES FOR IBERIAN LANGUAGES, IBERSPEECH 2014, 2014, 8854 : 138 - 147
  • [7] Articulatory feature extraction from voice and their impact on hybrid acoustic models
    Llombart, Jorge
    Miguel, Antonio
    Lleida, Eduardo
    [J]. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2014, 8854 : 138 - 147
  • [8] Trade-off Between Computational Complexity and Accuracy in Evolutionary Image Feature Extraction
    Albukhanajer, Wissam A.
    Jin, Yaochu
    Briffa, Johan A.
    [J]. 2015 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2015, : 2412 - 2419
  • [9] Reducing Computational Complexity for Face Detection
    Sathyanarayana, Supriya
    Satzoda, Ravi Kumar
    Sathyanarayana, Suchitra
    Thambipillai, Srikanthan
    [J]. 2014 14TH INTERNATIONAL SYMPOSIUM ON INTEGRATED CIRCUITS (ISIC), 2014, : 376 - 379
  • [10] Reducing computational complexity with array predicates
    Bernecky, R
    [J]. PROCEEDINGS OF THE APL98 CONFERENCE, 1999, 29 (03): : 39 - 43