Trends in audio signal feature extraction methods

被引:220
|
作者
Sharma, Garima [1 ]
Umapathy, Kartikeyan [1 ]
Krishnan, Sridhar [1 ]
机构
[1] Ryerson Univ, Dept Elect & Comp Engn, Toronto, ON M5B 2K3, Canada
关键词
Audio; Speech; Signal; Feature extraction; Survey; Machine learning; SPECTRAL-ANALYSIS; SPEECH ANALYSIS; CLASSIFICATION; TIME; RECOGNITION; MUSIC; BINARY; DISCRIMINATION; PREDICTION; RETRIEVAL;
D O I
10.1016/j.apacoust.2019.107020
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Audio signal processing algorithms generally involves analysis of signal, extracting its properties, predicting its behaviour, recognizing if any pattern is present in the signal, and how a particular signal is correlated to another similar signals. Audio signal includes music, speech and environmental sounds. Over the last few decades, audio signal processing has grown significantly in terms of signal analysis and classification. And it has been proven that solutions of many existing issues can be solved by integrating the modern machine learning (ML) algorithms with the audio signal processing techniques. The performance of any ML algorithm depends on the features on which the training and testing is done. Hence feature extraction is one of the most vital part of a machine learning process. The aim of this study is to summarize the literature of the audio signal processing specially focusing on the feature extraction techniques. In this survey the temporal domain, frequency domain, cepstral domain, wavelet domain and time-frequency domain features are discussed in detail. (C) 2019 Elsevier Ltd. All rights reserved.
引用
收藏
页数:21
相关论文
共 50 条
  • [41] Audio feature extraction and analysis for scene segmentation and classification
    Polytechnic Univ, Brooklyn, United States
    J VLSI Signal Process Syst Signal Image Video Technol, 1-2 (61-79):
  • [42] Pitch-based feature extraction for audio classification
    Abu-El-Quran, AR
    Goubran, RA
    2ND IEEE INTERNATIONAL WORKSHOP ON HAPTIC, AUDIO AND VISUAL ENVIRONMENTS AND THEIR APPLICATIONS - HAVE 2003, 2003, : 43 - 47
  • [43] Surfboard: Audio Feature Extraction for Modern Machine Learning
    Lenain, Raphael
    Weston, Jack
    Shivkumar, Abhishek
    Fristed, Emil
    INTERSPEECH 2020, 2020, : 2917 - 2921
  • [44] Audio Feature Extraction and Analysis for Scene Segmentation and Classification
    Zhu Liu
    Yao Wang
    Tsuhan Chen
    Journal of VLSI signal processing systems for signal, image and video technology, 1998, 20 : 61 - 79
  • [45] Audio feature extraction and analysis for scene segmentation and classification
    Liu, Z
    Wang, Y
    Chen, TH
    JOURNAL OF VLSI SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 1998, 20 (1-2): : 61 - 79
  • [46] Subband conversion for feature extraction from compressed audio
    Friedrich, Tobias
    Gruhne, Matthias
    Schuller, Gerald
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 217 - 220
  • [47] AUDIO FEATURE EXTRACTION FOR VEHICLE ENGINE NOISE CLASSIFICATION
    Becker, Luca
    Nelus, Alexandra
    Gauer, Johannes
    Rudolph, Lars
    Martin, Rainer
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 711 - 715
  • [48] Enhanced Feature Extraction for Speech Detection in Media Audio
    Jang, Inseon
    Ahn, ChungHyun
    Seo, Jeongil
    Jang, Younseon
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 479 - 483
  • [49] Audio feature extraction and classification based on wavelet transform
    Xing, Feng
    Zheng, Jiming
    Wu, Yu
    Li, Jing
    PROCEEDINGS OF 2006 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE: 50 YEARS' ACHIEVEMENTS, FUTURE DIRECTIONS AND SOCIAL IMPACTS, 2006, : 183 - 186
  • [50] On local feature extraction for signal classification
    Saito, N.
    Coifman, R.R.
    Zeitschrift fuer Angewandte Mathematik und Mechanik, ZAMM, Applied Mathematics and Mechanics, 76 (Suppl 2):