Audio coding improvement using evolutionary speech/music discrimination

被引:0
|
作者
Exposito, J. E. Munoz [1 ]
Galan, S. Garcia [1 ]
Reyes, N. Ruiz [1 ]
Candeas, R. Vera [1 ]
机构
[1] Univ Jaen, Telecommun Engn Dept, Jaen, Spain
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automatic speech/music discrimination is an important tool used in many multimedia applications, becoming a research topic of interest in the last years. This paper presents our last works in the speech/music discrimination field, aiming to improve the coding efficiency of standard audio coders (i.e. MP3, AAC) when speech and music signals are involved. In order to discriminate between speech and music, a fuzzy rules-based expert system is incorporated into the decision-taking stage of traditional speech/music discrimination systems. The knowledge base of the fuzzy expert system has been obtained by means of a typical genetic learning algorithm (the Pittsburgh algorithm). The proposed speech/music discrimination scheme manages the operation of an intelligent audio coder, which selects a GSM coder for speech frames and an AAC coder for music ones, resulting in a lower bit rate regarding the case of using a standardized audio coder (AAC in this work). Further, the intelligent audio coder has been designed aiming to obtain a similar subjective audio quality than AAC. GSM operates at 13 kbits/s, while in the experiments the bit rate specification for AAC has been 32 kbits/s for one-channel audio signals.
引用
收藏
页码:822 / 827
页数:6
相关论文
共 50 条
  • [31] Performance Improvement of Signal Classifiers for Speech/Audio Coding by Higher-Order Statistics
    Cho, Keunseok
    Hahn, Minsoo
    Jeong, Sangbae
    IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE 2011), 2011, : 821 - +
  • [32] Speech and music classification in audio documents
    Pinquier, J
    Sénac, C
    André-Obrecht, R
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 4164 - 4164
  • [33] MUSIC TONALITY FEATURES FOR SPEECH/MUSIC DISCRIMINATION
    Sell, Gregory
    Clark, Pascal
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [34] Universal speech/audio coding using hybrid ACELP/TCX techniques
    Bessette, B
    Lefebvre, R
    Salami, R
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 301 - 304
  • [35] Combined speech and audio coding using non-linear adaptations
    Chan, CF
    1997 IEEE WORKSHOP ON SPEECH CODING FOR TELECOMMUNICATIONS, PROCEEDINGS: BACK TO BASICS: ATTACKING FUNDAMENTAL PROBLEMS IN SPEECH CODING, 1997, : 105 - 106
  • [36] Audio and Speech Compression using Sinusoidal Modeling and Wavelet Residuum Coding
    Nagy, Martin Turi
    Vargic, Radoslav
    PROCEEDINGS ELMAR-2012, 2012, : 207 - 210
  • [37] Postfiltering Using Log-Magnitude Spectrum for Speech and Audio Coding
    Das, Sneha
    Backstrom, Tom
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3543 - 3547
  • [38] Speech vs Music Discrimination using Empirical Mode Decomposition
    Khonglah, Banriskhem K.
    Sharma, Rajib
    Prasanna, S. R. Mahadeva
    2015 TWENTY FIRST NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2015,
  • [39] Speech music discrimination using class-specific features
    Beierholm, T
    Baggenstoss, PM
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 2, 2004, : 379 - 382
  • [40] Discrimination between Speech and Music Using Time Series Events
    Alnadabi, Muhammad
    Johnstone, Sherri
    ICSP: 2008 9TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-5, PROCEEDINGS, 2008, : 565 - +