Audio coding improvement using evolutionary speech/music discrimination

被引:0
|
作者
Exposito, J. E. Munoz [1 ]
Galan, S. Garcia [1 ]
Reyes, N. Ruiz [1 ]
Candeas, R. Vera [1 ]
机构
[1] Univ Jaen, Telecommun Engn Dept, Jaen, Spain
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automatic speech/music discrimination is an important tool used in many multimedia applications, becoming a research topic of interest in the last years. This paper presents our last works in the speech/music discrimination field, aiming to improve the coding efficiency of standard audio coders (i.e. MP3, AAC) when speech and music signals are involved. In order to discriminate between speech and music, a fuzzy rules-based expert system is incorporated into the decision-taking stage of traditional speech/music discrimination systems. The knowledge base of the fuzzy expert system has been obtained by means of a typical genetic learning algorithm (the Pittsburgh algorithm). The proposed speech/music discrimination scheme manages the operation of an intelligent audio coder, which selects a GSM coder for speech frames and an AAC coder for music ones, resulting in a lower bit rate regarding the case of using a standardized audio coder (AAC in this work). Further, the intelligent audio coder has been designed aiming to obtain a similar subjective audio quality than AAC. GSM operates at 13 kbits/s, while in the experiments the bit rate specification for AAC has been 32 kbits/s for one-channel audio signals.
引用
收藏
页码:822 / 827
页数:6
相关论文
共 50 条
  • [1] Combined speech and audio coding by discrimination
    Tancerel, L
    Ragot, S
    Ruoppila, VT
    Lefebvre, R
    2000 IEEE WORKSHOP ON SPEECH CODING, PROCEEDINGS: MEETING THE CHALLENGES OF THE NEW MILLENNIUM, 2000, : 154 - 156
  • [2] SPEECH/MUSIC DISCRIMINATION BASED ON WARPING TRANSFORMATION AND FUZZY LOGIC FOR INTELLIGENT AUDIO CODING
    Enrique Munoz-Exposito, Jose
    Garcia Galan, Sebastian
    Ruiz Reyes, Nicolas
    Vera Candeas, Pedro
    APPLIED ARTIFICIAL INTELLIGENCE, 2009, 23 (05) : 427 - 442
  • [3] Speech/music classification based on distributed evolutionary fuzzy logic for intelligent audio coding
    Munoz Exposito, J. E.
    Ruiz Reyes, N.
    Garcia Galan, S.
    Vera Candeas, P.
    PATTERN RECOGNITION AND IMAGE ANALYSIS, PT 2, PROCEEDINGS, 2007, 4478 : 556 - +
  • [4] Speech/Music Discrimination in Audio Podcast Using Structural Segmentation and Timbre Recognition
    Barthet, Mathieu
    Hargreaves, Steven
    Sandler, Mark
    EXPLORING MUSIC CONTENTS, 2011, 6684 : 138 - 162
  • [5] A ROBUST SPEECH/MUSIC DISCRIMINATOR FOR SWITCHED AUDIO CODING
    Fuchs, Guillaume
    2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, : 569 - 573
  • [6] Robust signal/noise discrimination for wideband speech and audio coding
    Jelinek, M
    Labonté, F
    2000 IEEE WORKSHOP ON SPEECH CODING, PROCEEDINGS: MEETING THE CHALLENGES OF THE NEW MILLENNIUM, 2000, : 151 - 153
  • [7] A dynamic programming approach to audio segmentation and speech/music discrimination
    Goodwin, MM
    Laroche, J
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PROCEEDINGS: AUDIO AND ELECTROACOUSTICS SIGNAL PROCESSING FOR COMMUNICATIONS, 2004, : 309 - 312
  • [8] Speech/music discrimination-based audio characterization using blind watermarking scheme
    Mezghani, Eya
    Charfeddine, Maha
    Nicolas, Henri
    Ben Amar, Chokri
    JOURNAL OF INFORMATION ASSURANCE AND SECURITY, 2016, 11 (06): : 311 - 321
  • [9] Audio signal discrimination using evolutionary spectrum
    Al-Shoshan, A.I.
    International Journal of Computers and Applications, 2009, 31 (02) : 69 - 73
  • [10] Improvement to speech-music discrimination using sinusoidal model based features
    Shirazi, Jalil
    Ghaemmaghami, Shahrokh
    MULTIMEDIA TOOLS AND APPLICATIONS, 2010, 50 (02) : 415 - 435