Audio coding improvement using evolutionary speech/music discrimination

被引:0
|
作者
Exposito, J. E. Munoz [1 ]
Galan, S. Garcia [1 ]
Reyes, N. Ruiz [1 ]
Candeas, R. Vera [1 ]
机构
[1] Univ Jaen, Telecommun Engn Dept, Jaen, Spain
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automatic speech/music discrimination is an important tool used in many multimedia applications, becoming a research topic of interest in the last years. This paper presents our last works in the speech/music discrimination field, aiming to improve the coding efficiency of standard audio coders (i.e. MP3, AAC) when speech and music signals are involved. In order to discriminate between speech and music, a fuzzy rules-based expert system is incorporated into the decision-taking stage of traditional speech/music discrimination systems. The knowledge base of the fuzzy expert system has been obtained by means of a typical genetic learning algorithm (the Pittsburgh algorithm). The proposed speech/music discrimination scheme manages the operation of an intelligent audio coder, which selects a GSM coder for speech frames and an AAC coder for music ones, resulting in a lower bit rate regarding the case of using a standardized audio coder (AAC in this work). Further, the intelligent audio coder has been designed aiming to obtain a similar subjective audio quality than AAC. GSM operates at 13 kbits/s, while in the experiments the bit rate specification for AAC has been 32 kbits/s for one-channel audio signals.
引用
收藏
页码:822 / 827
页数:6
相关论文
共 50 条
  • [21] An RNN-Based Speech-Music Discrimination Used for Hybrid Audio Coder
    Yang, Wanzhao
    Tu, Weiping
    Zheng, Jiaxi
    Zhang, Xiong
    Yang, Yuhong
    Song, Yucheng
    MULTIMEDIA MODELING, MMM 2018, PT I, 2018, 10704 : 81 - 92
  • [22] Wideband speech and audio coding using gammatone filter banks
    Ambikairajah, E
    Epps, J
    Lin, L
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING, 2001, : 773 - 776
  • [23] Speech and Music Discrimination Using Spectral Transition Rate
    Yang, Kyong-Chul
    Bang, Yong-Chan
    Cho, Sun-Ho
    Yook, Dongsuk
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2009, 28 (03): : 273 - 278
  • [24] On the Discrimination of Speech/Music using a Time Series Regularity
    Swe, Ei Mon Mon
    Pwint, Moe
    Sattar, Farook
    ISM: 2008 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA, 2008, : 53 - +
  • [25] Advances in Speech and Audio Processing and Coding
    Spanias, Andreas
    2015 6TH INTERNATIONAL CONFERENCE ON INFORMATION, INTELLIGENCE, SYSTEMS AND APPLICATIONS (IISA), 2015,
  • [26] MPEG Unified Speech and Audio Coding
    Quackenbush, Schuyler
    IEEE MULTIMEDIA, 2013, 20 (02) : 72 - 78
  • [27] COMPARISON OF WINDOWING IN SPEECH AND AUDIO CODING
    Baeckstroem, Tom
    2013 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2013,
  • [28] Speech/audio coding technologies and their applications
    Kaneko, Takao, 2000, NTT, Tokyo, Japan (49):
  • [29] Audio coding and electronic distribution of music
    Brandenburg, K
    SECOND INTERNATIONAL CONFERENCE ON WEB DELIVERING OF MUSIC, PROCEEDINGS, 2002, : 3 - 5
  • [30] Security Improvement for Audio Watermarking in Image Using BCH Coding
    Fazli, Amir R.
    Eghbali, Mohammad M.
    Kazemi, Zahra
    Sarbisheie, Ghazaleh
    PROCEEDINGS OF THE 2012 8TH INTERNATIONAL SYMPOSIUM ON COMMUNICATION SYSTEMS, NETWORKS & DIGITAL SIGNAL PROCESSING (CSNDSP), 2012,