Mixed wideband speech and music coding using a speech/music discriminator

被引:0
|
作者
Qiao, RY [1 ]
机构
[1] CSIRO, Epping, NSW 2121, Australia
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In multimedia applications such as videoconferencing, users are demanding higher quality speech/audio transmission than the POTS can offer. 7 kHz wideband speech/audio offers a good compromise between bandwidth and sound quality. It improves the intelligibility and naturalness of speech and adds a feeling of transparent communication. Currently the only existing international standard for coding such signals is the G.722 wideband speech/audio coder. While its coding quality is satisfactory, it leaves much to be desired with its bit rate. CELP-based approach has been very successful in telephone bandwidth speech coding, but is not suitable for coding non-speech signals because of the assumed signal production model. This paper proposes an alternative approach to mixed speech/music coding, which uses a discriminator to separate music signals from speech, and codes them with the G.722 coder and a G.723.1-based speech coder, respectively. Simulations shows very promising results.
引用
收藏
页码:605 / 608
页数:4
相关论文
共 50 条
  • [1] A ROBUST SPEECH/MUSIC DISCRIMINATOR FOR SWITCHED AUDIO CODING
    Fuchs, Guillaume
    [J]. 2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, : 569 - 573
  • [2] A speech/music discriminator for radio recordings using Bayesian networks
    Giannakopoulos, Theodoros
    Pikrakis, Aggelos
    Theodoridis, Sergios
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 5667 - 5670
  • [3] A robust and computationally efficient Speech/Music discriminator
    Jayme, Garcia Arnal Barbedo
    Lopes, Amauri
    [J]. JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2006, 54 (7-8): : 571 - 588
  • [4] Design of an efficient music-speech discriminator
    Tardon, Lorenzo J.
    Sammartino, Simone
    Barbancho, Isabel
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2010, 127 (01): : 271 - 279
  • [5] A speech-music discriminator using HILN model based features
    Thoshkahna, Balaji
    Sudha, V
    Ramakrishnan, K. R.
    [J]. 2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 5283 - 5286
  • [6] Construction and evaluation of a robust multifeature speech/music discriminator
    Scheirer, E
    Slaney, M
    [J]. 1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1331 - 1334
  • [7] Robust singing detection in speech/music discriminator design
    Chou, W
    Gu, L
    [J]. 2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING, 2001, : 865 - 868
  • [8] A real-time speech-music discriminator
    Aarts, RM
    Dekkers, RT
    [J]. JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 1999, 47 (09): : 720 - 725
  • [9] AN EFFICIENT BACKWARD CODING FOR WIDEBAND SPEECH AND MUSIC SIGNALS (ADPCM-AB)
    HAYASHI, S
    HONDA, M
    KITAWAKI, N
    [J]. REVIEW OF THE ELECTRICAL COMMUNICATIONS LABORATORIES, 1988, 36 (03): : 363 - 368
  • [10] Audio coding improvement using evolutionary speech/music discrimination
    Exposito, J. E. Munoz
    Galan, S. Garcia
    Reyes, N. Ruiz
    Candeas, R. Vera
    [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-4, 2007, : 822 - 827