Audio Effect for Highlighting Speaker's Voice Corrupted by Background Noise on Portable Digital Imaging Devices

被引:0
|
作者
Kang, Jin Ah [1 ]
Chun, Chan Jun [1 ]
Kim, Hong Kook [1 ]
Kim, Ji Woon [2 ]
Kim, Myeong Bo [2 ]
机构
[1] Gwangju Inst Sci & Technol GIST, Sch Informat & Commun, Kwangju 500712, South Korea
[2] Samsung Elect, Digital Image Business, Camcorder Business Team, Gyenggido 443742, South Korea
基金
新加坡国家研究基金会;
关键词
Audio effect; audio content classification; adaptive scaling; speech enhancement; portable digital imaging devices;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, an audio effect (AE) algorithm is proposed which can be applied to portable digital imaging devices to enjoy video contents effectively. The proposed AE algorithm enhances speech signals corrupted by background noise in audio content based on audio content classification (ACC) and the signal-to-noise ratio (SNR) estimation in order to highlight speaker's voice. The ACC classifies each short segment of audio content as speech, non-speech, or mixed signal by using the parameters such as signal energy, sub-band energy, and residual signal energy obtained from the linear prediction analysis. Then, we adaptively scale the signals according to the classification and the estimated SNR. To show the effectiveness of the proposed AE algorithm, we perform an informal listening test between the original audio contents and their processed versions by the proposed AE algorithm. Consequently, it is shown that the proposed AE algorithm significantly improves audio quality.
引用
下载
收藏
页码:39 / +
页数:2
相关论文
共 7 条
  • [1] A Smart Background Music Mixing Algorithm for Portable Digital Imaging Devices
    Kang, Jin Ah
    Chun, Chan Jun
    Kim, Hong Kook
    Kim, Myeong Bo
    Kim, Sang Ryong
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2011, 57 (03) : 1258 - 1263
  • [2] Is children's listening effort in background noise influenced by the speaker's voice quality?
    Sahlen, Birgitta
    Haake, Magnus
    von Lochow, Heike
    Holm, Lucas
    Kastberg, Tobias
    Brannstrom, K. Jonas
    Lyberg-Ahlander, Viveka
    LOGOPEDICS PHONIATRICS VOCOLOGY, 2018, 43 (02) : 47 - 55
  • [3] A Voice-Driven Scene-Mode Recommendation Service for Portable Digital Imaging Devices
    Oh, Yoo Rhee
    Yoon, Jae Sam
    Kim, Hong Kook
    Kim, Myung Bo
    Kim, Sang Ryong
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2009, 55 (04) : 1739 - 1747
  • [4] A User Voice Reduction Algorithm Based on Binaural Signal Separation for Portable Digital Imaging Devices
    Park, Ji Hun
    Kim, Hong Kook
    Kim, Myeong Bo
    Kim, Sang Ryong
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2012, 58 (02) : 679 - 684
  • [5] Design and Implementation of a Video-Zoom Driven Digital Audio-Zoom System for Portable Digital Imaging Devices
    Park, Nam In
    Kim, Seon Man
    Kim, Hong Kook
    Kim, Ji Woon
    Kim, Myeong Bo
    Yun, Su Won
    SIGNAL PROCESSING AND MULTIMEDIA, 2010, 123 : 165 - +
  • [6] Tonal noise reduction based on a psychoacoustic model and its implementation on portable digital imaging devices
    Park, N. I. (naminpark@gist.ac.kr), 1600, ICIC Express Letters Office, Tokai University, Kumamoto Campus, 9-1-1, Toroku, Kumamoto, 862-8652, Japan (06):
  • [7] Effect of background trends removal on noise power spectrum measurements in digital x-ray imaging
    Zhou, Zhongxing
    Gao, Feng
    Zhao, Huijuan
    Zhang, Lixin
    ADVANCED BIOMEDICAL AND CLINICAL DIAGNOSTIC SYSTEMS IX, 2011, 7890