TOWARDS MORE INTELLIGIBLE PHYSIOLOGICAL MICROPHONE SPEECH: A PROBABILISTIC TRANSFORMATION APPROACH

Cited by: 0
Authors
Sadjadi, Seyed Omid [1]
Patil, Sanjay A. [1]
Hansen, John H. L. [1]
Affiliations
[1] Univ Texas Dallas, CRSS, Richardson, TX 75080 USA
Keywords
Linear mapping; physiological microphone; probabilistic transformation; speech quality; objective quality measure
DOI
10.1109/ICASSP.2010.5495167
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
The non-acoustic physiological microphone (PMIC) has been shown to be useful for speech systems under adverse noisy conditions. However, the PMIC signal is not true acoustic speech, so to the listener it sounds muffled and metallic, with changes to the speaker-dependent structure. This study presents a probabilistic transformation approach that improves the perceptual quality and intelligibility of PMIC speech, not only by mapping the non-acoustic signal into the conventional speech production space but also by minimizing distortions arising from the alternative pickup location. Performance of the proposed approach is assessed using five distinct objective metrics. The results indicate that incorporating the probabilistic transformation yields a significant improvement in overall PMIC speech quality and intelligibility. Together with the PMIC, this technique can thus find applications in noise-robust human-to-human speech communication.
Pages: 4730-4733
Number of pages: 4