A Digital Signal Processor Implementation of Silent/Electrolaryngeal Speech Enhancement based on Real-Time Statistical Voice Conversion

被引:0
|
作者
Moriguchi, Takuto [1 ]
Toda, Tomoki [1 ]
Sano, Motoaki [2 ]
Sato, Hiroshi [2 ]
Neubig, Graham [1 ]
Sakti, Sakriani [1 ]
Nakamura, Satoshi [1 ]
机构
[1] Nara Inst Sci & Technol, Grad Sch Informat Sci, Nara, Japan
[2] Foster Elect Co Ltd, Akishima, Tokyo, Japan
关键词
statistical voice conversion; real-time processing; reduction of computational cost; DSP; non-audible murmur; electrolaryngeal speech;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present a digital signal processor (DSP) implementation of real-time statistical voice conversion (VC) for silent speech enhancement and electrolaryngeal speech enhancement. As a silent speech interface, we focus on non audible murmur (NAM), which can be used in situations where audible speech is not acceptable. Electrolaryngeal speech is one of the typical types of alaryngeal speech produced by an alternative speaking method for laryngectornees. However, the sound quality of NAM and electrolaryngeal speech suffers from lack of naturalness. VC has proven to be one of the promising approaches to address this problem, and it has been successfully implemented on devices with sufficient computational resources. An implementation on devices that are highly portable but have limited computational resources would greatly contribute to its practical use. In this paper we further implement real-time VC on a DSP. To implement the two speech enhancement systems based on real-time VC, one from NAM to a whispered voice and the other from electrolaryngeal speech to a natural voice, we propose several methods for reducing computational cost while preserving conversion accuracy. We conduct experimental evaluations and show that real-time VC is capable of running on a DSP with little degradation.
引用
收藏
页码:3071 / 3075
页数:5
相关论文
共 50 条
  • [41] IMPLEMENTATION OF A BIPOLAR REAL-TIME IMAGE SIGNAL PROCESSOR - RISP-II
    AONO, K
    MARUYAMA, M
    MORI, T
    YAMADA, H
    HATAYA, K
    IEEE JOURNAL OF SOLID-STATE CIRCUITS, 1987, 22 (03) : 403 - 408
  • [42] DSP Real-Time Implementation of DOST Algorithm Used for Speech Enhancement
    Saoud, Safa
    Bousselmi, Souha
    Ben Nasr, Mouhamed
    Cherif, Adnen
    PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON SCIENCES OF ELECTRONICS, TECHNOLOGIES OF INFORMATION AND TELECOMMUNICATIONS (SETIT'18), VOL.2, 2020, 147 : 77 - 88
  • [43] Real-time turbo-decoding of product codes on a digital signal processor
    Goalic, A
    Pyndiah, R
    GLOBECOM 97 - IEEE GLOBAL TELECOMMUNICATIONS CONFERENCE, CONFERENCE RECORD, VOLS 1-3, 1997, : 624 - 628
  • [44] A REAL-TIME ADAPTIVE LATTICE PREDICTOR USING A DIGITAL SIGNAL PROCESSOR CHIP
    KIM, SH
    HONG, KR
    CHOI, YH
    HONG, WH
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 1989, 38 (05) : 1016 - 1019
  • [45] Real-time power quality waveform recognition with a programmable digital signal processor
    Wang, M
    Rowe, GI
    Mamishev, AV
    2003 IEEE POWER ENGINEERING SOCIETY GENERAL MEETING, VOLS 1-4, CONFERENCE PROCEEDINGS, 2003, : 1268 - 1273
  • [46] LOW-COST REAL-TIME SERVICE DIGITAL SIGNAL PROCESSOR.
    Cohn-Sfetcu, Sorin
    Doyle, John
    IEEE Transactions on Communications, 1978, COM-26 (05): : 626 - 631
  • [47] An Ultrafast Digital Signal Processor for Millimeter Wave Real-time Imaging Radar
    Shi, Qingzhan
    Zhang, Deping
    Cheng, Shiliang
    Luo, Hui
    Yuan, Naichang
    2015 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMMUNICATIONS AND COMPUTING (ICSPCC), 2015, : 306 - 309
  • [48] PERFORMANCE OF THE IMS-A100 DIGITAL SIGNAL PROCESSOR FOR REAL-TIME DECONVOLUTION
    SANCHEZ, T
    ANAYA, JJ
    FRITSCH, C
    MICROPROCESSORS AND MICROSYSTEMS, 1994, 18 (06) : 315 - 322
  • [49] REAL-TIME VOICE CONVERSION BASED ON INSTANTANEOUS HARMONIC PARAMETERS
    Azarov, Elias
    Petrovsky, Alexander
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5140 - 5143
  • [50] A Real-Time Speech Enhancement Processor for Hearing Aids in 28-nm CMOS
    Park, Sungjin
    Lee, Sunwoo
    Park, Jeongwoo
    Choi, Hyeong-Seok
    Lee, Kyogu
    Jeon, Dongsuk
    IEEE JOURNAL OF SOLID-STATE CIRCUITS, 2024,