Variable time-scale modification of speech using transient information

Cited by: 0
Authors
Lee, SJ
Kim, HD
Kim, HS
DOI
Not available
Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206; 082403;
Abstract
Conventional time-scale modification methods share a problem: as the modification rate increases, the time-scale-modified speech becomes less intelligible, because these methods ignore the effect of articulation rate on speech characteristics. In this paper, we propose a variable time-scale modification method based on the observation that the timing of the transient portions of a speech signal plays an important role in speech perception. After identifying the transient and steady portions of the signal, the proposed method reaches the target rate by modifying the steady portions only. The results of a subjective preference test indicate that the proposed method outperforms the conventional SOLA method.
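As a rough illustration of the idea of reaching the target rate through the steady portions alone, the sketch below computes the rate that the steady portions would need if the transient portions were left at their original duration. This is an assumption made for illustration; the abstract does not give the paper's actual formula, and the function name `steady_rate` and its parameters are hypothetical.

```python
def steady_rate(transient_dur, steady_dur, target_rate):
    """Return the rate to apply to the steady portions only.

    A rate above 1 means faster (shorter) speech. Transient portions are
    assumed to keep their original duration; this is an illustrative
    assumption, not a detail taken from the paper.
    """
    total = transient_dur + steady_dur
    # Duration the whole utterance should have after modification.
    target_total = total / target_rate
    # Whatever is not taken up by the untouched transients is left for
    # the steady portions.
    steady_target = target_total - transient_dur
    if steady_target <= 0:
        raise ValueError(
            "target rate cannot be reached by modifying steady portions only"
        )
    return steady_dur / steady_target


# 0.5 s of transients and 1.5 s of steady speech, played twice as fast
# overall: the steady portions must be compressed by a factor of 3.
print(steady_rate(0.5, 1.5, 2.0))
```

Under this assumption, the steady portions are always modified more strongly than the overall target rate, and some target rates are unreachable once the transients alone exceed the target duration.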
Pages: 1319-1322
Number of pages: 4