Variable time-scale modification of speech using transient information

被引:0
|
作者
Lee, SJ
Kim, HD
Kim, HS
机构
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Conventional time-scale modification methods have the problem that as the modification rate gets higher the time-scale modified speech signal becomes less intelligible, because they ignore the effect of articulation rate on speech characteristics. In this paper, we propose a variable time-scale modification method based on the knowledge that the timing information of transient portions of a speech signal plays an important role in speech perception. After identifying transient and steady portions of a speech signal, the proposed method gets the target rate by modifying steady portions only. The result of subjective preference test indicates that the proposed method produces performance superior to that of the conventional SOLA method.
引用
收藏
页码:1319 / 1322
页数:4
相关论文
共 50 条
  • [31] Time-scale modification of music signals
    Grofit, S
    Lavner, Y
    22ND CONVENTION OF ELECTRICAL AND ELECTRONICS ENGINEERS IN ISRAEL, PROCEEDINGS, 2002, : 254 - 256
  • [32] Stereo Time-Scale Modification Using Sum and Difference Transformation
    Roberts, Timothy
    Paliwal, Kuldip K.
    2018 12TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ICSPCS), 2018,
  • [33] Time-scale modification of music using a subband approach based on the bark scale
    Dorran, D
    Lawlor, R
    2003 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS PROCEEDINGS, 2003, : 173 - 176
  • [34] Automated detection of transition segments for intensity and time-scale modification for speech intelligibility enhancement
    Jayan, A. R.
    Pandey, P. C.
    Lehana, P. K.
    ICSCN 2008: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING COMMUNICATIONS AND NETWORKING, 2008, : 63 - 68
  • [35] Speech-adaptive time-scale modification for computer assisted language-learning
    Donnellan, O
    Jung, E
    Coyle, E
    3RD IEEE INTERNATIONAL CONFERENCE ON ADVANCED LEARNING TECHNOLOGIES, PROCEEDINGS, 2003, : 165 - 169
  • [36] Enhanced shape-invariant pitch and time-scale modification for concatenative speech synthesis
    Pollard, MP
    Cheetham, BMG
    Goodyear, CC
    Edgington, MD
    Lowry, A
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1433 - 1436
  • [37] High quality time-scale modification of speech using a peak alignment overlap-add algorithm (PAOLA)
    Dorran, D
    Lawlor, R
    Coyle, E
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 700 - 703
  • [38] A Review of Time-Scale Modification of Music Signals
    Driedger, Jonathan
    Mueller, Meinard
    APPLIED SCIENCES-BASEL, 2016, 6 (02):
  • [39] Speaking rate control based on time-scale modification and its effects on the performance of speech recognition
    Kang, Jin Ah
    Choi, Seung Ho
    INTERNATIONAL JOURNAL OF ENGINEERING SYSTEMS MODELLING AND SIMULATION, 2014, 6 (1-2) : 31 - 36
  • [40] Time-scale modification of music using a synchronized subband/time-domain approach
    Dorran, D
    Lawlor, R
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PROCEEDINGS: AUDIO AND ELECTROACOUSTICS SIGNAL PROCESSING FOR COMMUNICATIONS, 2004, : 225 - 228