Improvements on Punctuation Generation Inspired Linguistic Features for Mandarin Prosody Generation

被引:0
|
作者
Chiang, Chen-Yu [1 ]
Hung, Yu-Ping [1 ]
Liou, Guan-Ting [2 ]
Wang, Yih-Ru [2 ]
机构
[1] Natl Taipei Univ, Dept Commun Engn, New Taipei, Taiwan
[2] Natl Chiao Tung Univ, Dept Elect Engn, Hsinchu, Taiwan
关键词
oprosody generation; linguistic feature;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper proposes two types of machine-extracted linguistic features from unlimited text input for Mandarin prosody generation. One is the improved punctuation confidence (iPC) which is a modified version of the previously proposed punctuation confidence that represents likelihood of inserting major punctuation marks (PMs) at word boundaries. Another is the quotation confidence (QC) which measures likelihood of a word string to be quoted as a meaningful or emphasized unit. Since major PMs are highly correlated with prosodic breaks, and a quoted Chinese word string plays an important role in human language understanding, the two features potentially could provide useful information for prosody generation. The idea is realized by employing conditional random field-based models to predict major PMs, quoted word string structures, and their associated confidences, i.e. iPC and QC. Then the predicted confidences are combined with traditional linguistic features to predict prosodic-acoustic features. Both objective and subjective tests showed that the prosody generation with the proposed linguistic features performed better than the ones without the proposed features.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] Punctuation-generation-inspired linguistic features for Mandarin prosody generation
    Chen-Yu Chiang
    Yu-Ping Hung
    Han-Yun Yeh
    I-Bin Liao
    Chen-Ming Pan
    [J]. EURASIP Journal on Audio, Speech, and Music Processing, 2019
  • [2] Punctuation-generation-inspired linguistic features for Mandarin prosody generation
    Chiang, Chen-Yu
    Hung, Yu-Ping
    Yeh, Han-Yun
    Liao, I-Bin
    Pan, Chen-Ming
    [J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2019, 2019 (1)
  • [3] PUNCTUATION GENERATION INSPIRED LINGUISTIC FEATURES FOR MANDARIN PROSODIC BOUNDARY PREDICTION
    Chiang, Chen-Yu
    Wang, Yih-Ru
    Chen, Sin-Horng
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4597 - 4600
  • [4] An Investigation on Linguistic Features for Mandarin Prosody Generation
    Hung, Yu-Ping
    Yeh, Han-Yun
    Liao, I-Bin
    Pan, Chen-Ming
    Chiang, Chen-Yu
    [J]. 2014 17TH ORIENTAL CHAPTER OF THE INTERNATIONAL COMMITTEE FOR THE CO-ORDINATION AND STANDARDIZATION OF SPEECH DATABASES AND ASSESSMENT TECHNIQUES (COCOSDA), 2014,
  • [5] A linguistically inspired statistical model for chinese punctuation generation
    Guo Y.
    Wang H.
    Genabith J.V.
    [J]. ACM Transactions on Asian Language Information Processing, 2010, 9 (02):
  • [6] Improving Mandarin Prosody Generation Using Alternative Smoothing Techniques
    Huang, Yi-Chin
    Wu, Chung-Hsien
    Weng, Si-Ting
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (11) : 1897 - 1907
  • [7] A combined punctuation generation and speech recognition system and its performance enhancement using prosody
    Kim, JH
    Woodland, PC
    [J]. SPEECH COMMUNICATION, 2003, 41 (04) : 563 - 577
  • [8] Advanced Unsupervised Joint Prosody Labeling and Modeling for Mandarin Speech and Its Application to Prosody Generation for TTS
    Chiang, Chen-Yu
    Chen, Sin-Horng
    Wang, Yih-Ru
    [J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 500 - 503
  • [9] High-Quality Prosody Generation in Mandarin Text-to-Speech System
    Guo, Qing
    Zhang, Jie
    Katae, Nobuyuki
    Yu, Hao
    [J]. FUJITSU SCIENTIFIC & TECHNICAL JOURNAL, 2010, 46 (01): : 40 - 46
  • [10] High-quality prosody generation in Mandarin text-to-speech system
    Guo, Qing
    Zhang, Jie
    Katae, Nobuyuki
    Yu, Hao
    [J]. Fujitsu Scientific and Technical Journal, 2010, 46 (01): : 40 - 46