Toward a rule-based synthesis of emotional speech on linguistic descriptions of perception

被引:0
|
作者
Huang, CF [1 ]
Akagi, M [1 ]
机构
[1] Japan Adv Inst Sci & Technol, Sch Informat Sci, Nomi, Ishikawa, Japan
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper reports rules for morphing a voice to make it be perceived as containing various primitive features, for example, to make it sound more "bright" or "dark". In a previous work we proposed a three-layered model, which contains emotional speech, primitive features, and acoustic features, for the perception of emotional speech. By experiments and acoustic analysis, we built the relationships between the three layers and reported that such relationships are significant. Then, a bottom-up method was adopted in order to verify the relationships. That is, we morphed (resynthesized) a speech voice by composing acoustic features in the bottommost layer to produce a voice in which listeners could perceive a single or multiple primitive features, which could be further perceived as different categories of emotion. The intermediate results show that the relationships of the model built in previous work are valid.
引用
收藏
页码:366 / 373
页数:8
相关论文
共 50 条
  • [1] A Rule-Based Speech Morphing for Verifying a Expressive Speech Perception Model
    Huang, Chun-Fang
    Akagi, Masato
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 1221 - 1224
  • [2] Voice conversion for emotional speech: Rule-based synthesis with degree of emotion controllable in dimensional space
    Xue, Yawen
    Hamada, Yasuhiro
    Akagi, Masato
    [J]. SPEECH COMMUNICATION, 2018, 102 : 54 - 67
  • [3] A Pronunciation Rule-Based Speech Synthesis Technique for Odia Numerals
    Panda, Soumya Priyadarsini
    Nayak, Ajit Kumar
    [J]. COMPUTATIONAL INTELLIGENCE IN DATA MINING, VOL 1, CIDM 2015, 2016, 410 : 483 - 491
  • [4] Rule-Based Storytelling Text-to-Speech (TTS) Synthesis
    Ramli, Izzad
    Seman, Noraini
    Ardi, Norizah
    Jamil, Nursuriati
    [J]. 2016 3RD INTERNATIONAL CONFERENCE ON MECHANICS AND MECHATRONICS RESEARCH (ICMMR 2016), 2016, 77
  • [5] The linguistic basis of a rule-based tagger of Czech
    Oliva, K
    Hnátková, M
    Petkevic, V
    Kveton, P
    [J]. TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2000, 1902 : 3 - 8
  • [6] An Optimization of Fundamental Frequency and Length of Syllables for Rule-Based Speech Synthesis
    Win, Kyawt Yin
    Takara, Tomio
    [J]. FUTURE GENERATION INFORMATION TECHNOLOGY, 2010, 6485 : 114 - 124
  • [7] Myanmar text-to-speech system with rule-based tone synthesis
    Win, Kyawt Yin
    Takara, Tomio
    [J]. ACOUSTICAL SCIENCE AND TECHNOLOGY, 2011, 32 (05) : 174 - 181
  • [8] TOWARD A RULE-BASED BIOME MODEL
    NEILSON, RP
    KING, GA
    KOERPER, G
    [J]. LANDSCAPE ECOLOGY, 1992, 7 (01) : 27 - 43
  • [9] A Rule-Based Concatenative Approach to Speech Synthesis in Indian Language Text-to-Speech Systems
    Panda, Soumya Priyadarsini
    Nayak, Ajit Kumar
    [J]. INTELLIGENT COMPUTING, COMMUNICATION AND DEVICES, 2015, 309 : 523 - 531
  • [10] A RULE-BASED EXPERT SYSTEM FOR MUSIC PERCEPTION
    JONES, JA
    MILLER, BO
    SCARBOROUGH, DL
    [J]. BEHAVIOR RESEARCH METHODS INSTRUMENTS & COMPUTERS, 1988, 20 (02): : 255 - 262