Toward a rule-based synthesis of emotional speech on linguistic descriptions of perception

被引：0

作者：

Huang, CF ^{[1
]}

Akagi, M ^{[1
]}

机构：

[1] Japan Adv Inst Sci & Technol, Sch Informat Sci, Nomi, Ishikawa, Japan

来源：

AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION, PROCEEDINGS | 2005年 / 3784卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper reports rules for morphing a voice to make it be perceived as containing various primitive features, for example, to make it sound more "bright" or "dark". In a previous work we proposed a three-layered model, which contains emotional speech, primitive features, and acoustic features, for the perception of emotional speech. By experiments and acoustic analysis, we built the relationships between the three layers and reported that such relationships are significant. Then, a bottom-up method was adopted in order to verify the relationships. That is, we morphed (resynthesized) a speech voice by composing acoustic features in the bottommost layer to produce a voice in which listeners could perceive a single or multiple primitive features, which could be further perceived as different categories of emotion. The intermediate results show that the relationships of the model built in previous work are valid.

引用

页码：366 / 373

页数：8

共 50 条

[1] A Rule-Based Speech Morphing for Verifying a Expressive Speech Perception Model
Huang, Chun-Fang
Akagi, Masato
[J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 1221 - 1224
[2] Voice conversion for emotional speech: Rule-based synthesis with degree of emotion controllable in dimensional space
Xue, Yawen
Hamada, Yasuhiro
Akagi, Masato
[J]. SPEECH COMMUNICATION, 2018, 102 : 54 - 67
[3] A Pronunciation Rule-Based Speech Synthesis Technique for Odia Numerals
Panda, Soumya Priyadarsini
Nayak, Ajit Kumar
[J]. COMPUTATIONAL INTELLIGENCE IN DATA MINING, VOL 1, CIDM 2015, 2016, 410 : 483 - 491
[4] Rule-Based Storytelling Text-to-Speech (TTS) Synthesis
Ramli, Izzad
Seman, Noraini
Ardi, Norizah
Jamil, Nursuriati
[J]. 2016 3RD INTERNATIONAL CONFERENCE ON MECHANICS AND MECHATRONICS RESEARCH (ICMMR 2016), 2016, 77
[5] The linguistic basis of a rule-based tagger of Czech
Oliva, K
Hnátková, M
Petkevic, V
Kveton, P
[J]. TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2000, 1902 : 3 - 8
[6] An Optimization of Fundamental Frequency and Length of Syllables for Rule-Based Speech Synthesis
Win, Kyawt Yin
Takara, Tomio
[J]. FUTURE GENERATION INFORMATION TECHNOLOGY, 2010, 6485 : 114 - 124
[7] Myanmar text-to-speech system with rule-based tone synthesis
Win, Kyawt Yin
Takara, Tomio
[J]. ACOUSTICAL SCIENCE AND TECHNOLOGY, 2011, 32 (05) : 174 - 181
[8] TOWARD A RULE-BASED BIOME MODEL
NEILSON, RP
KING, GA
KOERPER, G
[J]. LANDSCAPE ECOLOGY, 1992, 7 (01) : 27 - 43
[9] A Rule-Based Concatenative Approach to Speech Synthesis in Indian Language Text-to-Speech Systems
Panda, Soumya Priyadarsini
Nayak, Ajit Kumar
[J]. INTELLIGENT COMPUTING, COMMUNICATION AND DEVICES, 2015, 309 : 523 - 531
[10] A RULE-BASED EXPERT SYSTEM FOR MUSIC PERCEPTION
JONES, JA
MILLER, BO
SCARBOROUGH, DL
[J]. BEHAVIOR RESEARCH METHODS INSTRUMENTS & COMPUTERS, 1988, 20 (02): : 255 - 262

← 1 2 3 4 5 →