Cross-Lingual Speaker Discrimination Using Natural and Synthetic Speech

被引：0

作者：

Wester, Mirjam ^{[1
]}

Liang, Hui ^{[1
]}

机构：

[1] Univ Edinburgh, Ctr Speech Technol Res, Edinburgh EH8 9YL, Midlothian, Scotland

来源：

12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5 | 2011年

关键词：

speaker discrimination; speaker adaptation; HMM-based speech synthesis; ADAPTATION;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper describes speaker discrimination experiments in which native English listeners were presented with natural speech stimuli in English and Mandarin, synthetic speech stimuli in English and Mandarin, or natural Mandarin speech and synthetic English speech stimuli. In each experiment, listeners were asked to judge whether the sentences in a pair were spoken by the same person or not. We found that the results of Mandarin/English speaker discrimination were very similar to those found in previous work on German/English and Finnish/English speaker discrimination. We conclude from this and previous work that listeners are able to discriminate between speakers across languages or across speech types, but the combination of these two factors leads to a speaker discrimination task that is too difficult for listeners to perform successfully, given the fact that the quality of across-language speaker adapted speech synthesis at present still needs to be improved.

引用

下载

页码：2492 / 2495

页数：4

共 50 条

[41] Cross-lingual Text-To-Speech Synthesis via Domain Adaptation and Perceptual Similarity Regression in Speaker Space
Xin, Detai
Saito, Yuki
Takamichi, Shinnosuke
Koriyama, Tomoki
Saruwatari, Hiroshi
INTERSPEECH 2020, 2020, : 2947 - 2951
[42] Exploiting Cross-Lingual Speaker and Phonetic Diversity for Unsupervised Subword Modeling
Feng, Siyuan
Lee, Tan
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (12) : 2000 - 2011
[43] DA-IICT Cross-lingual and Multilingual Corpora for Speaker Recognition
Patil, Hemant A.
Sitaram, Sunayana
Sharma, Esha
ICAPR 2009: SEVENTH INTERNATIONAL CONFERENCE ON ADVANCES IN PATTERN RECOGNITION, PROCEEDINGS, 2009, : 187 - 190
[44] Residual Phase Cepstrum Coefficients with Application to Cross-lingual Speaker Verification
Wang, Jianglin
Johnson, Michael T.
13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1554 - 1557
[45] UNSUPERVISED CROSS-LINGUAL SPEAKER ADAPTATION FOR HMM-BASED SPEECH SYNTHESIS USING TWO-PASS DECISION TREE CONSTRUCTION
Gibson, Matthew
Hirsimaki, Teemu
Karhila, Reima
Kurimo, Mikko
Byrne, William
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4642 - 4645
[46] Analysis of unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis using KLD-based transform mapping
Oura, Keiichiro
Yamagishi, Junichi
Wester, Mirjam
King, Simon
Tokuda, Keiichi
SPEECH COMMUNICATION, 2012, 54 (06) : 703 - 714
[47] Identifying Agreement/Disagreement in Conversational Speech: A Cross-lingual Study
Wang, Wen
Precoda, Kristin
Richey, Colleen
Raymond, Geoffrey
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 3100 - +
[48] CROSS-LINGUAL AND MULTILINGUAL SPEECH EMOTION RECOGNITION ON ENGLISH AND FRENCH
Neumann, Michael
Ngoc Thang Vu
2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5769 - 5773
[49] Cross-lingual Speech Emotion Recognition through Factor Analysis
Desplanques, Brecht
Demuynck, Kris
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3648 - 3652
[50] Cross-Lingual Sentiment Relation Capturing for Cross-Lingual Sentiment Analysis
Chen, Qiang
Li, Wenjie
Lei, Yu
Liu, Xule
Luo, Chuwei
He, Yanxiang
ADVANCES IN INFORMATION RETRIEVAL, ECIR 2017, 2017, 10193 : 54 - 67

← 1 2 3 4 5 →