SPEAKER-INDEPENDENT CLASSIFICATION OF PHONETIC SEGMENTS FROM RAW ULTRASOUND IN CHILD SPEECH

Cited by: 0

Authors:

Ribeiro, Manuel Sam [1]

Eshky, Aciel [1]

Richmond, Korin [1]

Renals, Steve [1]

Affiliations:

[1] Univ Edinburgh, Ctr Speech Technol Res, Edinburgh, Midlothian, Scotland

Source:

2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2019

Funding:

UK Engineering and Physical Sciences Research Council (EPSRC);

Keywords:

ultrasound; ultrasound tongue imaging; speaker-independent; speech therapy; child speech;

DOI:

Not available

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification codes:

070206 ; 082403 ;

Abstract:

Ultrasound tongue imaging (UTI) provides a convenient way to visualize the vocal tract during speech production. UTI is increasingly being used for speech therapy, making it important to develop automatic methods to assist various time-consuming manual tasks currently performed by speech therapists. A key challenge is to generalize the automatic processing of ultrasound tongue images to previously unseen speakers. In this work, we investigate the classification of phonetic segments (tongue shapes) from raw ultrasound recordings under several training scenarios: speaker-dependent, multi-speaker, speaker-independent, and speaker-adapted. We observe that models underperform when applied to data from speakers not seen at training time. However, when provided with minimal additional speaker information, such as the mean ultrasound frame, the models generalize better to unseen speakers.
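
The speaker-adapted scenario mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not say how the mean ultrasound frame is supplied to the model, so stacking it as a second input channel next to each raw frame is an assumption made here for illustration only. The function names and the frame dimensions are hypothetical, and the data is synthetic.

```python
import numpy as np


def speaker_mean_frame(frames):
    """Average all frames of one speaker: (n_frames, H, W) -> (H, W)."""
    return frames.mean(axis=0)


def add_speaker_mean_channel(frames, mean_frame):
    """Pair every raw frame with the speaker's mean frame.

    Assumption: the mean frame is given to the classifier as a second
    input channel. Output shape is (n_frames, 2, H, W).
    """
    # Repeat the (H, W) mean frame once per raw frame without copying.
    tiled = np.broadcast_to(mean_frame, frames.shape)
    return np.stack([frames, tiled], axis=1)


# Synthetic stand-in for one speaker's raw ultrasound frames.
# The size (8 frames of 16 x 32) is arbitrary, not taken from the paper.
rng = np.random.default_rng(0)
frames = rng.random((8, 16, 32)).astype(np.float32)

mean_frame = speaker_mean_frame(frames)
model_input = add_speaker_mean_channel(frames, mean_frame)
```

For an unseen speaker, the mean frame needs only unlabeled frames from that speaker, which is why it counts as minimal additional speaker information.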

Pages: 1328-1332

Number of pages: 5
