SPEAKER-INDEPENDENT CLASSIFICATION OF PHONETIC SEGMENTS FROM RAW ULTRASOUND IN CHILD SPEECH

被引:0
|
作者
Ribeiro, Manuel Sam [1 ]
Eshky, Aciel [1 ]
Richmond, Korin [1 ]
Renals, Steve [1 ]
机构
[1] Univ Edinburgh, Ctr Speech Technol Res, Edinburgh, Midlothian, Scotland
基金
英国工程与自然科学研究理事会;
关键词
ultrasound; ultrasound tongue imaging; speaker-independent; speech therapy; child speech;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Ultrasound tongue imaging (UTI) provides a convenient way to visualize the vocal tract during speech production. UTI is increasingly being used for speech therapy, making it important to develop automatic methods to assist various time-consuming manual tasks currently performed by speech therapists. A key challenge is to generalize the automatic processing of ultrasound tongue images to previously unseen speakers. In this work, we investigate the classification of phonetic segments (tongue shapes) from raw ultrasound recordings under several training scenarios: speaker-dependent, multi-speaker, speaker-independent, and speaker-adapted. We observe that models underperform when applied to data from speakers not seen at training time. However, when provided with minimal additional speaker information, such as the mean ultrasound frame, the models generalize better to unseen speakers.
引用
收藏
页码:1328 / 1332
页数:5
相关论文
共 50 条
  • [1] Speaker adaptation techniques for speech recognition with a speaker-independent phonetic recognizer
    Kim, WG
    Jang, M
    [J]. COMPUTATIONAL INTELLIGENCE AND SECURITY, PT 1, PROCEEDINGS, 2005, 3801 : 95 - 100
  • [2] Acoustic-phonetic speech parameters for speaker-independent speech recognition
    Deshmukh, O
    Espy-Wilson, CY
    Juneja, A
    [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 593 - 596
  • [3] SPEAKER-INDEPENDENT DETECTION OF CHILD-DIRECTED SPEECH
    Schuster, Sebastian
    Pancoast, Stephanie
    Ganjoo, Milind
    Frank, Michael C.
    Jurafsky, Dan
    [J]. 2014 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY SLT 2014, 2014, : 366 - 371
  • [4] Zebra finches exhibit speaker-independent phonetic perception of human speech
    Ohms, Verena R.
    Gill, Arike
    Van Heijningen, Caroline A. A.
    Beckers, Gabriel J. L.
    ten Cate, Carel
    [J]. PROCEEDINGS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2010, 277 (1684) : 1003 - 1009
  • [5] Iterative training techniques for phonetic template based speech recognition with a speaker-independent phonetic recognizer
    Kim, WG
    Jang, M
    Lee, CH
    [J]. AI 2005: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2005, 3809 : 577 - 584
  • [6] An Acoustic-Phonetic-Based Speaker Adaptation Technique for Improving Speaker-Independent Continuous Speech Recognition
    Zhao, Yunxin
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (03): : 380 - 394
  • [7] Phonetic knowledge embedded in a context sensitive MLP for French speaker-independent speech recognition
    Djezzar, L
    Pican, N
    [J]. SPEECH COMMUNICATION, 1997, 21 (03) : 155 - 167
  • [8] SPEAKER-INDEPENDENT CONTINUOUS SPEECH DICTATION
    GAUVAIN, JL
    LAMEL, LF
    ADDA, G
    ADDADECKER, M
    [J]. SPEECH COMMUNICATION, 1994, 15 (1-2) : 21 - 37
  • [9] On Speaker-Independent Personality Perception and Prediction from Speech
    Polzehl, Tim
    Schoenenberg, Katrin
    Moeller, Sebastian
    Metze, Florian
    Mohammadi, Gelareh
    Vinciarelli, Alessandro
    [J]. 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 258 - 261
  • [10] The study on continuous speech of speaker-independent
    Ye Hong
    [J]. CHINESE JOURNAL OF ELECTRONICS, 2006, 15 (4A) : 921 - 924