A STUDY ON CROSS-LANGUAGE KNOWLEDGE INTEGRATION IN MANDARIN LVCSR

Cited by: 0
Authors
Chiang, Chen-Yu [1, 4]
Siniscalchi, Sabato Marco [2]
Wang, Yih-Ru [1]
Chen, Sin-Horng [1]
Lee, Chin-Hui [3]
Affiliations
[1] Natl Chiao Tung Univ, Dept Elect Engn, Hsinchu, Taiwan
[2] Kore Univ Enna, Dept Telemat, Sicily, Italy
[3] Georgia Inst Technol, Sch Elect & Comp Engn, Atlanta, GA USA
[4] Natl Taipei Univ, Dept Commun Engn, New Taipei, Taiwan
Keywords
LVCSR; prosody modeling; attribute detector; knowledge integration; RECOGNITION; PROSODY
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We present a cross-language knowledge integration framework to improve performance in large vocabulary continuous speech recognition (LVCSR). Two types of knowledge sources, manner attributes and prosodic structure, are incorporated. For manner of articulation, cross-lingual attribute detectors trained on an American English corpus (WSJ0) are used to verify and rescore hypothesized Mandarin syllables in word lattices obtained with state-of-the-art systems. For prosodic structure, models trained with an unsupervised joint prosody labeling and modeling technique on a Mandarin corpus (TCC300) are used in lattice rescoring. Experimental results on Mandarin syllable, character, and word recognition with the TCC300 corpus show that the proposed approach significantly outperforms a baseline system that does not use articulatory and prosodic information. The results also demonstrate the potential of using cross-lingual attribute detectors as a language-universal front end for automatic speech recognition.
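The rescoring idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a flat list of competing hypotheses instead of a full word lattice, a simple log-linear score combination, and hypothetical names (`Hypothesis`, `rescore`, `w_attr`, `w_pros`) and score values chosen only for illustration.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Hypothesis:
    """One competing hypothesis (hypothetical structure, not from the paper)."""
    syllables: List[str]
    baseline: float   # log score from the baseline recognizer
    attribute: float  # log score from manner-attribute verification (assumed)
    prosody: float    # log score from the prosodic model (assumed)


def rescore(hyps: List[Hypothesis], w_attr: float = 0.5,
            w_pros: float = 0.5) -> List[Hypothesis]:
    """Sort hypotheses by a log-linear combination of scores, best first.

    The weights w_attr and w_pros are illustrative; in practice such
    weights would be tuned on held-out data.
    """
    def combined(h: Hypothesis) -> float:
        return h.baseline + w_attr * h.attribute + w_pros * h.prosody

    return sorted(hyps, key=combined, reverse=True)


# Made-up scores: the baseline alone prefers hyp_a, but the extra
# knowledge sources favor hyp_b after rescoring.
hyp_a = Hypothesis(["ma1", "ma5"], baseline=-10.0, attribute=-8.0, prosody=-6.0)
hyp_b = Hypothesis(["ma3", "ma5"], baseline=-11.0, attribute=-2.0, prosody=-2.0)
best = rescore([hyp_a, hyp_b])[0]
```

With both weights set to zero the ranking reduces to the baseline ordering, which makes the contribution of each knowledge source easy to isolate in such a sketch.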
Pages: 315-319
Page count: 5
Related papers
50 in total
  • [31] A study on speech feature extraction and application in Mandarin LVCSR
    Wang, An-Na
    Wang, Qin-Wan
    Tao, Ran
    Yuan, Wen-Jing
    Liu, Jun-Fang
    2007 INTERNATIONAL CONFERENCE ON WAVELET ANALYSIS AND PATTERN RECOGNITION, VOLS 1-4, PROCEEDINGS, 2007, : 1072 - +
  • [32] ATTRIBUTE BASED SHARED HIDDEN LAYERS FOR CROSS-LANGUAGE KNOWLEDGE TRANSFER
    Arora, Vipul
    Lahiri, Aditi
    Reetz, Henning
    2016 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2016), 2016, : 617 - 623
  • [33] A cross-language study on citation practice in PhD theses
    Soler-Monreal, Carmen
    Gil-Salom, Luz
    INTERNATIONAL JOURNAL OF ENGLISH STUDIES, 2011, 11 (02): : 53 - 75
  • [34] NOVEL-WORD PRONUNCIATION - A CROSS-LANGUAGE STUDY
    SULLIVAN, KPH
    DAMPER, RI
    SPEECH COMMUNICATION, 1993, 13 (3-4) : 441 - 452
  • [35] CROSS-LANGUAGE STUDY OF SPEECH-PATTERN LEARNING
    SIMON, C
    FOURCIN, AJ
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1978, 63 (03): : 925 - 935
  • [36] CROSS-LANGUAGE STUDY OF PERCEPTION OF THE ORAL NASAL DISTINCTION
    BEDDOR, PS
    STRANGE, W
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1982, 71 (06): : 1551 - 1561
  • [37] ACCOR - INSTRUMENTATION AND DATABASE FOR THE CROSS-LANGUAGE STUDY OF COARTICULATION
    MARCHAL, A
    HARDCASTLE, WJ
    LANGUAGE AND SPEECH, 1993, 36 : 137 - 153
  • [38] Cross-Language Study of Vocal Correlates of Affective States
    Yanushevskaya, Irena
    Chasaide, Ailbhe Ni
    Gobl, Christer
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 330 - 333
  • [39] A Cross-Language Study of Perception of Lexical Stress in English
    Yu, Vickie Y.
    Andruski, Jean E.
    JOURNAL OF PSYCHOLINGUISTIC RESEARCH, 2010, 39 (04) : 323 - 344
  • [40] A Multilingual Study of Compressive Cross-Language Text Summarization
    Pontes, Elvys Linhares
    Huet, Stephane
    Torres-Moreno, Juan-Manuel
    ADVANCES IN COMPUTATIONAL INTELLIGENCE, MICAI 2018, PT II, 2018, 11289 : 109 - 118