A STUDY ON CROSS-LANGUAGE KNOWLEDGE INTEGRATION IN MANDARIN LVCSR

被引:0
|
作者
Chiang, Chen-Yu [1 ,4 ]
Siniscalchi, Sabato Marco [2 ]
Wang, Yih-Ru [1 ]
Chen, Sin-Horng [1 ]
Lee, Chin-Hui [3 ]
机构
[1] Natl Chiao Tung Univ, Dept Elect Engn, Hsinchu, Taiwan
[2] Kore Univ Enna, Dept Telemat, Sicily, Italy
[3] Georgia Inst Technol, Sch Elect & Comp Engn, Atlanta, GA USA
[4] Natl Taipei Univ, Dept Commun Engn, New Taipei, Taiwan
关键词
LVCSR; prosody modeling; attribute detector; knowledge integration; RECOGNITION; PROSODY;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a cross-language knowledge integration framework to improve the performance in large vocabulary continuous speech recognition. Two types of knowledge sources, manner attribute and prosodic structure, are incorporated. For manner of articulation, cross-lingual attribute detectors trained with an American English corpus (WSJ0) are utilized to verify and rescore hypothesized Mandarin syllables in word lattices obtained with state-of-the-art systems. For the prosodic structure, models trained with an unsupervised joint prosody labeling and modeling technique using a Mandarin corpus (TCC300) are used in lattice rescoring. Experimental results on Mandarin syllable, character and word recognition with the TCC300 corpus show that the proposed approach significantly outperforms the baseline system that does not use articulatory and prosodic information. It also demonstrates a potential of utilizing results from cross-lingual attribute detectors as a language-universal frontend for automatic speech recognition.
引用
收藏
页码:315 / 319
页数:5
相关论文
共 50 条
  • [21] A CROSS-LANGUAGE STUDY OF VOWEL SPACES AND INTERFERENCE
    JONGMAN, A
    FOURAKIS, M
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1987, 81 : S66 - S66
  • [22] Glottalization in inventory construction: A cross-language study
    Ding, H
    Jokisch, O
    Hoffmann, R
    2004 International Symposium on Chinese Spoken Language Processing, Proceedings, 2004, : 37 - 40
  • [23] CROSS-LANGUAGE PSYCHOLINGUISTICS
    CUTLER, A
    LINGUISTICS, 1985, 23 (05) : 659 - 667
  • [24] CROSS-LANGUAGE STUDY OF SPEECH PATTERN LEARNING
    SIMON, C
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1977, 61 : S64 - S64
  • [25] Language and cognition: A cross-language perspective
    Chen, HC
    INTERNATIONAL JOURNAL OF PSYCHOLOGY, 2004, 39 (5-6) : 148 - 148
  • [26] Musical Rhythm Perception and Production, Phonological Awareness, and Vocabulary Knowledge in Preschoolers: A Cross-Language Study
    Chih-Hsuan Tsao
    Ya-Hsin Lai
    Yu-Ling Chen
    Hsiao-Lan Sharon Wang
    International Journal of Early Childhood, 2023, 55 : 27 - 46
  • [27] Musical Rhythm Perception and Production, Phonological Awareness, and Vocabulary Knowledge in Preschoolers: A Cross-Language Study
    Tsao, Chih-Hsuan
    Lai, Ya-Hsin
    Chen, Yu-Ling
    Wang, Hsiao-Lan Sharon
    INTERNATIONAL JOURNAL OF EARLY CHILDHOOD, 2023, 55 (01) : 27 - 46
  • [28] CROSS-LANGUAGE KNOWLEDGE SHARING MODEL BASED ON ONTOLOGIES AND LOGICAL INFERENCE
    Guo, Weisen
    Kraines, Steven B.
    MANAGING KNOWLEDGE FOR GLOBAL AND COLLABORATIVE INNOVATIONS, 2010, 8 : 207 - 219
  • [29] Cross-language latent relational search: Mapping knowledge across languages
    University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo 113-8656, Japan
    Proc Natl Conf Artif Intell, (1237-1242):
  • [30] Knowledge acquisition from comparable corpora for cross-language information retrieval
    Fatiha, Sadat
    INFORMATION MANAGEMENT IN THE MODERN ORGANIZATIONS: TRENDS & SOLUTIONS, VOLS 1 AND 2, 2008, : 745 - 747