A STUDY ON CROSS-LANGUAGE KNOWLEDGE INTEGRATION IN MANDARIN LVCSR

被引:0
|
作者
Chiang, Chen-Yu [1 ,4 ]
Siniscalchi, Sabato Marco [2 ]
Wang, Yih-Ru [1 ]
Chen, Sin-Horng [1 ]
Lee, Chin-Hui [3 ]
机构
[1] Natl Chiao Tung Univ, Dept Elect Engn, Hsinchu, Taiwan
[2] Kore Univ Enna, Dept Telemat, Sicily, Italy
[3] Georgia Inst Technol, Sch Elect & Comp Engn, Atlanta, GA USA
[4] Natl Taipei Univ, Dept Commun Engn, New Taipei, Taiwan
关键词
LVCSR; prosody modeling; attribute detector; knowledge integration; RECOGNITION; PROSODY;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a cross-language knowledge integration framework to improve the performance in large vocabulary continuous speech recognition. Two types of knowledge sources, manner attribute and prosodic structure, are incorporated. For manner of articulation, cross-lingual attribute detectors trained with an American English corpus (WSJ0) are utilized to verify and rescore hypothesized Mandarin syllables in word lattices obtained with state-of-the-art systems. For the prosodic structure, models trained with an unsupervised joint prosody labeling and modeling technique using a Mandarin corpus (TCC300) are used in lattice rescoring. Experimental results on Mandarin syllable, character and word recognition with the TCC300 corpus show that the proposed approach significantly outperforms the baseline system that does not use articulatory and prosodic information. It also demonstrates a potential of utilizing results from cross-lingual attribute detectors as a language-universal frontend for automatic speech recognition.
引用
收藏
页码:315 / 319
页数:5
相关论文
共 50 条
  • [1] A Cross-Language Study of Stop Aspiration: English and Mandarin Chinese
    Chen, Li-mei
    Chao, Kuan-Yi
    Peng, Jui-Feng
    Yang, Jing-Chen
    ISM: 2008 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA, 2008, : 556 - 561
  • [2] A cross-language fMRI study of sentence-level prosody in Mandarin
    Gandour, J
    Tong, YX
    Talavage, T
    Wong, D
    Dzemidzic, M
    Xu, YS
    Lowe, M
    BRAIN AND LANGUAGE, 2005, 95 (01) : 54 - 55
  • [3] Cross-Language Perception of Lexical Tones by Nordic Learners of Mandarin Chinese
    Gao, Man
    LANGUAGES, 2024, 9 (02)
  • [4] A systematic study of knowledge graph analysis for cross-language plagiarism detection
    Franco-Salvador, Marc
    Rosso, Paolo
    Montes-y-Gomez, Manuel
    INFORMATION PROCESSING & MANAGEMENT, 2016, 52 (04) : 550 - 570
  • [5] SEMCL: A Cross-Language Semantic Model for Knowledge Sharing
    Guo, Weisen
    Kraines, Steven B.
    INTERNATIONAL JOURNAL OF KNOWLEDGE AND SYSTEMS SCIENCE, 2010, 1 (03) : 1 - 19
  • [6] CROSS-LANGUAGE STUDY OF PERCEPTUAL ASYMMETRY
    MYERS, TF
    WOLF, JJ
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1972, 52 (01): : 112 - &
  • [7] Study on cross-language information retrieval
    Si, Shen
    PROCEEDINGS OF 2008 INTERNATIONAL PRE-OLYMPIC CONGRESS ON COMPUTER SCIENCE, VOL I: COMPUTER SCIENCE AND ENGINEERING, 2008, : 6 - 10
  • [8] A cross-language state mapping approach to bilingual (Mandarin-English) TTS
    Liang, Hui
    Qian, Yao
    Soong, Frank K.
    Liu, Gongshen
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4641 - +
  • [9] Dynamic Range for Speech Materials in Korean, English, and Mandarin: A Cross-Language Comparison
    Jin, In-Ki
    Kates, James M.
    Arehart, Kathryn H.
    JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 2014, 57 (05): : 2024 - 2030
  • [10] Knowledge Integration for Improving Performance in LVCSR
    Chiang, Chen-Yu
    Siniscalchi, Sabato Marco
    Chen, Sin-Horng
    Lee, Chin-Hui
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 1785 - 1789