Language Modeling of Chinese Personal Names Based on Character Units for Continuous Chinese Speech Recognition

被引:0
|
作者
Hu, Xinhui [1 ]
Yamamoto, Hirofumi [1 ]
Kikui, Genichiro
Sagisaka, Yoshinori [1 ]
机构
[1] Natl Inst Informat & Commun Technol, Hikaridai 2-2-2, Seika, Kyoto 6190228, Japan
关键词
Personal Name Identification; Hierarchical language model; Chinese Speech Recognition;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we analyze Chinese personal names to model their statistical phonotactic characteristics for continuous Chinese speech recognition. The analysis showed language-specific characteristics of Chinese personal names and strongly suggested the advantage of character-unit oriented modeling. A hierarchical language model was composed by reflecting statistical phonotactic characteristics of Chinese personal names as a lower intra-word model, and ordinary inter-word neighboring characteristics as an upper multi-class composite N-gram model. These two layers of models were trained independently using different language corpora. For the modeling of given names, the syllable without tone information was selected as the unit for training the bi-gram. The properties of either one or two characters of a given name were introduced to simplify the length constraint of the modeling process. For Chinese family names, we simply added them directly in the recognition lexicon, since their numbers are very restricted. The results from Chinese speech recognition experiments revealed that the proposed hierarchical language model greatly improved the identification accuracy of the Chinese given names compared with the conventional wordclass N-gram model.
引用
收藏
页码:1874 / +
页数:2
相关论文
共 50 条
  • [1] A novel statistical language modeling method for continuous Chinese speech recognition
    Tian, B
    Tian, HX
    Fu, Q
    Yi, KC
    [J]. ICSP '98: 1998 FOURTH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, PROCEEDINGS, VOLS I AND II, 1998, : 734 - 737
  • [2] Recognition of Chinese names in continuous speech for directory assistance applications
    Liao, YF
    Rose, G
    [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 741 - 744
  • [3] Integration of speech and language processing in Chinese continuous speech recognition
    ZHAO Li ZOU Cairong WU Zhenyang(Department of Radio Engineering
    [J]. Chinese Journal of Acoustics, 2002, (04) : 343 - 351
  • [4] Joint-Character-POC N-Gram Language Modeling For Chinese Speech Recognition
    Wang, Bin
    Ou, Zhijian
    Li, Jian
    Kawamura, Akinori
    [J]. 2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 24 - +
  • [5] Modeling context-dependent phonetic units in a continuous speech recognition system for Mandarin Chinese
    Wu, JJX
    Deng, L
    Chan, J
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 2281 - 2284
  • [6] Improved large vocabulary continuous chinese speech recognition by character-based consensus networks
    Fu, Yi-Sheng
    Pan, Yi-Cheng
    Lee, Lin-shan
    [J]. CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2006, 4274 : 422 - +
  • [7] HMM BASED RECOGNITION OF CHINESE TONES IN CONTINUOUS SPEECH
    Zhao Li (Department of Radio Engineering
    [J]. Journal of Electronics(China), 2000, (01) : 9 - 14
  • [8] HMM based recognition of Chinese tones in continuous speech
    Cheng, ML
    Cheng, XM
    Zhao, L
    [J]. PROCEEDINGS OF 2003 INTERNATIONAL CONFERENCE ON NEURAL NETWORKS & SIGNAL PROCESSING, PROCEEDINGS, VOLS 1 AND 2, 2003, : 916 - 919
  • [9] CHARACTER-BASED UNITS FOR UNLIMITED VOCABULARY CONTINUOUS SPEECH RECOGNITION
    Smit, Peter
    Gangireddy, Siva Reddy
    Enarvi, Seppo
    Virpioja, Sami
    Kurimo, Mikko
    [J]. 2017 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2017, : 149 - 156
  • [10] The nature of perceptual units in Chinese character recognition
    Issele, Joanna
    Chetail, Fabienne
    Content, Alain
    [J]. QUARTERLY JOURNAL OF EXPERIMENTAL PSYCHOLOGY, 2022, 75 (08): : 1514 - 1527