Investigation on Mandarin Broadcast News Speech Recognition

被引:0
|
作者
Hwang, Mei-Yuh [1 ]
Lei, Xin [1 ]
Wang, Wen [2 ]
Shinozaki, Takahiro [1 ]
机构
[1] Univ Washington, Dept Elect Engn, Seattle, WA 98195 USA
[2] SRI Int, Menlo Pk, CA 94025 USA
关键词
Mandarin speech recognition; character error rate; pitch smoothing; word segmentation;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper describes our efforts in building a competitive Mandarin broadcast news speech recognizer. We successfully incorporated the most popular speech technologies into our system. More importantly, we present two novel algorithms in smoothing pitch features and segmenting Chinese characters into word units. Additionally, we propose to borrow the principle of pointwise mutual information for creating a Chinese word lexicon automatically. Our final system achieved 6.0% character error rate (CER) on dev04 and 16.0% on eval04, with simpler acoustic models, less training data, and simpler decoding architecture compared with other state-of-the-art systems, yet was equally competitive.
引用
收藏
页码:1233 / +
页数:2
相关论文
共 50 条
  • [1] A study on Mandarin broadcast news speech recognition
    Chen, CL
    Wang, YR
    Chen, SH
    [J]. 2004 International Symposium on Chinese Spoken Language Processing, Proceedings, 2004, : 257 - 260
  • [2] Multifactor Adaptation for Mandarin Broadcast News and Conversation Speech Recognition
    Wang, Wen
    Mandal, Arindam
    Lei, Xin
    Stolcke, Andreas
    Zheng, Jing
    [J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2099 - 2102
  • [3] Improved Tone Modeling for Mandarin Broadcast News Speech Recognition
    Lei, Xin
    Siu, Manhung
    Hwang, Mei-Yuh
    Ostendorf, Mari
    Lee, Tan
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1237 - +
  • [4] Advances in Mandarin Broadcast Speech Recognition
    Hwang, Mei-Yuh
    Wang, Wen
    Lei, Xin
    Zheng, Jing
    Cetin, Ozgur
    Peng, Gang
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2876 - +
  • [5] Voice retrieval of Mandarin broadcast news speech
    Chen, B
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2006, 20 (01) : 91 - 109
  • [6] DATA-DRIVEN LEXICON EXPANSION FOR MANDARIN BROADCAST NEWS AND CONVERSATION SPEECH RECOGNITION
    Lei, Xin
    Wang, Wen
    Stolcke, Andreas
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4329 - 4332
  • [7] Connectionist speech recognition of Broadcast News
    Robinson, AJ
    Cook, GD
    Ellis, DPW
    Fosler-Lussier, E
    Renals, SJ
    Williams, DAG
    [J]. SPEECH COMMUNICATION, 2002, 37 (1-2) : 27 - 45
  • [8] Speech recognition for Turkish broadcast news
    Arisoy, Ebru
    Saraclar, Murat
    [J]. 2007 IEEE 15TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS, VOLS 1-3, 2007, : 1054 - 1057
  • [9] F0 declination in English and Mandarin Broadcast News Speech
    Yuan, Jiahong
    Liberman, Mark
    [J]. SPEECH COMMUNICATION, 2014, 65 : 67 - 74
  • [10] F0 Declination in English and Mandarin Broadcast News Speech
    Yuan, Jiahong
    Liberman, Mark
    [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 134 - 137