Fully automated approach to broadcast news transcription in Czech language

被引:0
|
作者
Nouza, J [1 ]
Zdánsky, J [1 ]
David, P [1 ]
机构
[1] Tech Univ Liberec, SpeechLab, Liberec 46117 1, Czech Republic
来源
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the paper(1) we propose a complete scheme for automatic transcription of Czech TV news. The scheme first removes the music and noisy parts, then makes segmentation of the speech signal into speaker turns and consequently tries to decode and transcribe single utterances. We employ our own recognizer recently operating with a 200K-word lexicon and with a bigram language model. The overall recognition rate achieved on all the test data was 71.53%, that obtained on the read parts was 82.72%. The most serious recognition errors occur mainly in the segments that contain background music or extremely loud noise.
引用
收藏
页码:401 / 408
页数:8
相关论文
共 50 条
  • [1] Czech-to-Slovak Adapted Broadcast News Transcription System
    Nouza, Jan
    Silovsky, Jan
    Zdansky, Jindrich
    Cerva, Petr
    Kroul, Martin
    Chaloupka, Josef
    [J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 2683 - 2686
  • [2] Incremental language modeling for automatic transcription of broadcast news
    Ohtsuki, Katsutoshi
    Nguyen, Long
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2007, E90D (02): : 526 - 532
  • [3] First Broadcast News Transcription System for Khmer Language
    Seng, Sopheap
    Sam, Sethserey
    Besacier, Laurent
    Bigi, Brigitte
    Castelli, Eric
    [J]. SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008, : 2658 - 2661
  • [4] Language Modeling for Automatic Turkish Broadcast News Transcription
    Arisoy, Ebru
    Sak, Hasim
    Saraclar, Murat
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2748 - 2751
  • [5] Broadcast news transcription
    Kubala, F
    Jin, H
    Matsoukas, S
    Nguyen, L
    Schwartz, R
    [J]. 1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 203 - 206
  • [6] Statistical language model adaptation for Mandarin broadcast news transcription
    Chen, B
    Tsai, WH
    Kuo, JW
    [J]. 2004 International Symposium on Chinese Spoken Language Processing, Proceedings, 2004, : 313 - 316
  • [7] Japanese broadcast news transcription
    BBN Technologies, 10 Moulton St., Cambridge
    MA
    02138, United States
    [J]. Int. Conf. Spok. Lang. Process., ICSLP, (1749-1752):
  • [8] Automatic transcription of Broadcast News
    Chen, SS
    Eide, E
    Gales, MJF
    Gopinath, RA
    Kanvesky, D
    Olsen, P
    [J]. SPEECH COMMUNICATION, 2002, 37 (1-2) : 69 - 87
  • [9] Experiments in broadcast news transcription
    Woodland, PC
    Hain, T
    Johnson, SE
    Niesler, TR
    Tuerk, A
    Young, SJ
    [J]. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 909 - 912
  • [10] Online Temporal Language Model Adaptation for a Thai Broadcast News Transcription System
    Saykham, Kwanchiva
    Chotimongkol, Ananlada
    Wutiwiwatchai, Chai
    [J]. LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010, : 1690 - 1694