HMM-based Sliding Video Text Recognition for Turkish Broadcast News

被引:0
|
作者
Som, Temucin [1 ]
Can, Dogan [1 ]
Saraclar, Murat [1 ]
机构
[1] Bogazici Univ, Dept Elect & Elect Engn, Istanbul, Turkey
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we develop an HMM-based sliding video text recognizer and present our results on Turkish broadcast news for the hearing impaired. We use well known speech recognition techniques to model and recognize sliding video text characters using a minimal amount of labeled data. Baseline system without any language modeling gives a word error rate of 2.2% on 138 minutes of test data. We then provide an analysis of character errors and employ a character-based language model to correct most of them. Finally we decrease the amount of training data to a quarter, split the test data into halves and investigate semi-supervised training. Word error rates after semi-supervised training are significantly lower than to those after baseline training. We see 40% relative reduction in word error rate (1.5 -> 0.9) over the test set.
引用
收藏
页码:474 / 478
页数:5
相关论文
共 50 条
  • [1] Sliding Text Recognition in Broadcast News
    Dikici, Erinc
    Saraclar, Murat
    [J]. 2008 IEEE 16TH SIGNAL PROCESSING, COMMUNICATION AND APPLICATIONS CONFERENCE, VOLS 1 AND 2, 2008, : 705 - 708
  • [2] An HMM-based text segmentation method using variational Bayes approach and its application to LVCSR for broadcast news
    Koshinaka, T
    Iso, K
    Okumura, A
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 485 - 488
  • [3] An evaluation of HMM-based Techniques for the Recognition of Screen Rendered Text
    Rashid, Sheikh Faisal
    Shafait, Faisal
    Breuel, Thomas M.
    [J]. 11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011, : 1260 - 1264
  • [4] HMM-based approach for text region detection in coded video bitstreams
    Nakano, Yutaka
    Kashio, Katsuaki
    Yoshida, Toshiyuki
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP 2006, PROCEEDINGS, 2006, : 3209 - +
  • [5] HMM-based ball hitting event exploration system for broadcast baseball video
    Chen, Hua-Tsung
    Chou, Chien-Li
    Tsai, Wei-Chin
    Lee, Suh-Yin
    Lin, Bao-Shuh P.
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2012, 23 (05) : 767 - 781
  • [6] HMM-based Multi Oriented Text Recognition in Natural Scene Image
    Roy, Sangheeta
    Roy, Partha Pratim
    Shivakumara, Palaiahnakote
    Louloudis, Georgios
    Tan, Chew Lim
    Pal, Umapada
    [J]. 2013 SECOND IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR 2013), 2013, : 288 - 292
  • [7] Speech recognition for Turkish broadcast news
    Arisoy, Ebru
    Saraclar, Murat
    [J]. 2007 IEEE 15TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS, VOLS 1-3, 2007, : 1054 - 1057
  • [8] HMM-based segmentation and recognition of human activities from video sequences
    Niu, F
    Abdel-Mottaleb, M
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), VOLS 1 AND 2, 2005, : 804 - 807
  • [9] An HMM-based speech recognition IC
    Han, W
    Hon, KW
    Chan, CF
    Lee, T
    Choy, CS
    Pun, KP
    Ching, PC
    [J]. PROCEEDINGS OF THE 2003 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL II: COMMUNICATIONS-MULTIMEDIA SYSTEMS & APPLICATIONS, 2003, : 744 - 747
  • [10] An HMM-based framework for video semantic analysis
    Xu, G
    Ma, YF
    Zhang, HJ
    Yang, SQ
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2005, 15 (11) : 1422 - 1433