A Bayesian framework for fusing multiple word knowledge models in videotext recognition

Authors
Zhang, DQ [1]
Chang, SF [1]
Affiliation
[1] Columbia Univ, Dept Elect Engn, New York, NY 10027 USA
Keywords
videotext recognition; video OCR; video indexing; information fusing; multimodal recognition;
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Videotext recognition is challenging because of low resolution, diverse fonts and styles, and cluttered backgrounds. Past methods improved recognition through multiple-frame averaging, image interpolation, and lexicon correction, but recognition using multi-modality language models has not been explored. In this paper, we present a formal Bayesian framework for videotext recognition that combines multiple knowledge sources using mixture models, and we describe a learning approach based on Expectation-Maximization (EM). To handle unseen words, we also present a back-off smoothing approach derived from the Bayesian model. We implemented a prototype that fuses a model derived from closed captions with one derived from the British National Corpus. The closed-caption model is based on a unique model of the distribution of time distance between videotext words and closed-caption words. Our method achieves a significant performance gain, with a word recognition rate of 76.8% and a character recognition rate of 86.7%. A proposed post-processing method also improves videotext detection significantly, with precision of 91.8% and recall of 95.6%.
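The fusion idea in the abstract can be illustrated with a small sketch. It is not the authors' implementation: the function names, the toy word lists, and the fixed probability floor (a crude stand-in for the back-off smoothing the paper derives) are all assumptions made for illustration. The sketch estimates mixture weights over several word-prior models with EM, then picks the word that maximizes the OCR likelihood times the mixed prior.

```python
from typing import Dict, List, Sequence

WordModel = Dict[str, float]  # hypothetical representation: word -> prior probability


def em_mixture_weights(models: Sequence[WordModel], words: Sequence[str],
                       iters: int = 50, floor: float = 1e-9) -> List[float]:
    """Estimate mixture weights for fixed component word models via EM.

    `floor` is an assumed constant probability for words a model has not seen;
    it is NOT the back-off scheme described in the paper.
    """
    k = len(models)
    weights = [1.0 / k] * k
    for _ in range(iters):
        counts = [0.0] * k
        for w in words:
            # E-step: responsibility of each component for this word
            joint = [weights[j] * models[j].get(w, floor) for j in range(k)]
            total = sum(joint)
            for j in range(k):
                counts[j] += joint[j] / total
        # M-step: new weight is the average responsibility
        weights = [c / len(words) for c in counts]
    return weights


def mixture_prior(models: Sequence[WordModel], weights: Sequence[float],
                  word: str, floor: float = 1e-9) -> float:
    """Word prior under the weighted mixture of component models."""
    return sum(wt * m.get(word, floor) for wt, m in zip(weights, models))


def recognize(ocr_likelihood: Dict[str, float], models: Sequence[WordModel],
              weights: Sequence[float]) -> str:
    """Return the candidate maximizing OCR likelihood times mixture prior."""
    return max(ocr_likelihood,
               key=lambda w: ocr_likelihood[w] * mixture_prior(models, weights, w))


# Toy data (invented for illustration only)
caption_model = {"clinton": 0.5, "senate": 0.5}
corpus_model = {"the": 0.6, "clinton": 0.1, "senate": 0.1, "house": 0.2}
training_words = ["clinton", "senate", "clinton", "the"]

weights = em_mixture_weights([caption_model, corpus_model], training_words)
best = recognize({"clinton": 0.4, "clinion": 0.6},
                 [caption_model, corpus_model], weights)
```

In this toy setup the word prior overrides a slightly higher OCR score for a misspelled candidate, which is the kind of correction a fused language model is meant to provide. A real system would replace the fixed floor with proper smoothing and would condition the closed-caption model on time distance, as the abstract describes.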
Pages: 528-533
Page count: 6