A Bayesian framework for fusing multiple word knowledge models in videotext recognition

被引:0
|
作者
Zhang, DQ [1 ]
Chang, SF [1 ]
机构
[1] Columbia Univ, Dept Elect Engn, New York, NY 10027 USA
关键词
videotext recognition; video OCR; video indexing; information fusing; multimodal recognition;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Videotext recognition is challenging due to low resolution, diverse fonts/styles, and cluttered background. Past methods enhanced recognition by using multiple frame averaging, image interpolation and lexicon correction, but recognition using multi-modality language models has not been explored. In this paper, we present a formal Bayesian framework for videotext recognition by combining multiple knowledge using mixture models, and describe a learning approach based on Expectation-Maximization (EM). In order to handle unseen words, a back-off smoothing approach derived from the Bayesian model is also presented. We exploited a prototype that fuses the model from closed caption and that from the British National Corpus. The model from closed caption is based on a unique time distance distribution model of videotext words and closed caption words. Our method achieves a significant performance gain, with word recognition rate of 76.8% and character recognition rate of 86.7%. A proposed post processing method also improves videotext detection significantly, with precision at 91.8% and recall at 95.6%.
引用
收藏
页码:528 / 533
页数:6
相关论文
共 50 条
  • [31] Adaptive Bayesian Recognition with Multiple Evidences
    Naguib, Ahmed M.
    Lee, Sukhan
    2014 INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS (ICMCS), 2014, : 337 - 344
  • [32] Probabilistic Object Recognition and Pose Estimation by Fusing Multiple Algorithms
    Lutz, Matthias
    Stampfer, Dennis
    Schlegel, Christian
    2013 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2013, : 4244 - 4249
  • [33] Fusing multiple methods for discovering implicit knowledge in biomedical literature
    Chen, Ran
    Lin, Hongfei
    Yang, Zhihao
    Journal of Information and Computational Science, 2009, 6 (03): : 1615 - 1625
  • [34] Fusing Multiple Features for Depth-Based Action Recognition
    Zhu, Yu
    Chen, Wenbin
    Guo, Guodong
    ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2015, 6 (02)
  • [35] A Comprehensive Bayesian Framework for Envelope Models
    Chakraborty, Saptarshi
    Su, Zhihua
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2024, 119 (547) : 2129 - 2139
  • [36] A variational Bayesian framework for graphical models
    Attias, H
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 12, 2000, 12 : 209 - 215
  • [37] Exploring Features in a Bayesian Framework for Material Recognition
    Liu, Ce
    Sharan, Lavanya
    Adelson, Edward H.
    Rosenholtz, Ruth
    2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 239 - 246
  • [38] Accurate Ego-Lane Recognition utilizing Multiple Road Characteristics in a Bayesian Network Framework
    Lee, Soomok
    Kim, Seong-Woo
    Seo, Seung-Woo
    2015 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2015, : 543 - 548
  • [39] Recognition of Parathyroid Nodule by Fusing Prior Knowledge Features in Ultrasound Image
    Mao L.
    Zhao L.-Q.
    Yu M.-A.
    Wei Y.
    Wang Y.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2021, 49 (05): : 944 - 952
  • [40] A two-stage framework for pig disease knowledge graph fusing
    Jiang, Tingting
    Zhang, Zhiyi
    Hu, Shunxin
    Yang, Shuai
    He, Jin
    Wang, Chao
    Gu, Lichuan
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2025, 229