A Bayesian framework for fusing multiple word knowledge models in videotext recognition

被引：0

作者：

Zhang, DQ ^{[1
]}

Chang, SF ^{[1
]}

机构：

[1] Columbia Univ, Dept Elect Engn, New York, NY 10027 USA

来源：

2003 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL II, PROCEEDINGS | 2003年

关键词：

videotext recognition; video OCR; video indexing; information fusing; multimodal recognition;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Videotext recognition is challenging due to low resolution, diverse fonts/styles, and cluttered background. Past methods enhanced recognition by using multiple frame averaging, image interpolation and lexicon correction, but recognition using multi-modality language models has not been explored. In this paper, we present a formal Bayesian framework for videotext recognition by combining multiple knowledge using mixture models, and describe a learning approach based on Expectation-Maximization (EM). In order to handle unseen words, a back-off smoothing approach derived from the Bayesian model is also presented. We exploited a prototype that fuses the model from closed caption and that from the British National Corpus. The model from closed caption is based on a unique time distance distribution model of videotext words and closed caption words. Our method achieves a significant performance gain, with word recognition rate of 76.8% and character recognition rate of 86.7%. A proposed post processing method also improves videotext detection significantly, with precision at 91.8% and recall at 95.6%.

引用

页码：528 / 533

页数：6

共 50 条

[31] Adaptive Bayesian Recognition with Multiple Evidences
Naguib, Ahmed M.
Lee, Sukhan
2014 INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS (ICMCS), 2014, : 337 - 344
[32] Probabilistic Object Recognition and Pose Estimation by Fusing Multiple Algorithms
Lutz, Matthias
Stampfer, Dennis
Schlegel, Christian
2013 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2013, : 4244 - 4249
[33] Fusing multiple methods for discovering implicit knowledge in biomedical literature
Chen, Ran
Lin, Hongfei
Yang, Zhihao
Journal of Information and Computational Science, 2009, 6 (03): : 1615 - 1625
[34] Fusing Multiple Features for Depth-Based Action Recognition
Zhu, Yu
Chen, Wenbin
Guo, Guodong
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2015, 6 (02)
[35] A Comprehensive Bayesian Framework for Envelope Models
Chakraborty, Saptarshi
Su, Zhihua
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2024, 119 (547) : 2129 - 2139
[36] A variational Bayesian framework for graphical models
Attias, H
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 12, 2000, 12 : 209 - 215
[37] Exploring Features in a Bayesian Framework for Material Recognition
Liu, Ce
Sharan, Lavanya
Adelson, Edward H.
Rosenholtz, Ruth
2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 239 - 246
[38] Accurate Ego-Lane Recognition utilizing Multiple Road Characteristics in a Bayesian Network Framework
Lee, Soomok
Kim, Seong-Woo
Seo, Seung-Woo
2015 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2015, : 543 - 548
[39] Recognition of Parathyroid Nodule by Fusing Prior Knowledge Features in Ultrasound Image
Mao L.
Zhao L.-Q.
Yu M.-A.
Wei Y.
Wang Y.
Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2021, 49 (05): : 944 - 952
[40] A two-stage framework for pig disease knowledge graph fusing
Jiang, Tingting
Zhang, Zhiyi
Hu, Shunxin
Yang, Shuai
He, Jin
Wang, Chao
Gu, Lichuan
COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2025, 229

← 1 2 3 4 5 →