A Bayesian framework for fusing multiple word knowledge models in videotext recognition

被引：0

作者：

Zhang, DQ ^{[1
]}

Chang, SF ^{[1
]}

机构：

[1] Columbia Univ, Dept Elect Engn, New York, NY 10027 USA

来源：

2003 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL II, PROCEEDINGS | 2003年

关键词：

videotext recognition; video OCR; video indexing; information fusing; multimodal recognition;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Videotext recognition is challenging due to low resolution, diverse fonts/styles, and cluttered background. Past methods enhanced recognition by using multiple frame averaging, image interpolation and lexicon correction, but recognition using multi-modality language models has not been explored. In this paper, we present a formal Bayesian framework for videotext recognition by combining multiple knowledge using mixture models, and describe a learning approach based on Expectation-Maximization (EM). In order to handle unseen words, a back-off smoothing approach derived from the Bayesian model is also presented. We exploited a prototype that fuses the model from closed caption and that from the British National Corpus. The model from closed caption is based on a unique time distance distribution model of videotext words and closed caption words. Our method achieves a significant performance gain, with word recognition rate of 76.8% and character recognition rate of 86.7%. A proposed post processing method also improves videotext detection significantly, with precision at 91.8% and recall at 95.6%.

引用

页码：528 / 533

页数：6

共 50 条

[41] A Bayesian knowledge engineering framework for service management
Wang, Wei
Wang, Hao
Yang, Bo
Lu, Liang
Liu, Peini
Zeng, Guosun
2008 IEEE NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM, VOLS 1 AND 2, 2008, : 771 - +
[42] Knowledge-Aided GMTI in a Bayesian Framework
Riedl, Michael
Potter, Lee C.
ALGORITHMS FOR SYNTHETIC APERTURE RADAR IMAGERY XXII, 2015, 9475
[43] Knowledge-Aided GMTI in a Bayesian Framework
Riedl, Michael
Potter, Lee C.
2015 IEEE INTERNATIONAL RADAR CONFERENCE (RADARCON), 2015, : 1240 - 1243
[44] Knowledge-aided GMTI in a Bayesian framework
Department of Electrical and Computer Engineering, Ohio State University, Columbus
OH, United States
Proc SPIE Int Soc Opt Eng,
[45] Arabic Handwritten Word Recognition based on Dynamic Bayesian Network
Jayech, Khaoula
Mahjoub, Mohamed Ali
Ben Amara, Najoua
INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2016, 13 (6B) : 1024 - 1031
[46] Dynamic Hierarchical Bayesian Network for Arabic Handwritten Word Recognition
Jayech, Khaoula
Trimech, Nesrine
Mahjoub, Mohamed Ali
Ben Amara, Najoua Essoukri
2013 FOURTH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY AND ACCESSIBILITY (ICTA), 2013,
[47] BAYESIAN FRAME INTERPOLATION BY FUSING MULTIPLE MOTION-COMPENSATED PREDICTION FRAMES
Liu, Hongbin
Xiong, Ruiqin
Ma, Siwei
Zhao, Debin
Gao, Wen
2011 18TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2011, : 1173 - 1176
[48] Learning Sentiment-Enhanced Word Representations by Fusing External Hybrid Sentiment Knowledge
Li, You
Lin, Zhizhou
Lin, Yuming
Yin, Jinhui
Chang, Liang
COGNITIVE COMPUTATION, 2023, 15 (06) : 1973 - 1987
[49] SPEAKER-INDEPENDENT ISOLATED WORD RECOGNITION USING MULTIPLE HIDDEN MARKOV-MODELS
ZHANG, Y
DESILVA, CJS
TOGNERI, R
ALDER, M
ATTIKIOUZEL, Y
IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING, 1994, 141 (03): : 197 - 202
[50] Learning Sentiment-Enhanced Word Representations by Fusing External Hybrid Sentiment Knowledge
You Li
Zhizhou Lin
Yuming Lin
Jinhui Yin
Liang Chang
Cognitive Computation, 2023, 15 : 1973 - 1987

← 1 2 3 4 5 →