Improvement of video text recognition by character selection

Cited by: 10

Authors:

Mita, T [1]

Hori, O [1]

Affiliations:

[1] Toshiba Co Ltd, Corp R&D Ctr, Multimedia Lab, Saiwai Ku, Kawasaki, Kanagawa 2128582, Japan

Source:

SIXTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, PROCEEDINGS | 2001

Keywords:

DOI:

10.1109/ICDAR.2001.953954

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This paper proposes a new method for improving the recognition accuracy of video text by exploiting the temporal redundancy of video. The proposed method divides the video into short segments and obtains several recognition results from a number of these segments. The segments have varied backgrounds because the background image changes over time due to camera work or object motion. The recognition results from these diverse backgrounds are integrated into a single text string by selecting the best recognition result for each individual character. The proposed method was tested on a large set of news video sequences. Experimental results show that it increased the number of correct characters by 3.1% and the number of strings free of recognition errors by 8.1%.
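The integration step described in the abstract can be illustrated with a minimal sketch. The abstract does not state how the "best" result for a character is chosen, so this example assumes each recognized character carries a confidence score and that the highest score wins. It also assumes the per-segment results are already aligned position by position. The names `CharResult` and `select_characters` and the sample data are hypothetical and are not taken from the paper.

```python
from typing import List, Tuple

# One recognized character with a hypothetical confidence score in [0, 1].
CharResult = Tuple[str, float]


def select_characters(segment_results: List[List[CharResult]]) -> str:
    """Merge per-segment recognition results into one string.

    Assumes every segment yields the same number of characters, already
    aligned by position; a real system would need an alignment step first.
    """
    if not segment_results:
        return ""
    length = len(segment_results[0])
    if any(len(seg) != length for seg in segment_results):
        raise ValueError("segments must be aligned to the same length")

    merged = []
    for position in range(length):
        # Candidates for this character position, one per video segment.
        candidates = [seg[position] for seg in segment_results]
        # Keep the candidate with the highest confidence (assumed criterion).
        best_char, _ = max(candidates, key=lambda c: c[1])
        merged.append(best_char)
    return "".join(merged)


# Illustrative data: three segments of the same caption over different backgrounds.
segments = [
    [("N", 0.9), ("E", 0.4), ("W", 0.8), ("S", 0.3)],
    [("M", 0.5), ("E", 0.9), ("V", 0.2), ("S", 0.7)],
    [("N", 0.7), ("F", 0.6), ("W", 0.6), ("5", 0.5)],
]
print(select_characters(segments))  # prints "NEWS"
```

No single segment in the sample data is guaranteed to be fully correct, yet the merged string is, which is the kind of gain the abstract attributes to per-character selection across diverse backgrounds.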


Pages: 1089 - 1093

Page count: 3

50 items in total

[1] Video text detection and segmentation for optical character recognition
Ngo, CW
Chan, CK
[J]. MULTIMEDIA SYSTEMS, 2005, 10 (03) : 261 - 272
[2] Video text detection and segmentation for optical character recognition
Chong-Wah Ngo
Chi-Kwong Chan
[J]. Multimedia Systems, 2005, 10 : 261 - 272
[3] Video text extraction from images for character recognition
Amarapur, Basavaraj
Patil, Nagaraj
[J]. 2006 CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, VOLS 1-5, 2006, : 95 - +
[4] A New Gradient based Character Segmentation Method for Video Text Recognition
Shivakumara, Palaiahnakote
Bhowmick, Souvik
Su, Bolan
Tan, Chew Lim
Pal, Umapada
[J]. 11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011, : 126 - 130
[5] Character recognition in a Japanese text recognition system
Hong, T
Srikantan, G
Zandy, VC
Fang, C
Srihari, SN
[J]. DOCUMENT RECOGNITION III, 1996, 2660 : 51 - 62
[6] Text recognition in scene image and video frame using Color Channel selection
Ayan Kumar Bhunia
Gautam Kumar
Partha Pratim Roy
R. Balasubramanian
Umapada Pal
[J]. Multimedia Tools and Applications, 2018, 77 : 8551 - 8578
[7] Text recognition in scene image and video frame using Color Channel selection
Bhunia, Ayan Kumar
Kumar, Gautam
Roy, Partha Pratim
Balasubramanian, R.
Pal, Umapada
[J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (07) : 8551 - 8578
[8] A Modification of a Stopping Method for Text Recognition in a Video Stream with Best Frame Selection
Tolstov, Ilya
Martynov, Stanislav
Farsobina, Vera
Bulatov, Konstantin
[J]. THIRTEENTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2020), 2021, 11605
[9] Text Vectorization Based on Character Recognition and Character Stroke Modeling
Fan, Zhigang
Zhou, Bingfeng
Tse, Francis
Mu, Yadong
He, Tao
[J]. IMAGING AND MULTIMEDIA ANALYTICS IN A WEB AND MOBILE WORLD 2014, 2014, 9027
[10] Automatic text segmentation and text recognition for video indexing
Lienhart, R
Effelsberg, W
[J]. MULTIMEDIA SYSTEMS, 2000, 8 (01) : 69 - 81
