Improvement of video text recognition by character selection

被引:10
|
作者
Mita, T [1 ]
Hori, O [1 ]
机构
[1] Toshiba Co Ltd, Corp R&D Ctr, Multimedia Lab, Saiwai Ku, Kawasaki, Kanagawa 2128582, Japan
关键词
D O I
10.1109/ICDAR.2001.953954
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a new method for improving the recognition accuracy of video text by exploiting the temporal redundancy of video. The proposed method divides the video into short segments and obtains several recognition results from some video segments. The video segments have various backgrounds because background image changes temporally due to camera-work or object motion. These recognition results from diverse backgrounds are integrated into a single text string after selecting the best recognition results of individual characters. The proposed method Has tested on a large set of news video sequences. Experimental results show that the proposed method increased the number of correct characters by 3.1% and the number of strings which do not include any recognition errors by 8.1%.
引用
收藏
页码:1089 / 1093
页数:3
相关论文
共 50 条
  • [1] Video text detection and segmentation for optical character recognition
    Ngo, CW
    Chan, CK
    [J]. MULTIMEDIA SYSTEMS, 2005, 10 (03) : 261 - 272
  • [2] Video text detection and segmentation for optical character recognition
    Chong-Wah Ngo
    Chi-Kwong Chan
    [J]. Multimedia Systems, 2005, 10 : 261 - 272
  • [3] Video text extraction from images for character recognition
    Amarapur, Basavaraj
    Patil, Nagaraj
    [J]. 2006 CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, VOLS 1-5, 2006, : 95 - +
  • [4] A New Gradient based Character Segmentation Method for Video Text Recognition
    Shivakumara, Palaiahnakote
    Bhowmick, Souvik
    Su, Bolan
    Tan, Chew Lim
    Pal, Umapada
    [J]. 11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011, : 126 - 130
  • [5] Character recognition in a Japanese text recognition system
    Hong, T
    Srikantan, G
    Zandy, VC
    Fang, C
    Srihari, SN
    [J]. DOCUMENT RECOGNITION III, 1996, 2660 : 51 - 62
  • [6] Text recognition in scene image and video frame using Color Channel selection
    Ayan Kumar Bhunia
    Gautam Kumar
    Partha Pratim Roy
    R. Balasubramanian
    Umapada Pal
    [J]. Multimedia Tools and Applications, 2018, 77 : 8551 - 8578
  • [7] Text recognition in scene image and video frame using Color Channel selection
    Bhunia, Ayan Kumar
    Kumar, Gautam
    Roy, Partha Pratim
    Balasubramanian, R.
    Pal, Umapada
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (07) : 8551 - 8578
  • [8] A Modification of a Stopping Method for Text Recognition in a Video Stream with Best Frame Selection
    Tolstov, Ilya
    Martynov, Stanislav
    Farsobina, Vera
    Bulatov, Konstantin
    [J]. THIRTEENTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2020), 2021, 11605
  • [9] Text Vectorization Based on Character Recognition and Character Stroke Modeling
    Fan, Zhigang
    Zhou, Bingfeng
    Tse, Francis
    Mu, Yadong
    He, Tao
    [J]. IMAGING AND MULTIMEDIA ANALYTICS IN A WEB AND MOBILE WORLD 2014, 2014, 9027
  • [10] Automatic text segmentation and text recognition for video indexing
    Lienhart, R
    Effelsberg, W
    [J]. MULTIMEDIA SYSTEMS, 2000, 8 (01) : 69 - 81