Recognition and Connection of Moving Captions in Arabic TV News

被引:0
|
作者
Iwata, Seiya [1 ]
Ohyama, Wataru [1 ]
Wakabayashi, Tetsushi [1 ]
Kimura, Fumitaka [1 ]
机构
[1] Mie Univ, Sch Engn, 1577 Kurima Machiya, Tsu, Mie 514, Japan
关键词
OCR; news caption recognition; Arabic word recognition; edit distance; moving news caption connection;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The authors have conducted studies on Arabic news caption recognition to develop a system for video retrieval by keyword to index and edit Arabic broadcast programs received daily and stored in a big database. This paper proposes a dedicated OCR for recognizing low resolution news caption in video images. The news caption recognition system consisting of text line extraction, word segmentation and recognition of words is developed and the performance is experimentally evaluated using Dataset of frame images extracted from AlJazeera broadcasting programs. This paper also proposes a technique to connect the recognized moving news captions into a sentence. The proposed method is necessary for automatic language translation and is also capable of reducing the OCR errors due to truncated characters at both ends of the running news captions. The proposed connection method is a technique based on insertion operation with minimum edit distance between successive two news captions to be connected. Character likelihood based substitution method is newly proposed and comparatively tested with majority based substitution method. For the Dataset, character recognition rate (F-measure) after moving news caption connection by proposed method using bi-gram sequence (Method-B) realized simple processing and was improved to 98.74% from the rate 96.12% before connection processing.
引用
收藏
页码:163 / 167
页数:5
相关论文
共 50 条
  • [1] Recognition and Transition Frame Detection of Arabic News Captions for Video Retrieval
    Iwata, Seiya
    Ohyama, Wataru
    Wakabayashi, Tetsushi
    Kimura, Fumitaka
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 4005 - 4010
  • [2] A construction system for CALL materials from TV news with captions
    Kobayashi, Satoshi
    Tanaka, Takashi
    Mori, Kazumasa
    Nakagawa, Seiichi
    Transactions of the Japanese Society for Artificial Intelligence, 2002, 17 (04) : 500 - 509
  • [3] Recognition of telops in Arabic news broadcasting
    Iwata, Seiya
    Ohyama, Wataru
    Wakabayashi, Tetsushi
    Kimura, Fumitaka
    IEEJ Transactions on Electronics, Information and Systems, 2016, 136 (12): : 1668 - 1676
  • [4] Video OCR: indexing digital news libraries by recognition of superimposed captions
    Sato, T
    Kanade, T
    Hughes, EK
    Smith, MA
    Satoh, S
    MULTIMEDIA SYSTEMS, 1999, 7 (05) : 385 - 395
  • [5] Video OCR: indexing digital news libraries by recognition of superimposed captions
    Toshio Sato
    Takeo Kanade
    Ellen K. Hughes
    Michael A. Smith
    Shin'ichi Satoh
    Multimedia Systems, 1999, 7 : 385 - 395
  • [6] Evaluation of closed captions for TV commercials
    Fukushima, Takahiro
    Kyokai Joho Imeji Zasshi/Journal of the Institute of Image Information and Television Engineers, 2015, 69 (08): : 248 - 252
  • [7] ALIF: A Dataset for Arabic Embedded Text Recognition in TV Broadcast
    Yousfi, Sonia
    Berrani, Sid-Ahmed
    Garcia, Christophe
    2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 1221 - 1225
  • [8] Development of a TV Broadcasts Speech Recognition System for Qatari Arabic
    Elmahdy, Mohamed
    Hasegawa-Johnson, Mark
    Mustafawi, Eiman
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014, : 3057 - 3061
  • [9] Indexing and retrieval methods of moving image database for TV news
    Shimo, Y
    LIBRARY AND INFORMATION SCIENCE, 1995, (34): : 17 - 28
  • [10] CHIP SET CAPTIONS TV PROGRAMS FOR DEAF
    IVERSEN, WR
    ELECTRONICS-US, 1980, 53 (02): : 41 - +