Recognition and Connection of Moving Captions in Arabic TV News

Cited by: 0
Authors
Iwata, Seiya [1]
Ohyama, Wataru [1]
Wakabayashi, Tetsushi [1]
Kimura, Fumitaka [1]
Affiliations
[1] Mie Univ, Sch Engn, 1577 Kurima Machiya, Tsu, Mie 514, Japan
Keywords
OCR; news caption recognition; Arabic word recognition; edit distance; moving news caption connection;
DOI
Not available
Chinese Library Classification number
TP301 [Theory, methods];
Discipline classification code
081202;
Abstract
The authors study Arabic news caption recognition with the aim of building a keyword-based video retrieval system for indexing and editing Arabic broadcast programs that are received daily and stored in a large database. This paper proposes a dedicated OCR for recognizing low-resolution news captions in video images. The recognition system, which consists of text line extraction, word segmentation, and word recognition, is developed, and its performance is evaluated experimentally on a dataset of frame images extracted from AlJazeera broadcast programs. The paper also proposes a technique for connecting the recognized moving news captions into a sentence. Such connection is necessary for automatic language translation, and it can also reduce OCR errors caused by characters truncated at either end of a running caption. The proposed connection method is based on an insertion operation with minimum edit distance between the two successive news captions to be connected. A character-likelihood-based substitution method is newly proposed and compared with a majority-based substitution method. On this dataset, the proposed connection method using bi-gram sequences (Method-B) kept the processing simple and improved the character recognition rate (F-measure) from 96.12% before connection to 98.74% after connection.
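The idea of connecting two successive moving captions by minimum edit distance can be illustrated with a small Python sketch. This is not the authors' implementation: the function names `edit_distance` and `connect`, the parameters `min_overlap` and `max_cost`, and the rule of taking the first half of the overlap from the earlier caption and the rest from the later one are all assumptions made for illustration. The likelihood-based substitution and the bi-gram variant (Method-B) described in the abstract are not reproduced here.

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance between two strings (insert, delete, substitute)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                # delete ca
                           cur[j - 1] + 1,             # insert cb
                           prev[j - 1] + (ca != cb)))  # substitute or match
        prev = cur
    return prev[-1]


def connect(left: str, right: str, min_overlap: int = 3,
            max_cost: float = 0.3) -> str:
    """Join two successive OCR results of a scrolling caption.

    Hypothetical heuristic: try every overlap length k, score it by the
    edit distance between the last k characters of `left` and the first
    k characters of `right` (normalized by k), and keep the best one.
    """
    best_k, best_cost = 0, None
    for k in range(min_overlap, min(len(left), len(right)) + 1):
        cost = edit_distance(left[-k:], right[:k]) / k
        # Prefer lower cost; on ties prefer the longer overlap.
        if best_cost is None or cost <= best_cost:
            best_k, best_cost = k, cost
    if best_cost is None or best_cost > max_cost:
        return left + right  # no plausible overlap: plain concatenation
    # Assumption: characters near a caption's edge are the ones most likely
    # to be truncated, so trust `left` for the first half of the overlap
    # and `right` for the second half.
    half = best_k // 2
    return left[:-best_k] + left[-best_k:][:half] + right[half:]


if __name__ == "__main__":
    # The last character of the first caption was cut off and misread.
    print(connect("the quick brown fa", "uick brown fox jumps"))
    # -> the quick brown fox jumps
```

The sketch works on any Unicode strings in logical order, so Arabic OCR output can be passed in the same way as the English example. With only two captions a majority vote is not possible, which is why a simple positional rule is used for the overlap.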
Pages: 163-167
Number of pages: 5
Related papers
50 in total
  • [21] Recognition of Visual Arabic Scripting News Ticker From Broadcast Stream
    Tayyab, Moeen
    Hussain, Ayyaz
    Alshara, Mohammed Ali
    Khan, Shakir
    Alotaibi, Reemiah Muneer
    Baig, Abdul Rauf
    IEEE ACCESS, 2022, 10 : 59189 - 59204
  • [22] Towards Leveraging Closed Captions for News Retrieval
    Blanco, Roi
    De Francisci Morales, Gianmarco
    Silvestri, Fabrizio
    PROCEEDINGS OF THE 22ND INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'13 COMPANION), 2013, : 135 - 136
  • [23] Automatic classification of TV news articles based on telop character recognition
    Arika, Y
    Matsuura, K
    IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS, PROCEEDINGS VOL 2, 1999, : 148 - 152
  • [24] Development of System that Automatically Attaches Captions on TV Images by AI
    Fujii Y.
    Kyokai Joho Imeji Zasshi/Journal of the Institute of Image Information and Television Engineers, 2020, 74 (03): : 39 - 42
  • [25] Where do those TV closed captions come from?
    Carvell, T
    FORTUNE, 1999, 139 (08) : 57 - 57
  • [26] THE ELDERLY READ TV, CAPTIONS AS WELL AS YOUNG-ADULTS
    THORN, SJ
    THORN, F
    MALLOY, DZ
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 1995, 36 (04) : S912 - S912
  • [27] Change of Style and Contents of TV News from Moving Image Analysis and an Audience Survey
    Fushimoto, Kaori
    Ueda, Shuichi
    LIBRARY AND INFORMATION SCIENCE, 2009, (62): : 167 - 192
  • [28] A Dataset for Arabic Text Detection, Tracking and Recognition in News Videos- AcTiV
    Zayene, Oussama
    Hennebert, Jean
    Touj, Sameh Masmoudi
    Ingold, Rolf
    Ben Amara, Najoua Essoukri
    2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 996 - 1000
  • [29] Open Datasets and Tools for Arabic Text Detection and Recognition in News Video Frames
    Zayene, Oussama
    Touj, Sameh Masmoudi
    Hennebert, Jean
    Ingold, Rolf
    Ben Amara, Najoua Essoukri
    JOURNAL OF IMAGING, 2018, 4 (02)
  • [30] Spoken language recognition in conversational telephone speech and TV broadcast news (GLOSA)
    Javier Rodriguez-Fuentes, Luis
    Varona, Amparo
    Penagarikano, Mikel
    Diez, Mireia
    Bordel, German
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2011, (47): : 349 - 350