Recognition and Connection of Moving Captions in Arabic TV News

被引：0

作者：

Iwata, Seiya ^{[1
]}

Ohyama, Wataru ^{[1
]}

Wakabayashi, Tetsushi ^{[1
]}

Kimura, Fumitaka ^{[1
]}

机构：

[1] Mie Univ, Sch Engn, 1577 Kurima Machiya, Tsu, Mie 514, Japan

来源：

2017 1ST INTERNATIONAL WORKSHOP ON ARABIC SCRIPT ANALYSIS AND RECOGNITION (ASAR) | 2017年

关键词：

OCR; news caption recognition; Arabic word recognition; edit distance; moving news caption connection;

D O I：

暂无

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

The authors have conducted studies on Arabic news caption recognition to develop a system for video retrieval by keyword to index and edit Arabic broadcast programs received daily and stored in a big database. This paper proposes a dedicated OCR for recognizing low resolution news caption in video images. The news caption recognition system consisting of text line extraction, word segmentation and recognition of words is developed and the performance is experimentally evaluated using Dataset of frame images extracted from AlJazeera broadcasting programs. This paper also proposes a technique to connect the recognized moving news captions into a sentence. The proposed method is necessary for automatic language translation and is also capable of reducing the OCR errors due to truncated characters at both ends of the running news captions. The proposed connection method is a technique based on insertion operation with minimum edit distance between successive two news captions to be connected. Character likelihood based substitution method is newly proposed and comparatively tested with majority based substitution method. For the Dataset, character recognition rate (F-measure) after moving news caption connection by proposed method using bi-gram sequence (Method-B) realized simple processing and was improved to 98.74% from the rate 96.12% before connection processing.

引用

页码：163 / 167

页数：5

共 50 条

[21] Recognition of Visual Arabic Scripting News Ticker From Broadcast Stream
Tayyab, Moeen
Hussain, Ayyaz
Alshara, Mohammed Ali
Khan, Shakir
Alotaibi, Reemiah Muneer
Baig, Abdul Rauf
IEEE ACCESS, 2022, 10 : 59189 - 59204
[22] Towards Leveraging Closed Captions for News Retrieval
Blanco, Roi
De Francisci Morales, Gianmarco
Silvestri, Fabrizio
PROCEEDINGS OF THE 22ND INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'13 COMPANION), 2013, : 135 - 136
[23] Automatic classification of TV news articles based on telop character recognition
Arika, Y
Matsuura, K
IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS, PROCEEDINGS VOL 2, 1999, : 148 - 152
[24] Development of System that Automatically Attaches Captions on TV Images by AI
Fujii Y.
Kyokai Joho Imeji Zasshi/Journal of the Institute of Image Information and Television Engineers, 2020, 74 (03): : 39 - 42
[25] Where do those TV closed captions come from?
Carvell, T
FORTUNE, 1999, 139 (08) : 57 - 57
[26] THE ELDERLY READ TV, CAPTIONS AS WELL AS YOUNG-ADULTS
THORN, SJ
THORN, F
MALLOY, DZ
INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 1995, 36 (04) : S912 - S912
[27] Change of Style and Contents of TV News from Moving Image Analysis and an Audience Survey
Fushimoto, Kaori
Ueda, Shuichi
LIBRARY AND INFORMATION SCIENCE, 2009, (62): : 167 - 192
[28] A Dataset for Arabic Text Detection, Tracking and Recognition in News Videos- AcTiV
Zayene, Oussama
Henneber, Jean
Touj, Sameh Masmoudi
Ingold, Rolf
Ben Amara, Najoua Essoukri
2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 996 - 1000
[29] Open Datasets and Tools for Arabic Text Detection and Recognition in News Video Frames
Zayene, Oussama
Touj, Sameh Masmoudi
Hennebert, Jean
Ingold, Rolf
Ben Amara, Najoua Essoukri
JOURNAL OF IMAGING, 2018, 4 (02)
[30] Spoken language recognition in conversational telephone speech and TV broadcast news (GLOSA)
Javier Rodriguez-Fuentes, Luis
Varona, Amparo
Penagarikano, Mikel
Diez, Mireia
Bordel, German
PROCESAMIENTO DEL LENGUAJE NATURAL, 2011, (47): : 349 - 350

← 1 2 3 4 5 →