ICDAR2017 Competition on Arabic Text Detection and Recognition in Multi-resolution Video Frames

Cited by: 2

Authors:

Zayene, Oussama [1,2]

Hennebert, Jean [3]

Ingold, Rolf [2]

BenAmara, Najoua Essoukri [1]

Affiliations:

[1] Univ Sousse, Natl Engn Sch Sousse Eniso, LATIS Lab, Sousse, Tunisia

[2] Univ Fribourg Unifr, Dept Informat, DIVA Grp, Fribourg, Switzerland

[3] Univ Appl Sci Western Switzerland, HES SO, Inst Complex Syst, Delemont, Switzerland

Source:

2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1 | 2017

Keywords:

Arabic Text Detection; Arabic Text Recognition; Video-OCR; AcTiV dataset; ICDAR competition; HANDWRITING RECOGNITION; KHATT;

DOI:

10.1109/ICDAR.2017.238

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

This paper describes the multi-resolution Arabic Text detection and recognition in Video Competition (AcTiVComp), held in the context of the 14th International Conference on Document Analysis and Recognition (ICDAR'2017), November 9-15, 2017, in Kyoto, Japan. The main objective of this competition is to evaluate the performance of participants' algorithms for automatically detecting and recognizing Arabic text in video frames using the freely available Arabic-Text-in-Video (AcTiV) dataset. A first edition was held in the framework of the 23rd International Conference on Pattern Recognition (ICPR'2016). Three groups with five systems are participating in the second edition of AcTiVComp. These systems are tested in a blind manner on a closed subset of the AcTiV database, which is unknown to all participants. In addition to the experimental setup and observed results, we also provide a short description of the participating groups and their systems.


Pages: 1460-1465

Page count: 6
