ICDAR2017 Competition on Arabic Text Detection and Recognition in Multi-resolution Video Frames

被引:2
|
作者
Zayene, Oussama [1 ,2 ]
Hennebert, Jean [3 ]
Ingold, Rolf [2 ]
BenAmara, Najoua Essoukri [1 ]
机构
[1] Univ Sousse, Natl Engn Sch Sousse Eniso, LATIS Lab, Sousse, Tunisia
[2] Univ Fribourg Unifr, Dept Informat, DIVA Grp, Fribourg, Switzerland
[3] Univ Appl Sci Western Switzerland, HES SO, Inst Complex Syst, Delemont, Switzerland
关键词
Arabic Text Detection; Arabic Text Recognition; Video-OCR; AcTiV dataset; ICDAR competition; HANDWRITING RECOGNITION; KHATT;
D O I
10.1109/ICDAR.2017.238
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper describes the multi-resolution Arabic Text detection and recognition in Video Competition-AcTiVComp held in the context of the 14th International Conference on Document Analysis and Recognition (ICDAR'2017), during November 9-15, 2017, in Kyoto, Japan. The main objective of this competition is to evaluate the performance of participants' algorithms for automatically detecting and recognizing Arabic texts in video frames using the freely available Arabic-Text-in-Video (AcTiV) dataset. A first edition was held in the framework of the 23rd International Conference on Pattern Recognition (ICPR'2016). Three groups with five systems are participating to the second edition of AcTiVComp. These systems are tested in a blind manner on a closed-subset of the AcTiV database, which is unknown to all participants. In addition to the experimental setup and observed results, we also provide a short description of the participating groups and their systems.
引用
收藏
页码:1460 / 1465
页数:6
相关论文
共 35 条
  • [31] Multi-dimensional long short-term memory networks for artificial Arabic text recognition in news video
    Zayene, Oussama
    Touj, Sameh Masmoudi
    Hennebert, Jean
    Ingold, Rolf
    Ben Amara, Najoua Essoukri
    [J]. IET COMPUTER VISION, 2018, 12 (05) : 710 - 719
  • [32] Multi-Script-Oriented Text Detection and Recognition in Video/Scene/Born Digital Images
    Raghunandan, K. S.
    Shivakumara, Palaiahnakote
    Roy, Sangheeta
    Kumar, G. Hemantha
    Pal, Umapada
    Lu, Tong
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (04) : 1145 - 1162
  • [33] Fractals based multi-oriented text detection system for recognition in mobile video images
    Shivakumara, Palaiahnakote
    Wu, Liang
    Lu, Tong
    Tan, Chew Lim
    Blumenstein, Michael
    Anami, Basavaraj S.
    [J]. PATTERN RECOGNITION, 2017, 68 : 158 - 174
  • [34] Multi-resolution approach to human activity recognition in video sequence based on combination of complex wavelet transform, Local Binary Pattern and Zernike moment
    Khare, Manish
    Jeon, Moongu
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (24) : 34863 - 34892
  • [35] Multi-resolution approach to human activity recognition in video sequence based on combination of complex wavelet transform, Local Binary Pattern and Zernike moment
    Manish Khare
    Moongu Jeon
    [J]. Multimedia Tools and Applications, 2022, 81 : 34863 - 34892