ARASTI: A Database for Arabic Scene Text Recognition

被引:0
|
作者
Tounsi, Maroua [1 ]
Moalla, Ikram [1 ,2 ]
Alimi, Adel M. [1 ]
机构
[1] ENIS Sfax, Res Grp Intelligent Machines REGIM Lab, Sfax, Tunisia
[2] Al Baha Univ, Al Bahah, Saudi Arabia
关键词
Arabic scene text; ARASTI Database; Character recognition;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Text in natural scenes provides many information for peoples and presents an essential tool to interact with their environment. Therefore, recognizing text existing in camera-captured images has become an important issue for many researches in the last decades. Currently, there isn't any available dataset of Arabic script text images in the wild. Since our aim is to help the research community in standardizing the evaluation of scene Arabic text recognition, we present in this paper a database of images of Arabic Scene Text, segmented scene Arabic words and segmented scene Arabic characters. We call this dataset ARASTI (ARAbic Scene Text Image). This database contains diverse natural scenes images captured at varying weather, lighting and perspective conditions. Moreover, characters and words are also segmented from the original images and stored individually. We obtain 1687 images, 1280 segmented scene Arabic words and 2093 scene Arabic character images. Compared to public datasets of scene text images in other languages like ICDAR03, Chars74K, etc., ARASTI contains a competitive number of images to these databases already published which proves that it can be used as a benchmark.
引用
收藏
页码:140 / 144
页数:5
相关论文
共 50 条
  • [1] Unconstrained Scene Text and Video Text Recognition for Arabic Script
    Jain, Mohit
    Mathew, Minesh
    Jawahar, C. V.
    [J]. 2017 1ST INTERNATIONAL WORKSHOP ON ARABIC SCRIPT ANALYSIS AND RECOGNITION (ASAR), 2017, : 26 - 30
  • [2] A Database for Offline Arabic Handwritten Text Recognition
    Mahmoud, Sabri A.
    Ahmad, Irfan
    Alshayeb, Mohammed
    Al-Khatib, Wasfi G.
    [J]. IMAGE ANALYSIS AND RECOGNITION: 8TH INTERNATIONAL CONFERENCE, ICIAR 2011, PT II: 8TH INTERNATIONAL CONFERENCE, ICIAR 2011, 2011, 6754 : 397 - 406
  • [3] Database for Arabic Printed Text Recognition Research
    Jaiem, Faten Kallel
    Kanoun, Slim
    Khemakhem, Maher
    El Abed, Haikal
    Kardoun, Jihain
    [J]. IMAGE ANALYSIS AND PROCESSING (ICIAP 2013), PT 1, 2013, 8156 : 251 - 259
  • [4] Printed Arabic Text Database for Automatic Recognition Systems
    Bouressace, Hassina
    Csirik, Janos
    [J]. PROCEEDINGS OF THE 2019 5TH INTERNATIONAL CONFERENCE ON COMPUTER AND TECHNOLOGY APPLICATIONS (ICCTA 2019), 2019, : 107 - 111
  • [5] Arabic Cursive Text Recognition from Natural Scene Images
    Bin Ahmed, Saad
    Naz, Saeeda
    Razzak, Muhammad Imran
    Yusof, Rubiyah
    [J]. APPLIED SCIENCES-BASEL, 2019, 9 (02):
  • [6] A Database for Arabic Handwritten Text Image Recognition and Writer Identification
    Mezghani, Anis
    Kanoun, Slim
    Khemakhem, Maher
    El Abed, Haikal
    [J]. 13TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR 2012), 2012, : 399 - 402
  • [7] ALTID : Arabic/Latin Text Images Database for recognition research
    Chtourou, Imen
    Rouhou, Ahmed Cheikh
    Jaiem, Faten Kallel
    Kanoun, Slim
    [J]. 2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 836 - 840
  • [8] A Database for Urdu Text Detection and Recognition in Natural Scene Images
    Chandio, Asghar Ali
    Leghari, Mehwish
    Memon, Mukhtiar Ahmed
    Leghari, Mehjabeen
    Jalbani, Akhtar Hussain
    [J]. MEHRAN UNIVERSITY RESEARCH JOURNAL OF ENGINEERING AND TECHNOLOGY, 2020, 39 (01) : 47 - 54
  • [9] Arabic Scene Text Recognition in the Deep Learning Era: Analysis on a Novel Dataset
    Hassan, Heba
    El-Mahdy, Ahmed
    Hussein, Mohamed E.
    [J]. IEEE ACCESS, 2021, 9 : 107046 - 107058
  • [10] Benchmark database and GUI environment for printed arabic text recognition research
    Al-Hashim, Amin G.
    Mahmoud, Sabri A.
    [J]. WSEAS Transactions on Information Science and Applications, 2010, 7 (04): : 587 - 597