Feature learning and encoding for multi-script writer identification

Cited by: 6
Authors
Semma, Abdelillah [1]
Hannad, Yaacoub [2]
Siddiqi, Imran [3]
Lazrak, Said [1]
El Kettani, Mohamed El Youssfi [1]
Affiliations
[1] Ibn Tofail Univ, Kenitra, Morocco
[2] Mohammed V Univ, Fac Educ Sci, Rabat, Morocco
[3] Bahria Univ, Islamabad, Pakistan
Keywords
Multi-script writer identification; Handwriting keypoints; Feature learning; Feature encoding; DESCRIPTORS; VLAD; INDIVIDUALITY; COMPETITION; DOCUMENTS
DOI
10.1007/s10032-022-00394-8
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Writer identification from handwriting samples is a long-standing research problem for the pattern recognition community in general and the handwriting recognition community in particular. Most work, however, assumes that each writer produces samples in a single script only. A more challenging scenario is multi-script writer identification, where a writer's training and test samples belong to different scripts. This paper presents a deep learning-based solution for writer identification in the multi-script scenario. The technique identifies keypoints in the handwriting and extracts small patches around them. These patches are intended to capture the writing gestures of an individual, which are likely to be common across multiple scripts. Robust feature representations are learned from the patches using a deep convolutional neural network, and the features are encoded using a newly proposed variant of the Vector of Locally Aggregated Descriptors (VLAD). Experiments on three bilingual handwriting datasets, comprising writing samples in Arabic, English, French, Chinese and Farsi, yield promising identification rates that significantly outperform the current state of the art on this problem.
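To make the encoding step concrete, the following is a minimal sketch of the standard VLAD encoding, not the variant proposed in the paper, whose details the abstract does not give. It assumes that per-patch descriptors (standing in for the CNN features) and a codebook of centroids (for example from k-means) are already available; the function name `vlad_encode` and the toy values are hypothetical.

```python
import math


def vlad_encode(descriptors, centroids):
    """Standard VLAD: accumulate residuals per nearest centroid, then normalize.

    descriptors: list of d-dimensional feature vectors (assumed patch features).
    centroids:   list of k d-dimensional codebook vectors (assumed precomputed).
    Returns a flat list of length k * d.
    """
    k = len(centroids)
    d = len(centroids[0])
    residual_sums = [[0.0] * d for _ in range(k)]

    for x in descriptors:
        # Hard-assign the descriptor to its nearest centroid (squared Euclidean).
        nearest = min(
            range(k),
            key=lambda i: sum((a - b) ** 2 for a, b in zip(x, centroids[i])),
        )
        # Accumulate the residual between the descriptor and that centroid.
        for j in range(d):
            residual_sums[nearest][j] += x[j] - centroids[nearest][j]

    # Concatenate the per-centroid residual sums into one vector.
    flat = [v for row in residual_sums for v in row]
    # Signed square-root (power) normalization, a common VLAD post-processing step.
    flat = [math.copysign(math.sqrt(abs(v)), v) for v in flat]
    # Global L2 normalization so documents with different patch counts are comparable.
    norm = math.sqrt(sum(v * v for v in flat))
    return [v / norm for v in flat] if norm > 0 else flat


# Toy example: two centroids in 2-D and three patch descriptors (made-up values).
toy_centroids = [[0, 0], [10, 10]]
toy_descriptors = [[1, 0], [0, 1], [9, 10]]
encoding = vlad_encode(toy_descriptors, toy_centroids)
```

Under these assumptions, each handwriting sample yields one fixed-length vector regardless of how many patches were extracted, so samples can be compared with an ordinary distance measure or passed to a classifier.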
Pages: 79-93
Number of pages: 15
Source
International Journal on Document Analysis and Recognition (IJDAR), 2022, 25: 79-93