Digital Hebrew Paleography: Script Types and Modes

被引:4
|
作者
Droby, Ahmad [1 ]
Rabaev, Irina [2 ]
Shapira, Daria Vasyutinsky [1 ]
Barakat, Berat Kurar [1 ]
El-Sana, Jihad [1 ]
机构
[1] Ben Gurion Univ Negev, Dept Comp Sci, IL-8410501 Beer Sheva, Israel
[2] Shamoon Coll Engn, Dept Software Engn, IL-8410802 Beer Sheva, Israel
关键词
digital paleography; handwritten style analysis; Hebrew medieval manuscripts; script type classification; deep-learning based classification; convolutional neural network;
D O I
10.3390/jimaging8050143
中图分类号
TB8 [摄影技术];
学科分类号
0804 ;
摘要
Paleography is the study of ancient and medieval handwriting. It is essential for understanding, authenticating, and dating historical texts. Across many archives and libraries, many handwritten manuscripts are yet to be classified. Human experts can process a limited number of manuscripts; therefore, there is a need for an automatic tool for script type classification. In this study, we utilize a deep-learning methodology to classify medieval Hebrew manuscripts into 14 classes based on their script style and mode. Hebrew paleography recognizes six regional styles and three graphical modes of scripts. We experiment with several input image representations and network architectures to determine the appropriate ones and explore several approaches for script classification. We obtained the highest accuracy using hierarchical classification approach. At the first level, the regional style of the script is classified. Then, the patch is passed to the corresponding model at the second level to determine the graphical mode. In addition, we explore the use of soft labels to define a value we call squareness value that indicates the squareness/cursiveness of the script. We show how the graphical mode labels can be redefined using the squareness value. This redefinition increases the classification accuracy significantly. Finally, we show that the automatic classification is on-par with a human expert paleographer.
引用
收藏
页数:22
相关论文
共 50 条