Reading Akkadian cuneiform using natural language processing

被引:10
|
作者
Gordin, Shai [1 ]
Gutherz, Gai [2 ]
Elazary, Ariel [2 ]
Romach, Avital [3 ]
Jimenez, Enrique [4 ]
Berant, Jonathan [2 ]
Cohen, Yoram [3 ]
机构
[1] Ariel Univ, Digital Humanities Ariel Lab, Fac Social Sci & Humanities, Ariel, Israel
[2] Tel Aviv Univ, Sch Comp Sci, Tel Aviv, Israel
[3] Tel Aviv Univ, Jacob M Alkow Dept Archaeol & Ancient Near Easter, Tel Aviv, Israel
[4] Ludwig Maximilians Univ Munchen, Inst Assyriol & Hittitol, Munich, Germany
来源
PLOS ONE | 2020年 / 15卷 / 10期
关键词
D O I
10.1371/journal.pone.0240511
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
In this paper we present a new method for automatic transliteration and segmentation of Unicode cuneiform glyphs using Natural Language Processing (NLP) techniques. Cuneiform is one of the earliest known writing system in the world, which documents millennia of human civilizations in the ancient Near East. Hundreds of thousands of cuneiform texts were found in the nineteenth and twentieth centuries CE, most of which are written in Akkadian. However, there are still tens of thousands of texts to be published. We use models based on machine learning algorithms such as recurrent neural networks (RNN) with an accuracy reaching up to 97% for automatically transliterating and segmenting standard Unicode cuneiform glyphs into words. Therefore, our method and results form a major step towards creating a human-machine interface for creating digitized editions. Our code, Akkademia, is made publicly available for use via a web application, a python package, and a github repository.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] Modeling legislation using natural language processing
    Van Gog, R
    Van Engers, TM
    [J]. 2001 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5: E-SYSTEMS AND E-MAN FOR CYBERNETICS IN CYBERSPACE, 2002, : 561 - 566
  • [22] READING AND LANGUAGE PROCESSING - AN INTRODUCTION
    HENDERSON, JM
    SINGER, M
    FERREIRA, F
    [J]. CANADIAN JOURNAL OF EXPERIMENTAL PSYCHOLOGY-REVUE CANADIENNE DE PSYCHOLOGIE EXPERIMENTALE, 1993, 47 (02): : 129 - 130
  • [23] Reading and language processing.
    Hyona, J
    [J]. EUROPEAN JOURNAL OF COGNITIVE PSYCHOLOGY, 1998, 10 (04): : 443 - 446
  • [24] Reading and language processing.
    Pleh, C
    [J]. APPLIED PSYCHOLINGUISTICS, 1998, 19 (03) : 516 - 519
  • [25] Reading and language processing.
    Swaffar, J
    [J]. MODERN LANGUAGE JOURNAL, 1998, 82 (02): : 271 - 272
  • [26] Natural language processing for improving hearing-impaired student reading skills
    Quiroz Pelayo, Claudia Beatriz
    Fajardo Flores, Silvia Berenice
    Gutierrez Pulido, Jorge Rafael
    [J]. 2017 INTERNATIONAL CONFERENCE ON INFORMATION SYSTEMS AND COMPUTER SCIENCE (INCISCOS), 2017, : 201 - 206
  • [27] Classification of the Disposition of Patients Hospitalized with COVID-19: Reading Discharge Summaries Using Natural Language Processing
    Fernandes, Marta
    Sun, Haoqi
    Jain, Aayushee
    Alabsi, Haitham S.
    Brenner, Laura N.
    Ye, Elissa
    Ge, Wendong
    Collens, Sarah, I
    Leone, Michael J.
    Das, Sudeshna
    Robbins, Gregory K.
    Mukerji, Shibani S.
    Westover, M. Brandon
    [J]. JMIR MEDICAL INFORMATICS, 2021, 9 (02)
  • [28] Translating Speech to Indian Sign Language Using Natural Language Processing
    Sharma, Purushottam
    Tulsian, Devesh
    Verma, Chaman
    Sharma, Pratibha
    Nancy, Nancy
    [J]. FUTURE INTERNET, 2022, 14 (09):
  • [29] Survey on Spell Checker for Tamil Language Using Natural Language Processing
    Selvaraj, P. A.
    Jagadeesan, M.
    Harikrishnan, M.
    Vijayapriya, R.
    Jayasudha, K.
    [J]. JOURNAL OF PHARMACEUTICAL NEGATIVE RESULTS, 2022, 13 : 170 - 174
  • [30] Second language learning system on the WWW using natural language processing
    Dansuwan, S
    Nishina, K
    Akahori, K
    [J]. PROCEEDINGS OF ICCE'98, VOL 1 - GLOBAL EDUCATION ON THE NET, 1998, : 599 - 605