Named Entity Recognition for Nepali Language

被引:7
|
作者
Singh, Oyesh Mann [1 ]
Padia, Ankur [1 ]
Joshi, Anupam [1 ]
机构
[1] UMBC, Dept CSEE, Baltimore, MD 21250 USA
关键词
Named Entity Recognition; Nepali; Low-resource; BiLSTM; CNN; Grapheme;
D O I
10.1109/CIC48465.2019.00031
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Named Entity Recognition (NER) has been studied for many languages like English, German, Spanish, and others but virtually no studies have focused on the Nepali language. One key reason is the lack of an appropriate, annotated dataset. In this paper, we describe a Nepali NER dataset that we created. We discuss and compare the performance of various machine learning models on this dataset. We also propose a novel NER scheme for Nepali and show that this scheme, based on grapheme-level representations, outperforms character-level representations when combined with BiLSTM models. Our best models obtain an overall F1 score of 86.89, which is a significant improvement on previously reported performance in literature.
引用
收藏
页码:184 / 190
页数:7
相关论文
共 50 条
  • [1] Named Entity Recognition (NER) for Nepali
    Maharjan, Gopal
    Bal, Bal Krishna
    Regmi, Santosh
    [J]. CREATIVITY IN INTELLIGENT TECHNOLOGIES AND DATA SCIENCE, PT II, 2019, 1084 : 71 - 80
  • [2] Named Entity Recognition for Mongolian Language
    Munkhjargal, Zoljargal
    Bella, Gabor
    Chagnaa, Altangerel
    Giunchiglia, Fausto
    [J]. TEXT, SPEECH, AND DIALOGUE (TSD 2015), 2015, 9302 : 243 - 251
  • [3] Named Entity Recognition in Marathi Language
    Kale, Shrutika
    Govilkar, Sharvari
    [J]. INTERNATIONAL CONFERENCE ON INTELLIGENT DATA COMMUNICATION TECHNOLOGIES AND INTERNET OF THINGS, ICICI 2018, 2019, 26 : 371 - 377
  • [4] Named Entity Recognition for Sinhala Language
    Dahanayaka, J. K.
    Weerasinghe, A. R.
    [J]. 14TH INTERNATIONAL CONFERENCE ON ADVANCES IN ICT FOR EMERGING REGIONS (ICTER) 2014, 2014, : 215 - 220
  • [5] Named entity recognition for the Kazakh language
    Kozhirbayev, Z. M.
    Yessenbayev, Z. A.
    [J]. JOURNAL OF MATHEMATICS MECHANICS AND COMPUTER SCIENCE, 2020, 107 (03): : 57 - 66
  • [6] Named Entity Recognition for the Azerbaijani Language
    Akhundova, Natavan
    [J]. 2021 IEEE 15TH INTERNATIONAL CONFERENCE ON APPLICATION OF INFORMATION AND COMMUNICATION TECHNOLOGIES (AICT2021), 2021,
  • [7] Supervised Named Entity Recognition in Assamese language
    Talukdar, Gitimoni
    Borah, Pranjal Protim
    Baruah, Arup
    [J]. 2014 INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING AND INFORMATICS (IC3I), 2014, : 187 - 191
  • [8] Named Entity Recognition System for Sindhi Language
    Jumani, Awais Khan
    Memon, Mashooque Ahmed
    Khoso, Fida Hussain
    Sanjrani, Anwar Ali
    Soomro, Safeeullah
    [J]. EMERGING TECHNOLOGIES IN COMPUTING, ICETIC 2018, 2018, 200 : 237 - 246
  • [9] Named Entity Recognition and Classification for Gujarati Language
    Vora, Komil
    Vasant, Avani
    Adhvaryu, Rachit
    [J]. 2016 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2016, : 2269 - 2272
  • [10] A LANGUAGE INDEPENDENT NAMED ENTITY RECOGNITION SYSTEM
    Gifu, Daniela
    Vasilache, Gabriela
    [J]. PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE 'LINQUISTIC RESOURCES AND TOOLS FOR PROCESSING THE ROMANIAN LANGUAGE', 2014, 2014, : 181 - 188