Deep recurrent neural networks with word embeddings for Urdu named entity recognition

被引:20
|
作者
Khan, Wahab [1 ]
Daud, Ali [1 ]
Alotaibi, Fahd [2 ]
Aljohani, Naif [2 ]
Arafat, Sachi [2 ]
机构
[1] Int Islamic Univ, Dept Comp Sci & Software Engn, Islamabad, Pakistan
[2] King Abdulaziz Univ, Fac Comp & Informat Technol, Jeddah, Saudi Arabia
关键词
conditional random fields; deep recurrent neural network; machine learning; named entity recognition; Urdu;
D O I
10.4218/etrij.2018-0553
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Named entity recognition (NER) continues to be an important task in natural language processing because it is featured as a subtask and/or subproblem in information extraction and machine translation. In Urdu language processing, it is a very difficult task. This paper proposes various deep recurrent neural network (DRNN) learning models with word embedding. Experimental results demonstrate that they improve upon current state-of-the-art NER approaches for Urdu. The DRRN models evaluated include forward and bidirectional extensions of the long short-term memory and back propagation through time approaches. The proposed models consider both language-dependent features, such as part-of-speech tags, and language-independent features, such as the "context windows" of words. The effectiveness of the DRNN models with word embedding for NER in Urdu is demonstrated using three datasets. The results reveal that the proposed approach significantly outperforms previous conditional random field and artificial neural network approaches. The best f-measure values achieved on the three benchmark datasets using the proposed deep learning approaches are 81.1%, 79.94%, and 63.21%, respectively.
引用
收藏
页码:90 / 100
页数:11
相关论文
共 50 条
  • [1] A deep neural framework for named entity recognition with boosted word embeddings
    Goyal, Archana
    Gupta, Vishal
    Kumar, Manish
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (06) : 15533 - 15546
  • [2] A deep neural framework for named entity recognition with boosted word embeddings
    Archana Goyal
    Vishal Gupta
    Manish Kumar
    [J]. Multimedia Tools and Applications, 2024, 83 : 15533 - 15546
  • [3] Recurrent neural networks with specialized word embeddings for health-domain named-entity recognition
    Unanue, Inigo Jauregi
    Borzeshi, Ehsan Zare
    Piccardi, Massimo
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2017, 76 : 102 - 109
  • [4] Recurrent Neural Network-Based Model for Named Entity Recognition with Improved Word Embeddings
    Goyal, Archana
    Gupta, Vishal
    Kumar, Manish
    [J]. IETE JOURNAL OF RESEARCH, 2023, 69 (10) : 6970 - 6976
  • [5] Chemical Named Entity Recognition with Deep Contextualized Neural Embeddings
    Awan, Zainab
    Kahlke, Tim
    Ralph, Peter J.
    Kennedy, Paul J.
    [J]. KDIR: PROCEEDINGS OF THE 11TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT - VOL 1: KDIR, 2019, : 135 - 144
  • [6] Deep learning with word embeddings improves biomedical named entity recognition
    Habibi, Maryam
    Weber, Leon
    Neves, Mariana
    Wiegandt, David Luis
    Leser, Ulf
    [J]. BIOINFORMATICS, 2017, 33 (14) : I37 - I48
  • [7] Recurrent neural networks for Turkish named entity recognition
    Gungor, Onur
    Uskudarli, Suzan
    Gungor, Tunga
    [J]. 2018 26TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2018,
  • [8] Named Entity Recognition Only from Word Embeddings
    Luo, Ying
    Zhao, Hai
    Zhan, Junlang
    [J]. PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 8995 - 9005
  • [9] Combining Word Embeddings for Portuguese Named Entity Recognition
    da Silva, Messias Gomes
    Alves de Oliveira, Hilario Tomaz
    [J]. COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE, PROPOR 2022, 2022, 13208 : 198 - 208
  • [10] Mongolian Named Entity Recognition with Bidirectional Recurrent Neural Networks
    Wang, Weihua
    Bao, Feilong
    Gao, Guanglai
    [J]. 2016 IEEE 28TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2016), 2016, : 495 - 500