Enhancing Legal Named Entity Recognition Using RoBERTa-GCN with CRF: A Nuanced Approach for Fine-Grained Entity Recognition

被引:0
|
作者
Jain, Arihant [1 ]
Sharma, Raksha [1 ]
机构
[1] Indian Inst Technol Roorkee, Roorkee, India
关键词
Legal Domain; Pretrained Language Models; Named Entity Recognition; Conditional Random Fields;
D O I
10.1007/978-3-031-56063-7_17
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Accurate identification of named entities is pivotal for the advancement of sophisticated legal Artificial Intelligence (AI) applications. However, the legal domain presents distinct challenges due to the presence of fine-grained, domain-specific entities, including lawyers, judges, courts, and precedents. This necessitates a nuanced approach to Named Entity Recognition (NER). In this paper, we introduce a novel NER approach tailored to the legal domain. Our system combines Robustly Optimized BERT (RoBERTa) with a Graph Convolutional Network (GCN) to harness two distinct types of complementary information related to words in the data. Furthermore, the application of a Conditional Random Field (CRF) at the output layer ensures global consistency in data labeling by considering the entire sequence when predicting a named entity. RoBERTa captures contextual information about individual words, while GCN allows us to exploit the mutual relationships between words, resulting in more precise named entity identification. Our results indicate that RoBERTa-GCN (CRF) outperforms other standard settings, such as, RoBERTa, textGCN, and BiLSTM, including state-of-the-art for NER in the legal domain.
引用
收藏
页码:261 / 267
页数:7
相关论文
共 50 条
  • [41] Combining Knowledge and CRF-Based Approach to Named Entity Recognition in Russian
    Mozharova, V. A.
    Loukachevitch, N. V.
    ANALYSIS OF IMAGES, SOCIAL NETWORKS AND TEXTS, AIST 2016, 2017, 661 : 185 - 195
  • [42] Fine-Grained Named Entity Classification with Wikipedia Article Vectors
    Suzuki, Masatoshi
    Matsuda, Koji
    Sekine, Satoshi
    Okazaki, Naoaki
    Inui, Kentaro
    2016 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2016), 2016, : 483 - 486
  • [43] Fine-grained Named Entity Annotations for German Biographic Interviews
    Ruppenhofer, Josef
    Rehbein, Ines
    Flinz, Carolina
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 4605 - 4614
  • [44] Named Entity Recognition in the Medical Domain with Constrained CRF Models
    Jochim, Charles
    Deleris, Lea A.
    15TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2017), VOL 1: LONG PAPERS, 2017, : 839 - 849
  • [45] BiLSTM-CRF for Persian Named-Entity Recognition
    Poostchi, Hanieh
    Borzeshi, Ehsan Zare
    Piccardi, Massimo
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 4427 - 4431
  • [46] LSTM-CRF for Drug-Named Entity Recognition
    Zeng, Donghuo
    Sun, Chengjie
    Lin, Lei
    Liu, Bingquan
    ENTROPY, 2017, 19 (06)
  • [47] Fine-Grained Named Entity Recognition Using a Multi-Stacked Feature Fusion and Dual-Stacked Output in Korean
    Kim, Hongjin
    Kim, Harksoo
    APPLIED SCIENCES-BASEL, 2021, 11 (22):
  • [48] CRF-Based Named Entity Recognition for Myanmar Language
    Mo, Hsu Myat
    Nwet, Khin Thandar
    Soe, Khin Mar
    GENETIC AND EVOLUTIONARY COMPUTING, 2017, 536 : 204 - 211
  • [49] HDCNN-CRF for Biomedical Text Named Entity Recognition
    Gao, Mingyuan
    Wei, Hao
    Chen, Fei
    Qu, Wen
    Lu, Mingyu
    PROCEEDINGS OF 2019 IEEE 10TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS 2019), 2019, : 191 - 194
  • [50] LDA in Character-LSTM-CRF Named Entity Recognition
    Konopik, Miloslav
    Prazak, Ondrej
    TEXT, SPEECH, AND DIALOGUE (TSD 2018), 2018, 11107 : 58 - 66