Enhancing Legal Named Entity Recognition Using RoBERTa-GCN with CRF: A Nuanced Approach for Fine-Grained Entity Recognition

被引:0
|
作者
Jain, Arihant [1 ]
Sharma, Raksha [1 ]
机构
[1] Indian Inst Technol Roorkee, Roorkee, India
关键词
Legal Domain; Pretrained Language Models; Named Entity Recognition; Conditional Random Fields;
D O I
10.1007/978-3-031-56063-7_17
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Accurate identification of named entities is pivotal for the advancement of sophisticated legal Artificial Intelligence (AI) applications. However, the legal domain presents distinct challenges due to the presence of fine-grained, domain-specific entities, including lawyers, judges, courts, and precedents. This necessitates a nuanced approach to Named Entity Recognition (NER). In this paper, we introduce a novel NER approach tailored to the legal domain. Our system combines Robustly Optimized BERT (RoBERTa) with a Graph Convolutional Network (GCN) to harness two distinct types of complementary information related to words in the data. Furthermore, the application of a Conditional Random Field (CRF) at the output layer ensures global consistency in data labeling by considering the entire sequence when predicting a named entity. RoBERTa captures contextual information about individual words, while GCN allows us to exploit the mutual relationships between words, resulting in more precise named entity identification. Our results indicate that RoBERTa-GCN (CRF) outperforms other standard settings, such as, RoBERTa, textGCN, and BiLSTM, including state-of-the-art for NER in the legal domain.
引用
收藏
页码:261 / 267
页数:7
相关论文
共 50 条
  • [31] Arabic Named Entity Recognition: A Bidirectional GRU-CRF Approach
    Gridach, Mourad
    Haddad, Hatem
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING (CICLING 2017), PT I, 2018, 10761 : 264 - 275
  • [32] A CRF based Machine Learning Approach for Biomedical Named Entity Recognition
    Kanimozhi, U.
    Manjula, D.
    2017 SECOND INTERNATIONAL CONFERENCE ON RECENT TRENDS AND CHALLENGES IN COMPUTATIONAL MODELS (ICRTCCM), 2017, : 335 - 342
  • [33] LSTM-CRF Models for Named Entity Recognition
    Lee, Changki
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2017, E100D (04): : 882 - 887
  • [34] Entity Retrieval Using Fine-Grained Entity Aspects
    Chatterjee, Shubham
    Dietz, Laura
    SIGIR '21 - PROCEEDINGS OF THE 44TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2021, : 1662 - 1666
  • [35] Chinese Cyber Threat Intelligence Named Entity Recognition via RoBERTa-wwm-RDCNN-CRF
    Zhen, Zhen
    Gao, Jian
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 77 (01): : 299 - 323
  • [36] A Named Entity Recognition Approach for Albanian
    Skenduli, Marjana Prifti
    Biba, Marenglen
    2013 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2013, : 1532 - 1537
  • [37] A New Approach for Named Entity Recognition
    Ertopcu, Burak
    Kanburoglu, Ali Bugra
    Topsakal, Ozan
    Acikgoz, Onur
    Gurkan, Ali Tunca
    Ozenc, Berke
    Cam, Ilker
    Avar, Begum
    Ercan, Gokhan
    Yildiz, Olcay Taner
    2017 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING (UBMK), 2017, : 474 - 479
  • [38] The ConceptMapper Approach to Named Entity Recognition
    Tanenblatt, Michael
    Coden, Anni
    Sominsky, Igor
    LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010,
  • [39] Enhancing Entity Boundary Detection for Better Chinese Named Entity Recognition
    Chen, Chun
    Kong, Fang
    ACL-IJCNLP 2021: THE 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 2, 2021, : 20 - 25
  • [40] Named Entity Recognition for Setswana Language: A Conditional Random Fields (CRF) Approach
    Okgetheng, Boago
    Malema, Gabofetswe
    PROCEEDINGS OF 2023 7TH INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND INFORMATION RETRIEVAL, NLPIR 2023, 2023, : 240 - 244