Legal sentence boundary detection using hybrid deep learning and statistical models

被引:0
|
作者
Sheik, Reshma [1 ]
Ganta, Sneha Rao [1 ]
Nirmala, S. Jaya [1 ]
机构
[1] Natl Inst Technol Trichy, Tiruchirappalli, Tamil Nadu, India
关键词
Natural language processing; Sentence boundary detection; Deep learning; Transformer; LegalBERT; CaseLawBERT; CNN; CRF;
D O I
10.1007/s10506-024-09394-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sentence boundary detection (SBD) represents an important first step in natural language processing since accurately identifying sentence boundaries significantly impacts downstream applications. Nevertheless, detecting sentence boundaries within legal texts poses a unique and challenging problem due to their distinct structural and linguistic features. Our approach utilizes deep learning models to leverage delimiter and surrounding context information as input, enabling precise detection of sentence boundaries in English legal texts. We evaluate various deep learning models, including domain-specific transformer models like LegalBERT and CaseLawBERT. To assess the efficacy of our deep learning models, we compare them with a state-of-the-art domain-specific statistical conditional random field (CRF) model. After considering model size, F1-score, and inference time, we identify the Convolutional Neural Network Model (CNN) as the top-performing deep learning model. To further enhance performance, we integrate the features of the CNN model into the subsequent CRF model, creating a hybrid architecture that combines the strengths of both models. Our experiments demonstrate that the hybrid model outperforms the baseline model, achieving a 4% improvement in the F1-score. Additional experiments showcase the superiority of the hybrid model over SBD open-source libraries when confronted with an out-of-domain test set. These findings underscore the importance of efficient SBD in legal texts and emphasize the advantages of employing deep learning models and hybrid architectures to achieve optimal performance.
引用
收藏
页数:31
相关论文
共 50 条
  • [41] Fake News Detection Using Hybrid Deep Learning Method
    Yadav A.K.
    Kumar S.
    Kumar D.
    Kumar L.
    Kumar K.
    Maurya S.K.
    Kumar M.
    Yadav D.
    SN Computer Science, 4 (6)
  • [42] Wheel Defect Detection Using a Hybrid Deep Learning Approach
    Shaikh, Khurram
    Hussain, Imtiaz
    Chowdhry, Bhawani Shankar
    SENSORS, 2023, 23 (14)
  • [43] Real-world sentence boundary detection using multitask learning: A case study on French
    Lim, KyungTae
    Park, Jungyeul
    NATURAL LANGUAGE ENGINEERING, 2024, 30 (01) : 150 - 170
  • [44] Polarity Detection of Dialectal Arabic using Deep Learning Models
    Mohamed, Saleh M.
    Mohamed, Ensaf Hussein
    Belal, Mohamed A.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (11) : 212 - 218
  • [45] Phishing URL Detection using Deep Learning with CNN Models
    Alsadig, Alsadig Hadi
    Ahmad, Md Ogail
    2024 SECOND INTERNATIONAL CONFERENCE ON INTELLIGENT CYBER PHYSICAL SYSTEMS AND INTERNET OF THINGS, ICOICI 2024, 2024, : 768 - 775
  • [46] Astronomical Object Shape Detection Using Deep Learning Models
    Mohanasundaram, K.
    Balasaranya, K.
    Priya, J. Geetha
    Ruchitha, B.
    Priya, A. Vishnu
    Harshini, Hima
    INTERNATIONAL JOURNAL OF EARLY CHILDHOOD SPECIAL EDUCATION, 2022, 14 (02) : 7867 - 7874
  • [47] Detection of Mulberry Ripeness Stages Using Deep Learning Models
    Miraei Ashtiani, Seyed-Hassan
    Javanmardi, Shima
    Jahanbanifard, Mehrdad
    Martynenko, Alex
    Verbeek, Fons J.
    IEEE ACCESS, 2021, 9 : 100380 - 100394
  • [48] Monkeypox Skin Lesion Detection Using Deep Learning Models
    Gurbuz, Selen
    Aydin, Galip
    2022 INTERNATIONAL CONFERENCE ON COMPUTERS AND ARTIFICIAL INTELLIGENCE TECHNOLOGIES, CAIT, 2022, : 66 - 70
  • [49] Design and Analysis of Intrusion Detection Using Deep Learning Models
    Modak, Abhishek
    Dehalwar, Vasudev
    10TH INTERNATIONAL CONFERENCE ON ELECTRONICS, COMPUTING AND COMMUNICATION TECHNOLOGIES, CONECCT 2024, 2024,
  • [50] Detection of bruises on red apples using deep learning models
    Unal, Zeynep
    Kizildeniz, Tefide
    Ozden, Mustafa
    Aktas, Hakan
    Karagoz, Omer
    SCIENTIA HORTICULTURAE, 2024, 329