Natural Language Processing Applications in Case-Law Text Publishing

Cited by: 2
Authors
Tarasconi, Francesco [1]
Botros, Milad [1]
Caserio, Matteo [1]
Sportelli, Gianpiero [1]
Giacalone, Giuseppe [2]
Uttini, Carlotta [2]
Vignati, Luca [2]
Zanetta, Fabrizio [2]
Affiliations
[1] CELI Language Technol, Via San Quintino 31, I-10121 Turin, Italy
[2] Giuffre Francis Lefebvre, Milan, Italy
Keywords
natural language processing; applications; transfer learning; language models; text classification; information extraction; publishing industry; machine learning; BERT fine-tuning; random forest; Italian language
DOI
10.3233/FAIA200859
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Processing case-law content for electronic publishing is a time-consuming activity that encompasses several sub-tasks and usually involves adding annotations to the original text. Recent advances in Artificial Intelligence and Natural Language Processing, however, enable the automatic and efficient analysis of large volumes of text. In this paper we present our Machine Learning solution to three specific business problems regularly met by a real-world Italian publisher in its day-to-day work: recognition of legal references in text spans, ranking of new content by relevance, and text classification according to a given tree of topics. We experimented with different approaches based on the BERT language model, together with alternatives typically based on Bag-of-Words. The best solution, deployed in a controlled production environment, was based on fine-tuned BERT in two of the three cases (extraction of legal references and text classification), while for relevance ranking a Random Forest model with hand-crafted features was preferred. We conclude by discussing the concrete impact of the developed prototypes, as perceived by the publisher.
Pages: 154-163
Number of pages: 10