Extracting Definitions from Brazilian Legal Texts

被引:0
|
作者
Ferneda, Edilson [1 ]
do Prado, Hercules Antonio [1 ,2 ]
Batista, Augusto Herrmann [1 ,3 ]
Pinheiro, Marcello Sandi [4 ]
机构
[1] Univ Catolica Brasilia, Grad Program Knowledge & IT Management, SGAN 916 Av W5, BR-70790160 Brasilia, DF, Brazil
[2] Embrapa Management & Strategy Secretariat, BR-7077090 Brasilia, DF, Brazil
[3] Minist Planning Budget & Management, Logist & Informat Technol Secretariat, BR-70046900 Brasilia, DF, Brazil
[4] Univ Fed Rio de Janeiro, COPPE, BR-2194197 Rio De Janeiro, RJ, Brazil
关键词
Information extraction; Definition extraction; Natural Language Processing;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In order to avoid ambiguity and to ensure, as far as possible, a strict interpretation of law, legal texts usually define the specific lexical terms used within their discourse by means of normative rules. With an often large amount of rules in effect in a given domain, extracting these definitions manually would be a costly undertaking. This paper presents an approach to cope with this problem based in a variation of an automated technique of natural language processing of Brazilian Portuguese texts. For the sake of generality, the proposed solution was developed to address the more general problem of building a glossary from domain specific texts that contain definitions amongst their content. This solution was applied to a corpus of texts on the telecommunications regulations domain and the results are reported. The usual pipeline of natural language processing has been followed: preprocessing, segmentation, and part-of-speech tagging. A set of feature extraction functions is specified and used along with reference glossary information on whether or not a text fragment is a definition, to train a SVM classifier. At last, the definitions are extracted from the texts and evaluated upon a testing corpus, which also contains the reference glossary annotations on definitions. The results are then discussed in light of other definition extraction techniques.
引用
收藏
页码:631 / 646
页数:16
相关论文
共 50 条
  • [1] Extracting Semantic Annotations from Legal Texts
    Lesmo, Leonardo
    Mazzei, Alessandro
    Radicioni, Daniele P.
    20TH ACM CONFERENCE ON HYPERTEXT AND HYPERMEDIA (HYPERTEXT 2009), 2009, : 167 - 171
  • [2] Prospects for Legal Analytics: Some Approaches to Extracting More Meaning from Legal Texts
    Ashley, Kevin D.
    UNIVERSITY OF CINCINNATI LAW REVIEW, 2022, 90 (04) : 1206 - 1240
  • [3] A Query System for Extracting Requirements-related Information from Legal Texts
    Sleimi, Amin
    Ceci, Marcello
    Sannier, Nicolas
    Sabetzadeh, Mehrdad
    Briand, Lionel C.
    Dann, John
    2019 27TH IEEE INTERNATIONAL REQUIREMENTS ENGINEERING CONFERENCE (RE 2019), 2019, : 319 - 329
  • [4] An Ontology-Based and Deep Learning-Driven Method for Extracting Legal Facts from Chinese Legal Texts
    Ren, Yong
    Han, Jinfeng
    Lin, Yingcheng
    Mei, Xiujiu
    Zhang, Ling
    ELECTRONICS, 2022, 11 (12)
  • [5] Extracting Legal Norm Analysis Categories from German Law Texts with Large Language Models
    Bachinger, Sarah T.
    Feddoul, Leila
    Mauch, Marianne
    Koenig-Ries, Birgitta
    PROCEEDINGS OF THE 25TH ANNUAL INTERNATIONAL CONFERENCE ON DIGITAL GOVERNMENT RESEARCH, DGO 2024, 2024, : 481 - 493
  • [6] Extracting Information from Archaeological Texts
    Kintigh, Keith W.
    OPEN ARCHAEOLOGY, 2015, 1 (01): : 96 - 101
  • [7] Extracting Appraisal Expressions from Short Texts
    Jin, Peiquan
    Yu, Yongbo
    Zhao, Jie
    Yue, Lihua
    WEB-AGE INFORMATION MANAGEMENT (WAIM 2015), 2015, 9098 : 481 - 485
  • [8] Extracting candidate terms from medical texts
    Bentounsi, Imene
    Boufaida, Zizette
    2013 ACS INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2013,
  • [9] Towards extracting semantic information from texts
    Trandabat, Diana
    13TH INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND NUMERIC ALGORITHMS FOR SCIENTIFIC COMPUTING (SYNASC 2011), 2012, : 199 - 206
  • [10] Extracting Dependency Trees from Sanskrit Texts
    Hellwig, Oliver
    SANSKRIT COMPUTATIONAL LINGUISTICS, 2009, 5406 : 106 - 115