On Creating and Using Text of the Russian Federation Corpus of Legal Acts as an Open Dataset

Cited by: 3
Author
Saveliev, Denis [1]
Affiliation
[1] European Univ St Petersburg, Inst Implementing Law, 6-1 Gagarinskaya Str, St Petersburg 191887, Russia
Keywords
legal information; legislation; open data; dataset; XML; legal act; machine-readable corpus; computer linguistics; text as data
DOI
10.17323/2072-8166.2018.1.26.44
Chinese Library Classification
D9 [Law]; DF [Law]
Discipline classification code
0301
Abstract
Methods of computer-aided text analysis that are currently being developed can be useful for legal research and practice. An obvious prerequisite for such analysis is an open, structured corpus of texts. The article presents such a corpus: RusLawOD, a machine-readable dataset of the texts of federal and regional legal acts. It is publicly available on GitHub. The dataset is built from open sources of legal acts, primarily the Official Internet Portal of Legal Information (pravo.gov.ru), by integrating open data on officially published legal acts with the Zakonodatelstvo Rossii legal information system. The main legal research question in developing this resource was how to publish the texts of legal acts and their metadata. A common nationwide standard for describing legal acts in machine-readable form is needed so that different information systems can exchange data. This requires uniform names for the attributes that identify a document and describe its internal structure. The article proposes solutions that can serve as a basis for such a standard. In addition to describing the data, the article gives examples of how the data can help solve legal research problems, namely classifying legal acts and measuring the frequency of collocations of certain terms. Based on an analysis of the metadata of documents published on the official site, the classifier of themes actually in use was reconstructed and the usage of each theme was counted.
The author compares the existing classification of legal acts with the results of applying computational-linguistic methods to determine the most frequently used subjects in legislation, and concludes that modern methods of computer-based text analysis can yield valuable, well-founded results.
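The two example analyses named in the abstract, counting theme usage from metadata and measuring collocation frequency, can be illustrated with a minimal sketch. It is not the author's method and does not use the actual RusLawOD schema: the element names (`corpus`, `act`, `theme`, `text`), the sample records, and the use of adjacent word pairs as a stand-in for collocations are all assumptions made for illustration.

```python
import re
import xml.etree.ElementTree as ET
from collections import Counter

# Hypothetical records: the element names (<act>, <theme>, <text>) and the
# sample content are assumptions, not the actual RusLawOD schema or data.
SAMPLE = """
<corpus>
  <act><theme>taxation</theme><text>The legal act amends the tax code.</text></act>
  <act><theme>taxation</theme><text>This legal act repeals the earlier legal act.</text></act>
  <act><theme>education</theme><text>The act regulates school funding.</text></act>
</corpus>
"""


def theme_counts(root):
    """Count how often each theme label appears in the metadata."""
    return Counter(el.text.strip() for el in root.iter("theme") if el.text)


def bigram_counts(root):
    """Count adjacent word pairs, a simple proxy for collocations."""
    counts = Counter()
    for el in root.iter("text"):
        words = re.findall(r"\w+", (el.text or "").lower())
        counts.update(zip(words, words[1:]))
    return counts


root = ET.fromstring(SAMPLE.strip())
themes = theme_counts(root)
bigrams = bigram_counts(root)

print(themes.most_common())        # themes ordered by usage
print(bigrams[("legal", "act")])   # frequency of one word pair
```

Raw adjacent-pair counts are only the simplest possible measure; a real study would likely normalize by corpus size or use an association score, and would need lemmatization for Russian text.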
Pages: 26-44
Page count: 19