Towards Automatic Structuring and Semantic Indexing of Legal Documents

被引:7
|
作者
Koniaris, Marios [1 ]
Papastefanatos, George [2 ]
Vassiliou, Yannis [1 ]
机构
[1] Natl Tech Univ Athens, KDBS Lab, Sch ECE, Athens, Greece
[2] Athena Res Ctr, Inst Management Informat Syst, Maroussi, Greece
关键词
Legislation; legal text analysis; natural language processing; WEB;
D O I
10.1145/3003733.3003801
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Over the last years there has been a great increase on the number of freely available legal resources. Portals that allow users to search for legislation, using keywords are now a common place. However, in the vast majority of those portals, legal documents are not stored in a structured format with a rich set of meta data, but in presentation oriented manifestation, making impossible for the end users to inquiry semantics about the documents, such as date of enactment, date of repeal, jurisdiction, etc. or to reuse information and establish an interconnection with similar repositories. In this paper, we present an approach for extracting a machine readable semantic representation of legislation, from unstructured document formats. Our method exploits common formats of legal documents to identify blocks of structural and semantic information and models them according to a popular legal meta-schema. Our proposed method is highly extensible and achieves high accuracy for a variety of legal and para legal documents, especially legislation. Our evaluation results reveal that our methodology can be of great assistance for the automatic structuring and semantic indexing of legal resources.
引用
收藏
页数:6
相关论文
共 50 条
  • [31] Automatic Indexing of Journal Abstracts with Latent Semantic Analysis
    Adams, Joel Robert
    Bedrick, Steven
    [J]. EXPERIMENTAL IR MEETS MULTILINGUALITY, MULTIMODALITY, AND INTERACTION, 2015, 9283 : 200 - 208
  • [32] Automatic text summarization based on latent semantic indexing
    Ai, Dongmei
    Zheng, Yuchao
    Zhang, Dezheng
    [J]. ARTIFICIAL LIFE AND ROBOTICS, 2010, 15 (01) : 25 - 29
  • [33] Research of Automatic Indexing Based on Semantic and Statistic Feature
    Zhang Yue
    Lv Xueqiang
    Shi Shuicai
    Wang Hongwei
    [J]. 2009 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND COMPUTATIONAL INTELLIGENCE, VOL III, PROCEEDINGS, 2009, : 77 - 80
  • [34] Towards automatic semantic integration
    Lukacsy, Gergely
    Szeredi, Peter
    Benko, Tamas
    [J]. ENTERPRISE INTEROPERABILITY II: NEW CHALLENGES AND APPROACHES, 2007, : 795 - 806
  • [35] The automatic generation of hypertext links in legal documents
    Schweighofer, E
    Scheithauer, D
    [J]. DATABASE AND EXPERT SYSTEMS APPLICATIONS, 1996, 1134 : 889 - 898
  • [36] Automatic indexing of documents from journal descriptors: A preliminary investigation
    Humphrey, SM
    [J]. JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE, 1999, 50 (08): : 661 - 674
  • [37] Design and implementation of automatic indexing for information retrieval with Arabic documents
    Hmeidi, I
    Kanaan, G
    Evens, M
    [J]. JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE, 1997, 48 (10): : 867 - 881
  • [38] A combining approach to automatic keyphrases indexing for chinese news documents
    Wang, HF
    Li, SJ
    Yu, SW
    Kang, BK
    [J]. COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, 2004, 2945 : 441 - 444
  • [39] Automatic indexing of health documents in French: Evaluating and analysing errors
    Chebil, W.
    Soualmia, L. F.
    Dahamna, B.
    Darmoni, S. J.
    [J]. IRBM, 2012, 33 (5-6) : 316 - 329
  • [40] Automatic Indexing of Scanned Documents - a Layout-based Approach
    Esser, Daniel
    Schuster, Daniel
    Muthmann, Klemens
    Berger, Michael
    Schill, Alexander
    [J]. DOCUMENT RECOGNITION AND RETRIEVAL XIX, 2012, 8297