Interpretable Text Classification in Legal Contract Documents using Tsetlin Machines

被引:2
|
作者
Saha, Rupsa [1 ]
Jyhne, Sander [1 ]
机构
[1] Univ Agder, Dept IKT, Grimstad, Norway
关键词
Tsetlin machine; pattern recognition; interpretable AI; legal document analysis; NAMED ENTITY RECOGNITION; QUANTITATIVE-ANALYSIS;
D O I
10.1109/ISTM54910.2022.00011
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Legal text contains various challenges in automated processing, compounded by the lack of detailed resources available for them. However, the ability of process such texts automatically is highly sought after. In this paper we try to parse a set of contract documents and identify key legal terminologies present in them, with the help of four text processing methods from different backgrounds : Tsetlin Machines, BERT, CNN-BiLSTM and FastText. We show that the TM based approach works at par with other popular methods, with the added benefit of making available important clause literals that can act as specific linguistic cues to legal terminology.
引用
收藏
页码:7 / 12
页数:6
相关论文
共 50 条
  • [41] Using Automatic Features for Text-image Classification in Amharic Documents
    Belay, Birhanu
    Habtegebrial, Tewodros
    Belay, Gebeyehu
    Stricker, Didier
    ICPRAM: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS, 2020, : 440 - 445
  • [42] Automatic detection and analysis of DPP entities in legal contract documents
    Nayak, Shiva Prasad
    Pasumarthi, Suresh
    2019 FIRST INTERNATIONAL CONFERENCE ON DIGITAL DATA PROCESSING (DDP), 2019, : 70 - 75
  • [43] Toward text understanding - Classification of text documents by word map
    Visa, A
    Toivonen, J
    Back, B
    Vanharanta, H
    DATA MINING AND KNOWLEDGE DISCOVERY: THEORY, TOOLS, AND TECHNOLOGY II, 2000, 4057 : 299 - 305
  • [44] ProtoryNet - Interpretable Text Classification Via Prototype Trajectories
    Hong, Dat
    Wang, Tong
    Baek, Stephen
    JOURNAL OF MACHINE LEARNING RESEARCH, 2023, 24
  • [45] The Research of Text Preprocessing Effect on Text Documents Classification Efficiency
    Kurbatow, Andrew
    2015 INTERNATIONAL CONFERENCE "STABILITY AND CONTROL PROCESSES" IN MEMORY OF V.I. ZUBOV (SCP), 2015, : 653 - 655
  • [46] Topic Modeling for Interpretable Text Classification From EHRs
    Rijcken, Emil
    Kaymak, Uzay
    Scheepers, Floortje
    Mosteiro, Pablo
    Zervanou, Kalliopi
    Spruit, Marco
    FRONTIERS IN BIG DATA, 2022, 5
  • [47] Text and metadata extraction from scanned Arabic documents using support vector machines
    Qin, Wenda
    Elanwar, Randa
    Betke, Margrit
    JOURNAL OF INFORMATION SCIENCE, 2022, 48 (02) : 268 - 279
  • [48] REDRESS: Generating Compressed Models for Edge Inference Using Tsetlin Machines
    Maheshwari, Sidharth
    Rahman, Tousif
    Shafik, Rishad
    Yakovlev, Alex
    Rafiev, Ashur
    Jiao, Lei
    Granmo, Ole-Christoffer
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (09) : 11152 - 11168
  • [49] Text Message Authorship Classification Using Kernel Support Vector Machines
    Kretchmar, Matt
    Zhao, Yifu
    2014 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI), VOL 2, 2014, : 215 - 218
  • [50] Legal Holding Extraction from Italian Case Documents using Italian-LEGAL-BERT Text Summarization
    Licari, Daniele
    Bushipaka, Praveen
    Marino, Gabriele
    Comande, Giovanni
    Cucinotta, Tommaso
    PROCEEDINGS OF THE 19TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND LAW, ICAIL 2023, 2023, : 148 - 156