Interpretable Text Classification in Legal Contract Documents using Tsetlin Machines

被引:2
|
作者
Saha, Rupsa [1 ]
Jyhne, Sander [1 ]
机构
[1] Univ Agder, Dept IKT, Grimstad, Norway
关键词
Tsetlin machine; pattern recognition; interpretable AI; legal document analysis; NAMED ENTITY RECOGNITION; QUANTITATIVE-ANALYSIS;
D O I
10.1109/ISTM54910.2022.00011
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Legal text contains various challenges in automated processing, compounded by the lack of detailed resources available for them. However, the ability of process such texts automatically is highly sought after. In this paper we try to parse a set of contract documents and identify key legal terminologies present in them, with the help of four text processing methods from different backgrounds : Tsetlin Machines, BERT, CNN-BiLSTM and FastText. We show that the TM based approach works at par with other popular methods, with the added benefit of making available important clause literals that can act as specific linguistic cues to legal terminology.
引用
收藏
页码:7 / 12
页数:6
相关论文
共 50 条
  • [1] Word-level human interpretable scoring mechanism for novel text detection using Tsetlin Machines
    Bhattarai, Bimal
    Granmo, Ole-Christoffer
    Jiao, Lei
    APPLIED INTELLIGENCE, 2022, 52 (15) : 17465 - 17489
  • [2] Word-level human interpretable scoring mechanism for novel text detection using Tsetlin Machines
    Bimal Bhattarai
    Ole-Christoffer Granmo
    Lei Jiao
    Applied Intelligence, 2022, 52 : 17465 - 17489
  • [3] Tsetlin LOB: Realtime Regime Learning and Interpretable Prediction in Financial Limit Orderbooks using Convolutional Tsetlin Machines
    Blakely, Christian D.
    2022 INTERNATIONAL SYMPOSIUM ON THE TSETLIN MACHINE (ISTM 2022), 2022, : 13 - 20
  • [4] Classification of text documents
    Li, YH
    Jain, AK
    FOURTEENTH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1 AND 2, 1998, : 1295 - 1297
  • [5] Classification of text documents
    Li, YH
    Jain, AK
    COMPUTER JOURNAL, 1998, 41 (08): : 537 - 546
  • [6] Intrusion Detection with Interpretable Rules Generated Using the Tsetlin Machine
    Abeyrathna, K. Darshana
    Pussewalage, Harsha S. Gardiyawasam
    Ranasinghe, Sasanka N.
    Oleshchuk, Vladimir A.
    Granmo, Ole-Christoffer
    2020 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2020, : 1121 - 1130
  • [7] On Obtaining Classification Confidence, Ranked Predictions and AUC with Tsetlin Machines
    Abeyrathna, K. Darshana
    Granmo, Ole-Christoffer
    Goodwin, Morten
    2020 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2020, : 662 - 669
  • [8] Lexicon Induction for Interpretable Text Classification
    Clos, Jeremie
    Wiratunga, Nirmalie
    RESEARCH AND ADVANCED TECHNOLOGY FOR DIGITAL LIBRARIES (TPDL 2017), 2017, 10450 : 498 - 510
  • [9] Detection of Redacted Text in Legal Documents
    van Heusden, Ruben
    de Ruijter, Aron
    Majoor, Roderick
    Marx, Maarten
    LINKING THEORY AND PRACTICE OF DIGITAL LIBRARIES, TPDL 2023, 2023, 14241 : 310 - 316
  • [10] ConvTextTM: An Explainable Convolutional Tsetlin Machine Framework for Text Classification
    Bhattarai, Bimal
    Granmo, Ole-Christoffer
    Jiao, Lei
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 3761 - 3770