Interpretable Text Classification in Legal Contract Documents using Tsetlin Machines

Cited by: 2

Authors:

Saha, Rupsa [1]

Jyhne, Sander [1]

Affiliations:

[1] Univ Agder, Dept IKT, Grimstad, Norway

Source:

2022 INTERNATIONAL SYMPOSIUM ON THE TSETLIN MACHINE (ISTM 2022) | 2022

Keywords:

Tsetlin machine; pattern recognition; interpretable AI; legal document analysis; named entity recognition; quantitative analysis

DOI:

10.1109/ISTM54910.2022.00011

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Automated processing of legal text poses various challenges, compounded by the lack of detailed resources for the domain. However, the ability to process such texts automatically is highly sought after. In this paper, we attempt to parse a set of contract documents and identify the key legal terminology present in them, using four text processing methods from different backgrounds: Tsetlin Machines, BERT, CNN-BiLSTM and FastText. We show that the TM-based approach performs on par with the other popular methods, with the added benefit of exposing important clause literals that can act as specific linguistic cues to legal terminology.

Pages: 7-12

Page count: 6
