Fact-based similar case retrieval methods based on statutory knowledge

被引:0
|
作者
Li L. [1 ]
Wang D. [2 ]
Fan H. [2 ]
机构
[1] Guanghua Law School, Zhejiang University, Hangzhou
[2] School of Information Management for Law, China University of Political Science and Law, Beijing
关键词
deep learning; event logic graph; legal logic; similar case retrieval; statutory knowledge;
D O I
10.3785/j.issn.1008-973X.2024.07.005
中图分类号
学科分类号
摘要
Existing research on the retrieval task of similar cases ignores the legal logic that the model should imply, and cannot adapt to the requirements of case similarity criteria in practical applications. Few datasets in Chinese for case retrieval tasks are difficult to meet the research needs. A similar case retrieval model was proposed based on legal logic and strong interpretability, and a case event logic graph was constructed based on predicate verbs. The statutory knowledge corresponding to various crimes was integrated into the proposed model, and the extracted elements were input to a neural network-based scorer to realize the task of case retrieval accurately and efficiently. A Confusing-LeCaRD dataset was built for the case retrieval task with a confusing group of charges as the main retrieval causes. Experiments show that the normalized discounted cumulative gain of the proposed model on the LeCaRD dataset and Confusing-LeCaRD dataset was 90.95% and 94.64%, and the model was superior to TF-IDF, BM25 and BERT-PLI in all indicators. © 2024 Zhejiang University. All rights reserved.
引用
收藏
页码:1357 / 1365
页数:8
相关论文
共 23 条
  • [1] BHATTACHARYA P, GHOSH K, PAL A, Et al., Methods for computing legal document similarity: a comparative study
  • [2] WAGH R S, ANAND D., Legal document similarity: a multicriteria decision-making perspective, PeerJ Computer Science, 6, (2020)
  • [3] TRAN V, NGUYEN M L, SATOH K., Building legal case retrieval systems with lexical matching and summarization using a pre-trained phrase scoring model [C], Proceedings of the Seventeenth International Conference on Artificial Intelligence and Law, pp. 275-282, (2019)
  • [4] JIANG J Y, ZHANG M, LI C, Et al., Semantic text matching for long-form documents [C], The World Wide Web Conference, pp. 795-806, (2019)
  • [5] SHAO Y, MAO J, LIU Y, Et al., BERT-PLI: modeling paragraph-level interactions for legal case retrieval [C], International Joint Conference on Artificial Intelligence, pp. 3501-3507, (2020)
  • [6] ALI B, MORE R, PAWAR S, Et al., Prior case retrieval using evidence extraction from court judgements [C], The Fifth Workshop on Automated Semantic Analysis of Information in Legal Text, pp. 1-11, (2021)
  • [7] MA Y, SHAO Y, WU Y, Et al., LeCaRD: a legal case retrieval dataset for Chinese law system [C], Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 2342-2348, (2021)
  • [8] YOSHIOKA M, KANO Y, KIYOTA N, Et al., Overview of japanese statute law retrieval and entailment task at COLIEE-2018 [C], Proceedings of the Twelfth International Workshop on Juris-Informatics, pp. 117-128, (2018)
  • [9] ZHAO Jingsheng, SONG Mengxue, GAO Xiang, Et al., Research on text representation in natural language processing [J], Journal of Software, 33, 1, pp. 102-128, (2022)
  • [10] WEI L, ZHOU C, SU R, Et al., PEPred-Suite: improved and robust prediction of therapeutic peptides using adaptive feature representation learning [J], Bioinformatics, 35, 21, pp. 4272-4280, (2019)