Effective deep learning approaches for summarization of legal texts

被引:26
|
作者
Anand, Deepa [1 ]
Wagh, Rupali [2 ]
机构
[1] CMR Inst Technol, Bangalore 560037, Karnataka, India
[2] JAIN Deemed Univ, Bangalore 560004, Karnataka, India
关键词
Legal text summarization; Natural language processing; Deep learning; Sentence embeddings;
D O I
10.1016/j.jksuci.2019.11.015
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The availability of legal judgment documents in digital form offers numerous opportunities for information extraction and application. Automatic summarization of these legal texts is a crucial and a challenging task due to the unusual structure and high complexity of these documents. Previous approaches in this direction have relied on huge labelled datasets, using hand engineered features, leveraging on domain knowledge and focussed their attention on a narrow sub-domain for increased effectiveness. In this paper, we propose simple generic techniques using neural network for the summarization task for Indian legal judgment documents. We explore two neural network architectures for this task utilizing the word and sentence embeddings for capturing the semantics. The main advantage of the proposed approaches is that they do not rely on hand crafted features, or domain specific knowledge, nor is their application restricted to a particular sub-domain thus making them suitable to be extended to other domains as well. We tackle the problem of unavailability of labelled data for the task by assigning classes/scores to sentences in the training set, based on their match with reference summary produced by humans. The experimental evaluations establish the effectiveness of our proposed approaches as compared with other baselines. (C) 2019 The Authors. Published by Elsevier B.V. on behalf of King Saud University.
引用
收藏
页码:2141 / 2150
页数:10
相关论文
共 50 条
  • [1] RulingBR: A Summarization Dataset for Legal Texts
    Feijo, Diego de Vargas
    Moreira, Viviane Pereira
    COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE, PROPOR 2018, 2018, 11122 : 255 - 264
  • [2] A Survey of Text Summarization Approaches Based on Deep Learning
    Sheng-Luan Hou
    Xi-Kun Huang
    Chao-Qun Fei
    Shu-Han Zhang
    Yang-Yang Li
    Qi-Lin Sun
    Chuan-Qing Wang
    Journal of Computer Science and Technology, 2021, 36 : 633 - 663
  • [3] A Survey of Text Summarization Approaches Based on Deep Learning
    Hou, Sheng-Luan
    Huang, Xi-Kun
    Fei, Chao-Qun
    Zhang, Shu-Han
    Li, Yang-Yang
    Sun, Qi-Lin
    Wang, Chuan-Qing
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2021, 36 (03) : 633 - 663
  • [4] Deep Learning-Based Abstractive Summarization for Brazilian Portuguese Texts
    Palola, Pedro H.
    de Rose, Gustavo H.
    Papa, Joao P.
    INTELLIGENT SYSTEMS, PT II, 2022, 13654 : 479 - 493
  • [5] Revisiting Information Retrieval and Deep Learning Approaches for Code Summarization
    Zhu, Tingwei
    Li, Zhong
    Pan, Minxue
    Shi, Chaoxuan
    Zhang, Tian
    Pei, Yu
    Li, Xuandong
    2023 IEEE/ACM 45TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING: COMPANION PROCEEDINGS, ICSE-COMPANION, 2023, : 328 - 329
  • [6] Investigation of the Deep Learning Approaches to Classify Emotions in Texts
    Nazarenko, Dmytro
    Afanasieva, Iryna
    Goliana, Nataliia
    Golian, Vira
    COLINS 2021: COMPUTATIONAL LINGUISTICS AND INTELLIGENT SYSTEMS, VOL I, 2021, 2870
  • [7] Deep Is Better? An Empirical Comparison of Information Retrieval and Deep Learning Approaches to Code Summarization
    Zhu, Tingwei
    Li, Zhong
    Pan, Minxue
    Shi, Chaoxuan
    Zhang, Tian
    Pei, Yu
    Li, Xuandong
    ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, 2024, 33 (03)
  • [8] Abstractive Multi-document Summarization Using Deep Learning Approaches
    Poornima, Murkute
    Pulipati, Venkateswara Rao
    Kumar, T. Sunil
    PROCEEDINGS OF SECOND INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTER ENGINEERING AND COMMUNICATION SYSTEMS, ICACECS 2021, 2022, : 57 - 68
  • [9] Exploring Deep Learning Approaches to Recognize Handwritten Arabic Texts
    Eltay, Mohamed
    Zidouri, Abdelmalek
    Ahmad, Irfan
    IEEE ACCESS, 2020, 8 : 89882 - 89898
  • [10] A Comparison of Multiple Approaches for the Extractive Summarization of Portuguese Texts
    Costa, Miguel
    Martins, Bruno
    LINGUAMATICA, 2015, 7 (01): : 23 - 40